Roast topics
Find topics
Roast it!
Roast topics
Find topics
Find it!
Login
From:
Aidan Cooper
(Uncensored)
subscribe
PGN2FEN: A Benchmark for Evaluating LLM Chess Reasoning
https://www.aidancooper.co.uk/pgn2fen-benchmark/
links
backlinks
Tagged with:
llms
Introducing PGN2FEN — a benchmark for evaluating language models' ability to understand and transcribe chess game move sequences.