Login
Roast topics
Find topics
Find it!
From:
Aidan Cooper
(Uncensored)
subscribe
PGN2FEN: A Benchmark for Evaluating LLM Chess Reasoning
https://www.aidancooper.co.uk/pgn2fen-benchmark/
links
backlinks
Tagged with:
llms
Roast topics
Find topics
Roast it!
Introducing PGN2FEN — a benchmark for evaluating language models' ability to understand and transcribe chess game move sequences.