In the previous post I introduced the scaffolding for running a test on various LLMs where I give them several puzzles and prompt engineering hints to look at what helps them in reaching a solution, if ever. In this post, I’m going to present the problems and the scoring guideline for each problem. Read more...