In this article, we summarize the AI puzzle competition from my blog and answer two questions: which model is better, and which prompt engineering hint gives better results. The answers might surprise you, so give this a read :)| mihai.page
Before concluding the AI 2025 puzzle competition, I asked LLMs a simple common-sense question to see how they behave. They didn't perform that well.| mihai.page
It's finally here. I analyze QwQ and DeepSeek on the 3 math puzzles and finish the round of benchmarks I ran in January.| mihai.page
In this article, we look at 4 Llama models (via Perplexity) and see how they perform on the 3 puzzles in the competition.| mihai.page