The problems gave AI only a slim chance to show new capabilities| epochai.substack.com
It’s good at involved computations, improving at proofs, and useful for literature search. It still favors low-level grinds and leans on background knowledge.| Epoch AI
Deep Think utilizes extended, parallel thinking and novel reinforcement learning techniques for significantly improved problem-solving.| Google
MathArena: Evaluating LLMs on Uncontaminated Math Competitions| matharena.ai