It’s good at involved computations, improving at proofs, and useful for literature search. It still favors low-level grinds and leans on background knowledge.| Epoch AI
Reasoning models were as big of an improvement as the Transformer, at least on some benchmarks| epochai.substack.com
Available evidence suggests that rapid growth in reasoning training can continue for a year or so.| Epoch AI
We are launching the AI Benchmarking Hub: a platform presenting our evaluations of leading models on challenging benchmarks, with analysis of trends in AI capabilities.| Epoch AI