Stanford mathematician Ravi Vakil, president of the American Mathematical Society, expects AI’s impact on mathematics to come as a phase change, not a slow climb.| Epoch AI
57% of problems have been solved at least once| Epoch AI
No company has gone from $10B to $100B as fast as OpenAI projects it will| Epoch AI
A proof only 15 experts understand is less valuable than one any undergraduate can verify using a computer.| Epoch AI
OpenAI spent ~$7 billion on compute last year — most of this went to R&D| Epoch AI
GPT-5 Pro set a new record (13%), edging out Gemini 2.5 Deep Think by a single problem (not statistically significant). Grok 4 Heavy lags.| Epoch AI
We recently wrote that GPT-5 was likely trained on less compute than its predecessor. How did we reach this conclusion, and what do we actually know about how GPT-5 was trained?| Epoch AI
AI capabilities have been steadily improving across a wide range of skills, and show no sign of slowing down in the near term.| Epoch AI
We evaluated Gemini 2.5 Deep Think manually on FrontierMath as there is no API. The results: a new record!| Epoch AI
The new dashboard makes it easier to compare trends in performance across multiple benchmarks| Epoch AI
Sora 2 can solve questions from LLM benchmarks, despite being a video model.| Epoch AI
Greta Panova wrote a math problem so difficult that today’s most advanced AI models don’t know where to begin.| epochai.substack.com
When mathematicians make breakthroughs, they hallucinate too.| Epoch AI
OpenAI has the inference compute to deploy millions of digital workers, but only on a narrow set of tasks – for now.| Epoch AI
The value of economic theory in thinking about AGI, detecting whether the “economic singularity” is coming, and what’s wrong with existing work on explosive growth and the economics of AI| Epoch AI
New AI Companies Data Hub, AI in 2030 report, Grok 4’s training footprint, implications of long-context inference, two new podcast episodes, and more.| Epoch AI
Explore key data on frontier AI companies — revenue, funding, staff, usage rates and compute spend| Epoch AI
OpenAI focused on scaling post-training on a smaller model| Epoch AI
AI companies often grade their own homework. How do they compare to our own findings?| epochai.substack.com
Many multi-agent setups are based on fancy prompts, but this is unlikely to persist| epochai.substack.com
The problems gave AI only a slim chance to show new capabilities| epochai.substack.com
Reasoning models were as big an improvement as the Transformer, at least on some benchmarks| epochai.substack.com
AI's biggest impact will come from broad labor automation—not R&D—driving economic growth through scale, not scientific breakthroughs.| epochai.substack.com
Insights and analysis on AI trends, developments, and research from Epoch AI.| epochai.substack.com