Our database of benchmark results features the performance of leading AI models on challenging tasks. It includes results from benchmarks evaluated internally by Epoch AI as well as data collected from external sources. The dashboard tracks AI progress over time and correlates benchmark scores with key factors like compute or model accessibility.| Epoch AI
This Gradient Updates issue explores DeepSeek-R1’s architecture, training cost, and pricing, showing how it rivals OpenAI’s o1 at 30x lower cost.| Epoch AI
Microsoft continues to add to the conversation by unveiling its newest models, Phi-4-reasoning, Phi-4-reasoning-plus, and Phi-4-mini-reasoning.| Microsoft Azure Blog
Weekly commentary on AI news and developments from Epoch AI, a Substack publication with thousands of subscribers.| epochai.substack.com
We’re on a journey to advance and democratize artificial intelligence through open source and open science.| huggingface.co