It’s good at involved computations, improving at proofs, and useful for literature search. It still favors low-level grinds and leans on background knowledge.| Epoch AI
Plus, steganography and future superintelligences| importai.substack.com
A guest post by Walter Dean and Alberto Naibo| siliconreckoner.substack.com
Our director explains Epoch AI’s mission and how we decide our priorities. In short, we work on projects to understand the trajectory of AI, share this knowledge publicly, and inform important decisions about AI.| Epoch AI
In recent months, the CEOs of leading AI companies have grown increasingly confident about rapid progress: OpenAI's Sam Altman: Shifted from saying in November "the rate of progress continues" to declaring in January "we are now confident we know how to build AGI" Anthropic's Dario Amodei: Stated in January "I'm more confident than I've ever been that we're close to powerful capabilities... in the next 2-3 years" Google DeepMind's Demis Hassabis: Changed from "as soon as 10 years" in autumn t...| 80,000 Hours
And the role of evaluations in AI governance| www.hyperdimensional.co
Most people think of AI as a pattern-matching chatbot – good at writing emails, terrible at real thinking.| benjamintodd.substack.com
We clarify that OpenAI commissioned Epoch AI to produce 300 math questions for the FrontierMath benchmark. They own these and have access to the statements and solutions, except for a 50-question holdout set.| Epoch AI
A step change as influential as the release of GPT-4. Reasoning language models are the current big thing.| www.interconnects.ai
We are launching the AI Benchmarking Hub: a platform presenting our evaluations of leading models on challenging benchmarks, with analysis of trends in AI capabilities.| Epoch AI