How AI Could Transform the World for the Better| www.darioamodei.com
“The world is going to get a lot funnier, weirder, and quirkier.”| Dwarkesh Podcast
The economy will literally double every year afterwards| Dwarkesh Podcast
Misaligned hive minds, Xi and Trump waking up, and automated Ilyas accelerating AI progress| www.dwarkesh.com
Two of Gemini's co-leads on Google's path to AGI| www.dwarkesh.com
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.| www.anthropic.com
Seeing a world in a grain of sand| www.astralcodexten.com
Agentic AI connects to enterprise data and uses sophisticated reasoning and iterative planning to autonomously solve complex, multi-step problems.| NVIDIA Blog
We investigate the internal mechanisms used by Claude 3.5 Haiku — Anthropic's lightweight production model — in a variety of contexts, using our circuit tracing methodology.| Transformer Circuits
A paper from Anthropic's Alignment Science team on Alignment Faking in AI large language models| www.anthropic.com
Hypothetical point in time when technological growth becomes uncontrollable and irreversible| en.wikipedia.org
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.| www.anthropic.com
AGI by 2027 is strikingly plausible. GPT-2 to GPT-4 took us from ~preschooler to ~smart high-schooler abilities in 4 years. Tracing trendlines in compute (~0.5 orders of magnitude or OOMs/year), algorithmic efficiencies (~0.5 OOMs/year), and “unhobbling” gains (from chatbot to agent), we should expect another preschooler-to-high-schooler-sized qualitative jump by 2027. Look. The models, they just| SITUATIONAL AWARENESS
Introducing Claude 3.5 Sonnet—our most intelligent model yet. Sonnet now outperforms competitor models and Claude 3 Opus on key evaluations, at twice the speed.| www.anthropic.com
When we turn up the strength of the “Golden Gate Bridge” feature, Claude’s responses begin to focus on the Golden Gate Bridge. For a short time, we’re making this model available for everyone to interact with.| www.anthropic.com
Gemini 1.5 Pro brings big improvements to speed and efficiency, but one of its innovations is its long context window, which measures how many tokens that the model can …| Google
Mechanistic interpretability seeks to understand neural networks by breaking them into components that are more easily understood than the whole. By understanding the function of each component, and how they interact, we hope to be able to reason about the behavior of the entire network. The first step in that program is to identify the correct components to analyze. | transformer-circuits.pub
Software engineering toolkit| en.wikipedia.org