A paper from Anthropic describing persona vectors and their applications to monitoring and controlling model behavior| www.anthropic.com
How AI Could Transform the World for the Better| www.darioamodei.com
Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms| www.anthropic.com
In the decade that I have been working on AI, I’ve watched it grow from a tiny academic field to arguably the most important economic and geopolitical issue in the world. In all that time, perhaps the most important lesson I’ve learned is this: the progress of the underlying technology is inexorable, driven by forces too powerful to stop, but the way in which it happens—the order in which things are built, the applications we choose, and the details of how it is rolled out to society...| www.darioamodei.com
Industrial Strength Data Science and AI| rssdsaisection.substack.com
We should take AI models seriously, which means taking their evaluation seriously| Who is Nnamdi?
When we turn up the strength of the “Golden Gate Bridge” feature, Claude’s responses begin to focus on the Golden Gate Bridge. For a short time, we’re making this model available for everyone to interact with.| www.anthropic.com