Today we are publishing a significant update to our Responsible Scaling Policy (RSP), the risk governance framework we use to mitigate potential catastrophic risks from frontier AI systems.| www.anthropic.com
Today, we’re announcing Claude 3.7 Sonnet, our most intelligent model to date and the first hybrid reasoning model generally available on the market.| www.anthropic.com
A paper from Anthropic describing a new way to guard LLMs against jailbreaking| www.anthropic.com
A paper from Anthropic's Alignment Science team on Alignment Faking in AI large language models| www.anthropic.com
Explore the latest with the release of Gemini 2.0 Flash and new coding agents, now available for testing in Google AI Studio.| developers.googleblog.com