Login
From:
www.anthropic.com
(Uncensored)
subscribe
Simple probes can catch sleeper agents \ Anthropic
https://www.anthropic.com/research/probes-catch-sleeper-agents
links
backlinks
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
Roast topics
Find topics
Find it!