A paper from Anthropic's Alignment Science team on Alignment Faking in AI large language models | www.anthropic.com
How much do regular people and experts worry about AI risks? | PauseAI