I shared a controversial take the other day at an event and I decided to write it down in a longer format: I’m afraid AI won't give us a compressed 21st century.| thomwolf.io
A paper from Anthropic's Alignment Science team on Alignment Faking in AI large language models| www.anthropic.com