AI Control: Improving Safety Despite Intentional Subversion
https://www.lesswrong.com/posts/d9FJHawgkiMSPjagR/ai-control-improving-safety-despite-intentional-subversion
We’ve released a paper, AI Control: Improving Safety Despite Intentional Subversion. This paper explores techniques that prevent AI catastrophes even…