Tl;dr We want to be able to supervise models with superhuman knowledge of the world and how to manipulate it. For this we need an overseer to be able…| www.alignmentforum.org
How to scale alignment techniques to hard tasks| aligned.substack.com
My attempt at clarifying a confusing topic| aligned.substack.com