0. CAST: Corrigibility as Singular Target — AI Alignment Forum
https://www.alignmentforum.org/posts/NQK8KHSrZRF5erTba/0-cast-corrigibility-as-singular-target-1
An agent is corrigible when it robustly acts opposite to the "be careful what you wish for" trope: it cautiously reflects on itself as a flawed tool and focuses on empowering the principal to fix its flaws and mistakes.