Roast topics
Find topics
Roast it!
Roast topics
Find topics
Find it!
Login
From:
www.alignmentforum.org
(Uncensored)
subscribe
0. CAST: Corrigibility as Singular Target — AI Alignment Forum
https://www.alignmentforum.org/posts/NQK8KHSrZRF5erTba/0-cast-corrigibility-as-singular-target-1
links
backlinks
An agent is corrigible when it robustly acts opposite of the trope of "be careful what you wish for" by cautiously reflecting on itself as a flawed tool and focusing on empowering the principal to fix its flaws and mistakes.