Topic: How RLHF Works (And How Things May Go Wrong)