Topic: How Reinforcement Learning from AI Feedback works