Topic: Teaching Large Language Models to Reason with Reinforcement Learning with Alex Havrilla