Topic: Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff