Login
From:
SemiAnalysis
(Uncensored)
subscribe
Scaling Reinforcement Learning: Environments, Reward Hacking, Agents, Scaling Data
https://newsletter.semianalysis.com/p/scaling-reinforcement-learning-environments-reward-hacking-agents-scaling-data
links
backlinks
Infrastructure Bottlenecks and Changes, Distillation, Data is a Moat, Recursive Self Improvement, o4 and o5 RL Training, China Accelerator Production
Roast topics
Find topics
Roast it!
Roast topics
Find topics
Find it!
Roast topics
Find topics
Find it!