Authored by Biao He Zilin Zhu Ji Li 1. What is slime? slime is a LLM post-training framework aiming for RL Scaling, it was designed to be: Versatile – with a fully customizable rollout interface and flexible training setups (colocated or decoupled, synchronous or asynchronous, RL or SFT cold start). Performant - integrating SGLang for inference and Megatron-LM for training, natively. Maintainable - with a lightweight codebase and smooth transition from Megatron pretraining to SGLang dep...