Topic: MiniMax-M1 Hybrid Architecture Meets vLLM: Long Context, Fast Inference