Minimax M2.5: Scaling RL for Industrial-Grade Agentic AI
Paper • Feb 16, 2026 • arXiv • MiniMax Research Team
Training agents for industrial-scale deployment requires extreme stability and data throughput. We present Minimax M2.5, a model trained using a novel asynchronous RL architecture designed to proce...