The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

Aonian Li; Baichuan Zhou; Bangwei Gong; Binyang Jiang; Boji Dan; Changqing Yu; Chao Wang; Chengjun Xiao; Cheng Ma; Chengyi Yang

arxiv: 2605.26494 · v1 · pith:DWUOSFPWnew · submitted 2026-05-26 · 💻 cs.AI · cs.CL· cs.LG

The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

MiniMax: Aili Chen , Aonian Li , Baichuan Zhou , Bangwei Gong , Binyang Jiang , Boji Dan , Changqing Yu , Chao Wang

show 197 more authors

Cheng Ma Cheng Zhong Cheng Zhu Chengjun Xiao Chengyi Yang Chengyu Du Chenyang Zhang Chi Zhang Chuangyi Huang Chunhao Zhang Chunhui Du Chunyu Zhao Congchao Guo Da Chen Deming Ding Dianjun Sun Dongyu Zhang Enhui Yang Fei Yu Guang Zheng Guodong Zheng Guohong Li Haichao Zhu Haigang Zhou Haimo Zhang Han Ding Hao Zhang Haohai Sun Haolin Lyu Haonan Lu Haoyu Wang Huajie Shi Huiyang Li Jiacheng Chen Jian Zhang Jiaqi Zhuang Jiaren Cai Jiaxin Pan Jiayao Li Jiayuan Song Jichuan Zhang Jie Wang Jihao Gu Jin Zhu Jingwei Dong Jingyang Li Jingyu Zhang Jingze Zhuang Jinhao Tian Jinli Liu Jinyi Hu Jun Tao Jun Zhang Junbin Ruan Junhao Xu Junjie Yan Junteng Liu Junxian He Kang Xu Ke Ji Ke Yang Kecheng Xiao Keyu Duan Keyu Li Le Han Letian Ruan Li Yuan Lianfei Yu Liheng Feng Lijie Mo Lin Li Lingye Bao Lingyu Yang Lingyuan Zhou Loki Lu Chen Lunbin Ceng Ming Li Ming Zhong Mingliang Tao Mingyuan Chi Mujie Lin Nan Hu Ningxin Chen Peiyin Zhu Peng Gao Pengcheng Gao Pengfei Li Penglin Li Pengyu Zhao Qibin Ren Qidi Xu Qihan Ren Qile Li Qin Wang Quanliang Chen Qunhong Ceng Rong Tian Rui Dong Ruitao Leng Ruize Zhang Shanqi Liu Shaoyu Chen Sheng Jia Shun Yao Shuoran Zhao Shuqi Yu Sichen Li Sicheng Pan Songquan Zhu Tengfei Li Tian Xie Tiancheng Qin Tianrun Liang Wei Liu Weiqi Xu Weitao Li Weixiang Chen Weiyu Cheng Weiyu Zhang Wenhu Chen Wenqian Zhao Xiancai Chen Xiangjun Song Xiangyuan Wang Xiao Luo Xiao Su Xiaobo Li Xiaodong Han Xiaojie Wu Xihao Song Xingyi Han Xinyu Guan Xuan Lu Xun Zou Xunhao Lai Xutong Li Yan Gong Yang Wang Yang Xu Yangsen Wang Ye Tang Yicheng Chen Yinran Qiu Yiqi Shi Yiting Guo Yiwen Huang Yixuan Wang Yongyi Hu Yu Gao Yu Zhang Yuanxiang Ying Yuanzhen Zhang Yubo Wang Yuchen Song Yufeng Yang Yuhang Meng Yuhang Miao Yuhao Li Yujie Liu Yulin Hu Yunan Huang Yunji Li Yunyi Huang Yusen Zhang Yusu Hong Yutao Xie Yutong Zhang Yuwen Liao Yuxuan Shi Yuze Wenren Zebin Li Zehan Li Zejian Luo Zeyu Jin Zeyuan Sun Zhanpeng Zhou Zhaochen Su Zhendong Li Zhengmao Zhu Zhengyuan Peng Zhenhua Fan Zhi Zhang Zhichao Xu Zhiheng Lv Zhikang Xu Zhitao He Zhiwei He Zhongyuan Li Zibo Gao Zijia Wu Zijian Song Zijian Zhou Zijun Sun Zishan Huang Ziying Chen Ziyue Ge

This is my paper

classification 💻 cs.AI cs.CLcs.LG

keywords agenticseriesacrossactivationscodingintelligenceminiminimax-m2

0 comments

read the original abstract

We introduce the MiniMax-M2 series, a family of Mixture-of-Experts language models built around the principle that mini activations can unleash maximum real-world intelligence. The flagship M2 contains 229.9B total parameters with only 9.8B activated per token. Designed end-to-end for agentic deployment, the M2 series rests on three components: (i) agent-driven data pipelines producing large-scale, verifiable trajectories across agentic coding and agentic cowork, each grounded in an executable workspace and an artifact-aligned reward; (ii) Forge, a scalable agent-native RL system that adapts to long-horizon agent trajectories, paired with windowed-FIFO scheduling, prefix-tree merging, inference optimization, and a clean training-inference-agent decoupling that supports both white-box and black-box agents; (iii) the latest M2.7 checkpoint takes an early step toward self-evolution -- autonomously debugging training runs and modifying its own scaffold. Across M2 through M2.7, this combination translates a mini-activation footprint into frontier-tier performance on agentic coding, deep search, office-task, and reasoning benchmarks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

CLI-Universe: Towards Verifiable Task Synthesis Engine for Terminal Agents
cs.AI 2026-06 unverdicted novelty 7.0

CLI-Universe synthesizes a verified 6K dataset of terminal-agent tasks that, when used to fine-tune Qwen3-32B, reaches 33.4% on Terminal-Bench 2.0 and sets a new open-source SOTA for models at or below 32B parameters.
OCELOT: Inference-Leakage Budgets for Privacy-Preserving LLM Agents
cs.CR 2026-06 unverdicted novelty 7.0

OCELOT recasts agent privacy as posterior-risk control and implements Witness-Verified Declassification to authorize the least-disclosing useful release under a sink-trust-weighted min-entropy budget.
AutoMedBench: Towards Medical AutoResearch with Agentic AI Models
cs.AI 2026-06 conditional novelty 7.0

AutoMedBench evaluates AI agents on long-horizon medical workflows across five stages and finds validation and submission as dominant failure points based on thousands of runs.
INCARBench: A Benchmark for Scientific Configuration in VASP INCAR by Large Language Models
cond-mat.mtrl-sci 2026-06 unverdicted novelty 6.0

INCARBench evaluates 19 LLMs on VASP INCAR configuration generation and repair, showing high semantic accuracy but lower scientific correctness especially for DFT+U, magnetism, and correlated materials.
Vortex: Efficient and Programmable Sparse Attention Serving for AI Agents
cs.AI 2026-06 unverdicted novelty 6.0

Vortex provides a programmable frontend and backend for sparse attention in LLM serving, delivering up to 3.46x throughput over full attention while preserving accuracy.
Token-Operations-Oriented Inference Optimization Techniques for Large Models
cs.SE 2026-06 unverdicted novelty 3.0

The paper introduces a four-layer technical architecture for token-operations-oriented inference optimization in large models and reviews key technologies and industry status at each layer.