Kimi K2: Open Agentic Intelligence

  • 2025-07-28 05:35:43
  • Kimi Team, Yifan Bai, Yiping Bao, Guanduo Chen, Jiahao Chen, Ningxin Chen, Ruijue Chen, Yanru Chen, Yuankun Chen, Yutian Chen, Zhuofu Chen, Jialei Cui, Hao Ding, Mengnan Dong, Angang Du, Chenzhuang Du, Dikang Du, Yulun Du, Yu Fan, Yichen Feng, Kelin Fu, Bofei Gao, Hongcheng Gao, Peizhong Gao, Tong Gao, Xinran Gu, Longyu Guan, Haiqing Guo, Jianhang Guo, Hao Hu, Xiaoru Hao, Tianhong He, Weiran He, Wenyang He, Chao Hong, Yangyang Hu, Zhenxing Hu, Weixiao Huang, Zhiqi Huang, Zihao Huang, Tao Jiang, Zhejun Jiang, Xinyi Jin, Yongsheng Kang, Guokun Lai, Cheng Li, Fang Li, Haoyang Li, Ming Li, Wentao Li, Yanhao Li, Yiwei Li, Zhaowei Li, Zheming Li, Hongzhan Lin, Xiaohan Lin, Zongyu Lin, Chengyin Liu, Chenyu Liu, Hongzhang Liu, Jingyuan Liu, Junqi Liu, Liang Liu, Shaowei Liu, T. Y. Liu, Tianwei Liu
  • 0

Abstract

We introduce Kimi K2, a Mixture-of-Experts (MoE) large language model with 32billion activated parameters and 1 trillion total parameters. We propose theMuonClip optimizer, which improves upon Muon with a novel QK-clip technique toaddress training instability while enjoying the advanced token efficiency ofMuon. Based on MuonClip, K2 was pre-trained on 15.5 trillion tokens with zeroloss spike. During post-training, K2 undergoes a multi-stage post-trainingprocess, highlighted by a large-scale agentic data synthesis pipeline and ajoint reinforcement learning (RL) stage, where the model improves itscapabilities through interactions with real and synthetic environments. Kimi K2 achieves state-of-the-art performance among open-source non-thinkingmodels, with strengths in agentic capabilities. Notably, K2 obtains 66.1 onTau2-Bench, 76.5 on ACEBench (En), 65.8 on SWE-Bench Verified, and 47.3 onSWE-Bench Multilingual -- surpassing most open and closed-sourced baselines innon-thinking settings. It also exhibits strong capabilities in coding,mathematics, and reasoning tasks, with a score of 53.7 on LiveCodeBench v6,49.5 on AIME 2025, 75.1 on GPQA-Diamond, and 27.1 on OJBench, all withoutextended thinking. These results position Kimi K2 as one of the most capableopen-source large language models to date, particularly in software engineeringand agentic tasks. We release our base and post-trained model checkpoints tofacilitate future research and applications of agentic intelligence.

 

Quick Read (beta)

loading the full paper ...