Abstract
We introduce Kimi K2, a Mixture-of-Experts (MoE) large language model with 32 billion activated parameters and 1 trillion total parameters. We propose the MuonClip optimizer, which improves upon Muon with a novel QK-clip technique to address training instability while retaining Muon's advanced token efficiency. Using MuonClip, K2 was pre-trained on 15.5 trillion tokens with zero loss spikes. K2 then undergoes a multi-stage post-training process, highlighted by a large-scale agentic data synthesis pipeline and a joint reinforcement learning (RL) stage, in which the model improves its capabilities through interactions with real and synthetic environments. Kimi K2 achieves state-of-the-art performance among open-source non-thinking models, with strengths in agentic capabilities. Notably, K2 obtains 66.1 on Tau2-Bench, 76.5 on ACEBench (En), 65.8 on SWE-Bench Verified, and 47.3 on SWE-Bench Multilingual, surpassing most open- and closed-source baselines in non-thinking settings. It also exhibits strong capabilities in coding, mathematics, and reasoning tasks, scoring 53.7 on LiveCodeBench v6, 49.5 on AIME 2025, 75.1 on GPQA-Diamond, and 27.1 on OJBench, all without extended thinking. These results position Kimi K2 as one of the most capable open-source large language models to date, particularly in software engineering and agentic tasks. We release our base and post-trained model checkpoints to facilitate future research and applications of agentic intelligence.