[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"article-xiaomi-mimo-v2-pro-1t-moe-agents-en":3,"article-related-xiaomi-mimo-v2-pro-1t-moe-agents-en":24,"series-model-release-f063d8d1-41d1-4de4-8ebc-6c40511b9369":78},{"id":4,"slug":5,"title":6,"content":7,"summary":8,"source":9,"source_url":10,"author":11,"image_url":11,"cover_image":12,"category":13,"language":14,"translated_content":11,"related_article_id":15,"keywords":16,"key_takeaways":11,"views":22,"created_at":23,"published_at":23,"topic_cluster_id":11},"f063d8d1-41d1-4de4-8ebc-6c40511b9369","xiaomi-mimo-v2-pro-1t-moe-agents-en","Xiaomi MiMo-V2-Pro: 1T MoE Model for Agents","\u003Cp>Xiaomi’s \u003Ca href=\"https:\u002F\u002Fawesomeagents.ai\u002Fmodels\u002Fmimo-v2-pro\u002F\" target=\"_blank\" rel=\"noopener\">MiMo-V2-Pro\u003C\u002Fa> arrived with a number that gets attention fast: over 1 trillion total parameters, with 42 billion active on each token. It also brings a 1 million token extended context window and pricing that starts at $1 per million input tokens.\u003C\u002Fp>\u003Cp>That combination matters because the model is aimed at agentic coding, where cost, latency, and long context all hit the budget at the same time. On \u003Ca href=\"https:\u002F\u002Fwww.swebench.com\u002F\" target=\"_blank\" rel=\"noopener\">SWE-bench Verified\u003C\u002Fa>, Xiaomi says MiMo-V2-Pro scores 78.0%, which puts it close to \u003Ca href=\"https:\u002F\u002Fwww.anthropic.com\u002Fclaude-sonnet\" target=\"_blank\" rel=\"noopener\">Claude Sonnet 4.6\u003C\u002Fa> at 79.6% and a bit behind \u003Ca href=\"https:\u002F\u002Fwww.anthropic.com\u002Fclaude-opus\" target=\"_blank\" rel=\"noopener\">Claude Opus 4.6\u003C\u002Fa> at 80.8%.\u003C\u002Fp>\u003Ch2>What Xiaomi actually shipped\u003C\u002Fh2>\u003Cp>MiMo-V2-Pro is the flagship text model in Xiaomi’s second-generation MiMo family. It is built as a Mixture-of-Experts system, which means the model has a very large total parameter pool but activates only a smaller slice for each token. In this case, that active slice is 42B, up from 15B active parameters in the smaller MiMo-V2-Flash.\u003C\u002Fp>\u003Cp>That design choice is the whole trick. Xiaomi gets a model with huge capacity for hard reasoning and tool use, while keeping per-token inference more manageable than a dense trillion-parameter model would be. The company also added a 7:1 hybrid attention pattern and a lightweight Multi-Token Prediction layer, both aimed at faster agent loops.\u003C\u002Fp>\u003Cp>For developers, the practical details matter more than the architecture diagram. MiMo-V2-Pro is API-only, so there are no public weights to download. If you want to call it directly, Xiaomi points developers to \u003Ca href=\"https:\u002F\u002Fplatform.xiaomimimo.com\" target=\"_blank\" rel=\"noopener\">platform.xiaomimimo.com\u003C\u002Fa> and its OpenAI-compatible endpoint at \u003Ca href=\"https:\u002F\u002Fapi.xiaomimimo.com\u002Fv1\" target=\"_blank\" rel=\"noopener\">api.xiaomimimo.com\u002Fv1\u003C\u002Fa>. It is also listed on \u003Ca href=\"https:\u002F\u002Fopenrouter.ai\u002Fxiaomi\u002Fmimo-v2-pro\" target=\"_blank\" rel=\"noopener\">OpenRouter\u003C\u002Fa>.\u003C\u002Fp>\u003Cul>\u003Cli>Over 1 trillion total parameters\u003C\u002Fli>\u003Cli>42B active parameters per token\u003C\u002Fli>\u003Cli>256K standard context, 1M extended context\u003C\u002Fli>\u003Cli>131,072-token maximum completion\u003C\u002Fli>\u003Cli>$1 input \u002F $3 output per million tokens at standard context\u003C\u002Fli>\u003Cli>$2 input \u002F $6 output per million tokens at 256K to 1M context\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>Why the Hunter Alpha mystery mattered\u003C\u002Fh2>\u003Cp>Before Xiaomi revealed the model, the AI community had already been arguing about an anonymous OpenRouter model called Hunter Alpha. It appeared around March 11, 2026, then started chewing through huge volumes of traffic. The mystery model reportedly handled roughly 500 billion tokens per week, which is the kind of usage that makes people assume something very large is hiding underneath.\u003C\u002Fp>\u003Cp>The guess that spread fastest was DeepSeek V4, partly because Xiaomi’s lead researcher, Luo Fuli, had previously worked at DeepSeek. That made the speculation feel plausible. Once Xiaomi confirmed that Hunter Alpha was actually MiMo-V2-Pro on March 18, the story changed from “what is this?” to “how did Xiaomi get this close on price and coding performance?”\u003C\u002Fp>\u003Cp>There is a direct quote from the model itself that captures the confusion. When asked who built it, Hunter Alpha replied: “I am a Chinese AI model primarily trained in Chinese.” That answer was vague, but it was enough to keep the mystery alive for another week.\u003C\u002Fp>\u003Cblockquote>“I am a Chinese AI model primarily trained in Chinese.”\u003C\u002Fblockquote>\u003Cp>Xiaomi also used the launch to give developers a reason to test the model immediately. Teams working with \u003Ca href=\"https:\u002F\u002Fcline.bot\u002F\" target=\"_blank\" rel=\"noopener\">Cline\u003C\u002Fa>, \u003Ca href=\"https:\u002F\u002Fwww.blackbox.ai\u002F\" target=\"_blank\" rel=\"noopener\">Blackbox\u003C\u002Fa>, \u003Ca href=\"https:\u002F\u002Fwww.kilocode.ai\u002F\" target=\"_blank\" rel=\"noopener\">KiloCode\u003C\u002Fa>, \u003Ca href=\"https:\u002F\u002Fwww.openclaw.ai\u002F\" target=\"_blank\" rel=\"noopener\">OpenClaw\u003C\u002Fa>, and \u003Ca href=\"https:\u002F\u002Fwww.opencode.ai\u002F\" target=\"_blank\" rel=\"noopener\">OpenCode\u003C\u002Fa> got free API access during launch week. That is a smart move, because agentic models improve fastest when real developers hit them with messy repos and broken tool calls.\u003C\u002Fp>\u003Ch2>Benchmarks: close to Sonnet, far cheaper\u003C\u002Fh2>\u003Cp>The cleanest way to judge MiMo-V2-Pro is to compare it against the models developers already know. On coding benchmarks, it lands very close to Anthropic’s best general-purpose coding models while undercutting them hard on price.\u003C\u002Fp>\u003Cp>On SWE-bench Verified, Xiaomi reports 78.0% for MiMo-V2-Pro, compared with 79.6% for Claude Sonnet 4.6 and 80.8% for Claude Opus 4.6. That is a small gap in raw score, but the pricing spread is much larger. At standard context, MiMo-V2-Pro costs $1\u002F$3 per million tokens, while Sonnet 4.6 is priced at $3\u002F$15 and Opus 4.6 at $5\u002F$25.\u003C\u002Fp>\u003Cp>On agentic tasks, Xiaomi’s ClawEval score is 61.5. That matters because ClawEval measures multi-turn tool use, recovery from errors, and long-horizon planning, which is where coding agents usually break down. Xiaomi’s number puts MiMo-V2-Pro above GPT-5.2’s reported 50.0 on that benchmark and behind Opus 4.6 at 66.3.\u003C\u002Fp>\u003Cul>\u003Cli>SWE-bench Verified: MiMo-V2-Pro 78.0%, Sonnet 4.6 79.6%, Opus 4.6 80.8%\u003C\u002Fli>\u003Cli>ClawEval: MiMo-V2-Pro 61.5, GPT-5.2 50.0, Opus 4.6 66.3\u003C\u002Fli>\u003Cli>VentureBeat’s benchmark cost total: $348 for MiMo-V2-Pro, $2,304 for GPT-5.2, $2,486 for Claude Opus 4.6\u003C\u002Fli>\u003Cli>Terminal-Bench 2.0: 86.7 for MiMo-V2-Pro\u003C\u002Fli>\u003Cli>GPQA Diamond: 87% for MiMo-V2-Pro\u003C\u002Fli>\u003C\u002Ful>\u003Cp>The cost math may be the most important line in the whole launch. VentureBeat reported a total benchmark bill of $348 for MiMo-V2-Pro, versus $2,304 for GPT-5.2 and $2,486 for Claude Opus 4.6. If those numbers hold up in real workloads, procurement teams will care more than they care about a 1.6-point SWE-bench gap.\u003C\u002Fp>\u003Ch2>Where MiMo-V2-Pro fits in a real stack\u003C\u002Fh2>\u003Cp>MiMo-V2-Pro is not Xiaomi’s only model. The company launched three at once, and each one targets a different buyer. That matters because the Pro tier is the expensive, closed option, while the other two models give developers different tradeoffs.\u003C\u002Fp>\u003Cp>\u003Ca href=\"https:\u002F\u002Fhuggingface.co\u002FXiaomiMiMo\" target=\"_blank\" rel=\"noopener\">MiMo-V2-Flash\u003C\u002Fa> is the self-hostable one. It has 310B total parameters, 15B active parameters, and a MIT license on Hugging Face. Xiaomi also released \u003Ca href=\"https:\u002F\u002Fhuggingface.co\u002FXiaomiMiMo\" target=\"_blank\" rel=\"noopener\">MiMo-V2-Omni\u003C\u002Fa>, a multimodal model for text, image, video, and audio. Xiaomi says Omni can process 10+ hours of continuous audio in a single request and costs $0.40 input \u002F $2.00 output per million tokens.\u003C\u002Fp>\u003Cp>That gives teams a fairly clear split. If you need local control, Flash is the obvious candidate. If you need multimodal inputs, Omni is the one to test. If you want the strongest agentic coding performance Xiaomi has right now, Pro is the model to benchmark first.\u003C\u002Fp>\u003Cp>There is one catch: MiMo-V2-Pro still has a few unknowns. Xiaomi has not published public weights, exact total parameter counts, or a full apples-to-apples benchmark slate across every major knowledge test. It also has no multimodal input, so teams doing document understanding or media workflows will need a different model.\u003C\u002Fp>\u003Cp>My read is simple: Xiaomi is trying to buy developer trust with price and performance, then keep the enterprise upside for later. If MiMo-V2-Pro keeps its current SWE-bench and agentic numbers under heavy real-world use, expect more teams to route coding agents through it as the default high-volume model and reserve pricier systems for edge cases.\u003C\u002Fp>\u003Cp>For now, the most useful question is not whether MiMo-V2-Pro beats every rival. It is whether your own agent stack can save enough money by switching a chunk of coding traffic to Xiaomi without losing reliability. That is a test worth running this quarter, not next year.\u003C\u002Fp>","Xiaomi’s MiMo-V2-Pro packs 1T parameters, 42B active, and 1M context, with SWE-bench results close to Claude Sonnet 4.6.","awesomeagents.ai","https:\u002F\u002Fawesomeagents.ai\u002Fmodels\u002Fmimo-v2-pro\u002F",null,"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1774619185536-iewn.png","model-release","en","9e1044b4-946d-47fe-9e2a-c2ee032e1164",[17,18,19,20,21],"Xiaomi MiMo-V2-Pro","agentic coding","MoE model","SWE-bench","OpenRouter",10,"2026-03-28T03:06:19.238032+00:00",{"tags":25,"relatedLang":37,"relatedPosts":41},[26,28,30,33,35],{"name":21,"slug":27},"openrouter",{"name":17,"slug":29},"xiaomi-mimo-v2-pro",{"name":31,"slug":32},"SWE-Bench","swe-bench",{"name":18,"slug":34},"agentic-coding",{"name":19,"slug":36},"moe-model",{"id":15,"slug":38,"title":39,"language":40},"xiaomi-mimo-v2-pro-1t-moe-agents-zh","小米 MiMo-V2-Pro 登場：1T MoE 模型","zh",[42,48,54,60,66,72],{"id":43,"slug":44,"title":45,"cover_image":46,"image_url":46,"created_at":47,"category":13},"58aa41ca-2c5f-44c6-ab07-2002473e95b1","gemini-1-5-pro-002-flash-002-2-0-flash-update-en","Gemini 1.5 Pro-002, Flash-002 and 2.0 Flash update Google AI","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780999383257-jccn.png","2026-06-09T10:02:28.362637+00:00",{"id":49,"slug":50,"title":51,"cover_image":52,"image_url":52,"created_at":53,"category":13},"435fc551-a461-444a-bf95-dbf5685cfac0","minimax-m3-open-weight-coding-win-en","MiniMax M3 Proves Open-Weight Can Still Win on Coding","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780968781159-odhi.png","2026-06-09T01:32:31.256895+00:00",{"id":55,"slug":56,"title":57,"cover_image":58,"image_url":58,"created_at":59,"category":13},"12af5a0d-1bbf-4a50-a391-b53f8003f234","gemini-35-flash-pricing-benchmarks-en","Gemini 3.5 Flash Pricing, Context, Benchmarks","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780840981235-e7hm.png","2026-06-07T14:02:30.280485+00:00",{"id":61,"slug":62,"title":63,"cover_image":64,"image_url":64,"created_at":65,"category":13},"0e767e9d-5d17-4cd0-b6ee-0328f89eb49b","gemma-4-12b-specs-benchmarks-run-locally-en","Gemma 4 12B: Specs, Benchmarks & How to Run It Locally","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780777984661-5ymr.png","2026-06-06T20:32:25.294996+00:00",{"id":67,"slug":68,"title":69,"cover_image":70,"image_url":70,"created_at":71,"category":13},"9d15f962-739d-44f8-a7f9-11bca64d38e0","best-kimi-models-2026-k2-5-vs-k2-thinking-en","Best Kimi Models in 2026: K2.5 vs K2 Thinking","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780770786284-shy0.png","2026-06-06T18:32:39.779504+00:00",{"id":73,"slug":74,"title":75,"cover_image":76,"image_url":76,"created_at":77,"category":13},"34547376-5d6b-4453-8d80-8072d8ac36ed","kimi-k2-6-open-source-coding-agent-swarm-en","Kimi K2.6 adds open-source coding and agent swarm","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1780761781526-wop4.png","2026-06-06T16:02:22.26883+00:00",[79,84,89,94,99,104,109,114,119,120],{"id":80,"slug":81,"title":82,"created_at":83},"d4cffde7-9b50-4cc7-bb68-8bc9e3b15477","nvidia-rubin-ai-supercomputer-en","NVIDIA Unveils Rubin: A Leap in AI Supercomputing","2026-03-25T16:24:35.155565+00:00",{"id":85,"slug":86,"title":87,"created_at":88},"eab919b9-fbac-4048-89fc-afad6749ccef","google-gemini-ai-innovations-2026-en","Google's AI Leap with Gemini Innovations in 2026","2026-03-25T16:27:18.841838+00:00",{"id":90,"slug":91,"title":92,"created_at":93},"5f5cfc67-3384-4816-a8f6-19e44d90113d","gap-google-gemini-ai-checkout-en","Gap Teams Up with Google Gemini for AI-Driven Checkout","2026-03-25T16:27:46.483272+00:00",{"id":95,"slug":96,"title":97,"created_at":98},"f6d04567-47f6-49ec-804c-52e61ab91225","ai-model-release-wave-march-2026-en","Navigating the AI Model Release Wave of March 2026","2026-03-25T16:28:45.409716+00:00",{"id":100,"slug":101,"title":102,"created_at":103},"895c150c-569e-4fdf-939d-dade785c990e","small-language-models-transform-ai-en","Small Language Models: Llama 3.2 and Phi-3 Transform AI","2026-03-25T16:30:26.688313+00:00",{"id":105,"slug":106,"title":107,"created_at":108},"38eb1d26-d961-4fd3-ae12-9c4089680f5f","midjourney-v8-alpha-features-pricing-en","Midjourney V8 Alpha: A Deep Dive into Its Features and Pricing","2026-03-26T01:25:36.387587+00:00",{"id":110,"slug":111,"title":112,"created_at":113},"bf36bb9e-3444-4fb8-ab19-0df6bc9d8271","rag-2026-indispensable-ai-bridge-en","RAG in 2026: The Indispensable AI Bridge","2026-03-26T01:28:34.472046+00:00",{"id":115,"slug":116,"title":117,"created_at":118},"60881d6d-2310-44ef-b1fb-7f98e9dd2f0e","xiaomi-mimo-trio-agents-robots-voice-en","Xiaomi’s MiMo trio targets agents, robots, and voice","2026-03-28T03:05:08.899895+00:00",{"id":4,"slug":5,"title":6,"created_at":23},{"id":121,"slug":122,"title":123,"created_at":124},"a1379e9a-6785-4ff5-9b0a-8cff55f8264f","cursor-composer-2-started-from-kimi-en","Cursor’s Composer 2 started from Kimi","2026-03-28T03:11:59.132398+00:00"]