Apr 21, 2026·1 min readNew Models & Releases

Kimi K2.6 refreshes open-model lead with agentic coding gains

Moonshot's Kimi K2.6 is a 1T-parameter MoE open-weight model claiming open-source SOTA on benchmarks including SWE-Bench Pro 58.6 and HLE w/ tools 54.0, with support for 4,000+ tool calls and 300 parallel sub-agents.

Latent Space·Latent.Space

Read at source

Composite

6.8

out of 10

Novelty · 25%

Novelty

Impact · 43%

Impact

Credibility · 12%

Credibility

Depth · 20%

Depth

Weights applied. How scores work ↗

Why it matters

Developers evaluating open-weight backends for agentic coding and long-horizon infra tasks now have a 1T-parameter MoE option with broad day-0 ecosystem support and documented multi-agent orchestration patterns to benchmark against proprietary alternatives.

01Kimi K2.6 is a 1T-parameter MoE with 32B active parameters, 384 experts (8 routed + 1 shared), MLA attention, 256K context, and INT4 quantization.
02Moonshot claims open-source SOTA on SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), and HLE w/ tools (54.0).
03K2.6 supports 4,000+ tool calls, 12+ hour continuous runs, and 300 parallel sub-agents via 'Claw Groups' multi-agent coordination.

Summary— our read of the original

Moonshot's Kimi K2.6 is the follow-up to K2.5, released roughly three months later, and continues the lab's run as the leading Chinese open model lab throughout 2026. The model is a 1T-parameter MoE with 32B active parameters, 384 experts (8 routed + 1 shared), MLA attention, 256K context, native multimodality, and INT4 quantization. It launched with day-0 support across vLLM, OpenRouter, Cloudflare Workers AI, Baseten, MLX, Hermes Agent, and OpenCode. Moonshot claims open-source SOTA on HLE w/ tools (54.0), SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), Toolathlon (50.0), CharXiv w/ python (86.7), and Math Vision w/ python (93.2). The model also touts a 68.6% win+tie rate against Gemini 3.1 Pro on frontend design tasks.\n\nThe more novel claims are around long-horizon agentic execution: 4,000+ tool calls, 12+ hour continuous runs, and 300 parallel sub-agents coordinated through "Claw Groups," a rebranding and extension of the lab's earlier Agent Swarm RL work. Community reactions highlighted K2.6 as a viable Claude/GPT backend for coding and infrastructure, with reports of a 5-day autonomous infra agent run, kernel rewrites, and a Zig inference engine outperforming LM Studio by 20% TPS. Alibaba's Qwen3.6-Max-Preview also dropped in the same window, with early community takes noting unusual stability for long-reasoning tasks — including solving AIME 2026 #15 after roughly 30 minutes of thinking — and the model reaching #7 in Code Arena while moving Alibaba to #3 lab there.\n\nThe Latent Space roundup also highlighted rapid growth in the Hermes Agent open agent stack, which reportedly surpassed 100K GitHub stars in under two months and overtook OpenClaw in weekly star growth. Substantive community content described advanced multi-agent orchestration patterns using Hermes, including stateless ephemeral units for parallelism (`skip_memory=True`, `skip_context_files=True`), LLM-driven replanning over structured failure metadata (`status`, `exit_reason`, `tool_trace`), and dynamic context injection via directory-local `AGENTS.md` / `.cursorrules` files — described as a more disciplined orchestration model than single-prompt history stuffing.

Key facts

01Kimi K2.6 is a 1T-parameter MoE with 32B active parameters, 384 experts (8 routed + 1 shared), MLA attention, 256K context, and INT4 quantization.
02Moonshot claims open-source SOTA on SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), and HLE w/ tools (54.0).
03K2.6 supports 4,000+ tool calls, 12+ hour continuous runs, and 300 parallel sub-agents via 'Claw Groups' multi-agent coordination.
04K2.6 launched with day-0 support in vLLM, OpenRouter, Cloudflare Workers AI, Baseten, MLX, Hermes Agent, and OpenCode.
05Moonshot claims a 68.6% win+tie rate vs Gemini 3.1 Pro on frontend design tasks.
06Alibaba's Qwen3.6-Max-Preview reached #7 in Code Arena, moving Alibaba to #3 lab there.
07Hermes Agent reportedly surpassed 100K GitHub stars in under two months, overtaking OpenClaw in weekly star growth.

Topics

#model-release #agentic-coding #open-source #agent-framework #benchmarks

Methodology

Summary and scoring are generated automatically from the original article. We always link back to the publisher and never republish images or paywalled content. Last processed Apr 22, 2026 · 11:07 UTC. How this works →

Apr 21, 2026·1 min readNew Models & Releases

Kimi K2.6 refreshes open-model lead with agentic coding gains

Latent Space·Latent.Space

Read at source

Composite

6.8

out of 10

Novelty · 25%

Novelty

Impact · 43%

Impact

Credibility · 12%

Credibility

Depth · 20%

Depth

Weights applied. How scores work ↗

Why it matters

01Kimi K2.6 is a 1T-parameter MoE with 32B active parameters, 384 experts (8 routed + 1 shared), MLA attention, 256K context, and INT4 quantization.
02Moonshot claims open-source SOTA on SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), and HLE w/ tools (54.0).
03K2.6 supports 4,000+ tool calls, 12+ hour continuous runs, and 300 parallel sub-agents via 'Claw Groups' multi-agent coordination.

Summary— our read of the original

Key facts

01Kimi K2.6 is a 1T-parameter MoE with 32B active parameters, 384 experts (8 routed + 1 shared), MLA attention, 256K context, and INT4 quantization.
02Moonshot claims open-source SOTA on SWE-Bench Pro (58.6), SWE-bench Multilingual (76.7), BrowseComp (83.2), and HLE w/ tools (54.0).
03K2.6 supports 4,000+ tool calls, 12+ hour continuous runs, and 300 parallel sub-agents via 'Claw Groups' multi-agent coordination.
04K2.6 launched with day-0 support in vLLM, OpenRouter, Cloudflare Workers AI, Baseten, MLX, Hermes Agent, and OpenCode.
05Moonshot claims a 68.6% win+tie rate vs Gemini 3.1 Pro on frontend design tasks.
06Alibaba's Qwen3.6-Max-Preview reached #7 in Code Arena, moving Alibaba to #3 lab there.
07Hermes Agent reportedly surpassed 100K GitHub stars in under two months, overtaking OpenClaw in weekly star growth.

Topics

#model-release #agentic-coding #open-source #agent-framework #benchmarks

Methodology

Score breakdown

Key facts

Topics

Score breakdown

Key facts

Topics