Cohere launches North Mini Code, its first open-source agentic coding model
Cohere has released North Mini Code, a 30B-parameter mixture-of-experts open-source agentic coding model with only 3B active parameters, available under an Apache 2.0 license.
Score breakdown
North Mini Code is Cohere's first open-source, developer-facing model, extending agentic coding capabilities to the broader developer ecosystem under a permissive Apache 2.0 license.
- 01North Mini Code is Cohere's first agentic coding model and first open-source model for developers.
- 02It uses a mixture-of-experts (MoE) architecture with 30B total parameters and only 3B active parameters.
- 03Released under an Apache 2.0 license.
Cohere has released North Mini Code open-source, marking the company's first agentic coding model and the first entry in what it describes as its next generation of powerful models. The model uses a mixture-of-experts (MoE) architecture with 30B total parameters and just 3B active parameters, a design intended to deliver strong software development performance while minimizing hardware requirements. The minimum hardware requirement listed is a single H100 GPU at FP8 precision.
Cohere frames the release as part of its broader mission to make sovereign AI practical for developers.
The model is released under an Apache 2.0 license and is available in several ways: weights can be downloaded from Hugging Face, it can be deployed in a managed inference environment via Cohere's Model Vault, accessed through the Cohere API, or tried via OpenRouter. North Mini Code supports a 256K total context length with a 64K maximum generation length, and is optimized for code generation, agentic software engineering, and terminal tasks. Cohere frames the release as part of its broader mission to make sovereign AI practical for developers.
Key facts
- 01North Mini Code is Cohere's first agentic coding model and first open-source model for developers.
- 02It uses a mixture-of-experts (MoE) architecture with 30B total parameters and only 3B active parameters.
- 03Released under an Apache 2.0 license.
- 04Supports a 256K total context window with a 64K maximum generation length.
- 05Optimized for code generation, agentic software engineering, and terminal tasks.
- 06Minimum hardware requirement is 1× H100 GPU at FP8 precision.
- 07Available via Hugging Face (weights), Cohere API, Cohere Model Vault, and OpenRouter.
Topics
Summary and scoring are generated automatically from the original article. We always link back to the publisher and never republish images or paywalled content. Last processed Jun 10, 2026 · 15:34 UTC. How this works →