VentureBeat
Follow
Cohere open-sources a coding agent that runs on a single H100
Cohere has released North Mini Code, an open-source model for agentic coding pipelines. This 30 billion parameter mixture-of-experts model runs efficiently on a single H100 and is ideal for tasks like sub-agent orchestration and architecture mapping. It boasts a large 256,000 token context window and a 64,000 token maximum generation length. North Mini Code is specifically engineered for software engineering workflows, including integrated tool use and interleaved thinking. It excels at analyzing large codebases, mapping system architectures, and performing code reviews. Furthermore, the model is trained for terminal-based agentic tasks, interacting with shell commands and tooling. Cohere trained it through supervised fine-tuning and reinforcement learning across diverse agent scaffolds. While offering impressive output token generation, it can be more verbose, leading to higher inference costs in high-volume scenarios. This release provides a direct alternative to managed models like Claude Fable 5, emphasizing local deployment and cost-effectiveness. Enterprises must now consider purpose-built agentic training and the impact of verbosity on pipeline costs. The choice between North Mini Code and managed services presents a real trade-off between cost control and infrastructure overhead.