Open Source AIJune 13, 20264 min read

Kimi K2.7 Code: Moonshot AI launches 1T-code model vs GPT-5.5

Open-weights Kimi K2.7 Code is a one-trillion-parameter programming model that undercuts GPT-5.5 and Claude Opus 4.8 on price per token.

The BrieftideJune 13, 2026

TL;DR

01Open-weights Kimi K2.7 Code is a one-trillion-parameter programming model that undercuts GPT-5.5 and Claude Opus 4.8 on price per token.
02Moonshot AI has released Kimi K2.7 Code, an open-weights, one-trillion-parameter model trained for programming tasks, and priced to compete on inference cost)))))))))))-cost).
03Kimi K2.7 Code is being distributed with weights available to the public, enabling teams to run the model on their own hardware or through third-party hosts.

Moonshot AI has released Kimi K2.7 Code, an open-weights, one-trillion-parameter model trained for programming tasks, and priced to compete on inference cost. The company positions the model as a low-cost option for code generation and related developer workflows, and initial comparisons show Kimi undercutting OpenAI's GPT-5.5 and Anthropic's Claude Opus 4.8 on price per token by as much as 12x.

Kimi K2.7 Code is being distributed with weights available to the public, enabling teams to run the model on their own hardware or through third-party hosts. Moonshot AI highlights the model's focus on code completion, synthesis, and code-aware reasoning, and publishes example latency and throughput figures for common GPU configurations. While the model's parameter count is one trillion, Moonshot cautions that Kimi does not yet match proprietary models on a range of quality benchmarks.

Benchmarks and pricing

Early benchmark snapshots place Kimi behind GPT-5.5 and Claude Opus 4.8 on many standard code-evaluation metrics, including functional correctness on code synthesis tasks and multi-step reasoning in complex debugging scenarios. The gap is not uniform, with Kimi performing competitively on single-step completions and boilerplate generation, while more sophisticated reasoning and long-context problems favor the closed models.

Where Kimi stands out is cost. Moonshot's published pricing and community-hosted instances put Kimi's effective price per token significantly below the leading managed APIs. Aggregated comparisons show Kimi can be up to 12x cheaper per token than GPT-5.5 and Claude Opus 4.8 depending on deployment and throughput assumptions. That gap narrows if customers use highly optimized managed endpoints from established providers, but it remains material for teams running large volumes of code inference in-house.

The open-weights nature of Kimi lowers entry barriers for on-premise and private-cloud deployments, and it enables organizations to tailor inference stacks for cost or latency. Moonshot supplies recommended configurations and quantized checkpoints to reduce memory and compute needs, and community maintainers are already publishing optimized containers and orchestration tips.

Quality trade-offs, however, are real. On head-to-head functional tests, GPT-5.5 and Claude Opus 4.8 maintain higher pass@k and fewer hallucinated function calls. Moonshot acknowledges these gaps in its technical notes and frames Kimi as a pragmatic option when cost, control, or data sovereignty are priorities.

Why it matters

An open-weights, trillion-parameter code model priced far below leading closed models changes the calculus for teams that run heavy code inference workloads or require private deployments. Developers and smaller companies can now choose lower-cost, locally controlled models for many production tasks, while organizations that need the highest correctness and reasoning will still favor proprietary offerings. The release intensifies pressure on API providers to justify managed pricing with measurable quality or integration advantages.

Kimi K2.7 Code versus GPT-5.5 and Claude Opus 4.8

Item
Kimi K2.7 Code	1T	Open weights	Code generation and synthesis	1x	Competitive on single-step code, trails on complex reasoning
OpenAI GPT-5.5	Not disclosed	Closed	General purpose, strong code reasoning	Up to 12x	Higher overall correctness and multi-step reasoning
Anthropic Claude Opus 4.8	Not disclosed	Closed	General purpose with safety and reasoning focus	Up to 12x	High quality on reasoning and safety-sensitive tasks

Written by The Brieftide · Source: The Decoder

The Brieftide Daily · 06:00

Briefs like this one, in your inbox every morning.

FreeOne email a dayEvery claim sourcedUnsubscribe in one click

Continue reading

OpenAI backs EU AI content transparency code

OpenAI pledged to support the European Code of Practice on AI content transparency.

OpenAIFRONTIER LAB

PRC-linked AI influence campaigns target US tech policy debates

OpenAI says PRC-linked actors used AI-generated content and coordinated accounts to push narratives about data centers and tariffs.

OpenAIFRONTIER LAB

LSEG adopts OpenAI to scale trusted AI across global teams

London Stock Exchange Group embedded OpenAI models across global teams, accelerating insights and shortening release cycles.

OpenAIFRONTIER LAB

OpenAI people-first AI industrial policy and workforce plan

OpenAI proposes workforce programs, public investment, corporate governance rules and international coordination to expand AI opportunity.