Kimi K2.7 Code Pricing Explained: API Costs and Plans

8 min read2026-06-25
Kimi K2.7 Code API pricing

Kimi K2.7 Code membership plans & usage tiers

If you want to use Kimi K2.7 Code directly in your terminal or IDE, the simplest path is through a Kimi Code subscription.

ModeratoAllegrettoAllegroVivace
Monthly Price$19 / month$39 / month$99 / month$199 / month
DescriptionWeekly refreshed usage quotas with multi-device login, allowing you to share your plan across devices and work efficiently on multiple projectsAmple weekly limits and increased concurrency caps, providing a high-value solution for advanced usersExpansive quota designed for excellence. Seamlessly handles everything from daily tasks to intensive developmentOffering our highest weekly plan quotas, enabling you to handle complex projects and large codebases with ease

Each plan includes weekly refreshed usage quotas. Higher tiers provide larger weekly limits and higher concurrency caps, making them suitable for more demanding projects. For the latest plan details, see the official membership page.

What's included beyond Kimi Code?

Purchasing any Kimi Code plan also unlocks the broader Kimi membership benefits, powered primarily by Kimi K2.6.

AdagioModeratoAllegrettoAllegroVivace
Price$0$19 / month$39 / month$99 / month$199 / month
Agent Usage660150360720
Agent Concurrent Tasks1 task2 tasks2 tasks4 tasks4 tasks
Agent Priority Queue4x speed4x speed4x speed4x speed
Agent Swarm25 uses included50 uses included120 uses included240 uses included
Agent Swarm Concurrent Subagents2 subagents4 subagents4 subagents8 subagents
Kimi Claw
Kimi Claw Android
Kimi Claw PC
Group Chat with Claw10 group chats10 group chats10 group chats
Deploy a Website with a Database
Kimi Work Active Scheduled Tasks2 tasks10 tasks15 tasks20 tasks20 tasks

These cross-product benefits are available across all paid tiers alongside your Kimi K2.7 Code usage, giving you both a dedicated coding agent and a full AI workspace in one subscription.

Kimi K2.7 Code API pricing

The Kimi K2.7 Code API pricing uses a token-based model, with usage billed per 1M tokens (1,000,000 tokens) for both input and output processing, enabling clear and predictable cost control.

ModelUnitInput price (cache hit)Input price (cache miss)Output priceContext window
kimi-k2.7-code1M tokens$0.19$0.95$4.00262,144 tokens
kimi-k2.7-code-highspeed1M tokens$0.38$1.90$8.00262,144 tokens

The standard kimi-k2.7-code model delivers Kimi's most intelligent coding performance with full support for long-context reasoning and agentic tasks. The kimi-k2.7-code-highspeed variant offers approximately 180 Tokens/s output speed (up to 260 Tokens/s in short context scenarios), providing an extreme coding experience for latency-sensitive workflows.

How Kimi K2.7 Code API billing works

Kimi K2.7 Code API uses a token-based pricing model for each request, where every interaction with the model consumes tokens that are billed according to their type. Within this model, tokens are generally categorized into three types: input tokens, output tokens, and cached input tokens.

Input tokens

Input tokens represent everything sent to the model, including:

  • User prompts

  • System instructions

  • Conversation history or context

These tokens determine how much context the model needs to process before generating a response.

Cached input tokens

Cached input tokens occur when previously processed context is reused.

  • If the same or similar context is reused, it can be served at a reduced cost

  • This significantly improves efficiency for repetitive workflows

  • It is especially useful in long-context applications or multi-turn interactions

Output tokens

Output tokens are generated by the model in response to a request. They represent the actual AI-generated content, such as:

  • Code generation

  • Text responses

  • Structured outputs

Because output generation requires additional computation, it is typically priced higher than input tokens.

Conclusion

Kimi K2.7 Code offers a transparent and flexible pricing structure that scales from individual development projects to large production workloads. With token-based billing, cache discounts, and a high-speed model option, developers can balance performance and cost based on their specific requirements. This makes it easier to build, test, and deploy AI-powered coding workflows while maintaining predictable spending.

FAQ

How is Kimi K2.7 Code API billing calculated?
Kimi K2.7 Code API uses per-token billing. Every request consumes input tokens (your prompt plus context) and output tokens (the model's generated response). Input tokens are billed at $0.95 per 1M tokens on a cache miss, or $0.19 per 1M tokens on a cache hit. Output tokens are billed at $4.00 per 1M tokens for the standard model, and $8.00 per 1M tokens for the high-speed variant. All prices exclude applicable taxes.
How much can I save with context caching?
When previously processed context is reused, cached input tokens are billed at $0.19 per 1M tokens instead of $0.95 — an 80% reduction on input costs for repeated context. This makes caching particularly effective for long multi-turn coding sessions, iterative debugging workflows, and any scenario where similar prompts or codebase context are sent repeatedly.
Should I use the API or a Kimi membership plan?
Use the API if you're building a custom tool, CI/CD pipeline, or internal developer platform where you need programmatic access and fine-grained control over token consumption. Use a Kimi membership plan (starting at $19/month) if you want instant access through Kimi Code in your terminal and IDE without managing API keys or estimating token usage. Membership plans include weekly refreshed Kimi Code credits; higher tiers unlock more credits and higher concurrency.
Does Kimi K2.7 Code API pricing work for enterprise-scale usage?
Yes. The token-based model scales from individual developers to enterprise deployments with no upper usage limits. For high-volume scenarios, the automatic context caching mechanism significantly reduces costs when similar inputs are reused across sessions. If you need custom enterprise terms, dedicated support, or volume discounts, contact the Kimi sales team through the Kimi API Platform.