Kimi K2.6 is an open-source model featuring state-of-the-art coding, long-horizon execution, and agent swarm capabilities. Below is an overview of Kimi API pricing and Kimi membership plans.
The Kimi K2.6 API pricing uses a token-based model, with usage billed per 1M tokens (1,000,000 tokens) for both input and output processing, enabling clear and predictable cost control.
| Model | Unit | Input Price (Cache Hit) | Input Price (Cache Miss) | Output Price | Context Window |
|---|---|---|---|---|---|
| kimi-k2.6 | 1M tokens | $0.16 | $0.95 | $4.00 | 262,144 tokens |
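
As a rough illustration of how the per-1M-token rates translate into the cost of a single request, here is a minimal Python sketch. The prices are copied from the table above; the function name `estimate_cost` and the sample token counts are hypothetical, and the result is a pre-tax estimate rather than an official billing calculation.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the pricing table.
PRICE_PER_MILLION = {
    "input_cache_hit": Decimal("0.16"),
    "input_cache_miss": Decimal("0.95"),
    "output": Decimal("4.00"),
}
TOKENS_PER_UNIT = Decimal(1_000_000)


def estimate_cost(cache_hit_tokens: int, cache_miss_tokens: int, output_tokens: int) -> Decimal:
    """Return an estimated pre-tax cost in USD for one request (illustrative helper)."""
    counts = {
        "input_cache_hit": cache_hit_tokens,
        "input_cache_miss": cache_miss_tokens,
        "output": output_tokens,
    }
    total = Decimal(0)
    for kind, tokens in counts.items():
        # Each token type is billed at its own per-1M-token rate.
        total += PRICE_PER_MILLION[kind] * Decimal(tokens) / TOKENS_PER_UNIT
    return total


# Hypothetical request: 20,000 uncached input tokens and 2,000 output tokens.
print(estimate_cost(0, 20_000, 2_000))
```

For this hypothetical request the estimate works out to $0.019 for input plus $0.008 for output, or $0.027 in total before tax.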
Every request to the Kimi K2.6 API consumes tokens, which are billed according to their type. Tokens fall into three categories: input tokens, output tokens, and cached input tokens.
Input tokens represent everything sent to the model in a request.
These tokens determine how much context the model needs to process before generating a response.
Output tokens are generated by the model in response to a request. They represent the actual AI-generated content.
Because output generation requires additional computation, output tokens are typically priced higher than input tokens: $4.00 per 1M output tokens versus $0.95 per 1M uncached input tokens for kimi-k2.6.
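Cached input tokens occur when previously processed context is reused. They are billed at the cache-hit rate ($0.16 per 1M tokens) instead of the cache-miss rate ($0.95 per 1M tokens).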
Kimi K2.6 API pricing follows a transparent, consumption-based model, with a few important details outlined below to help developers better understand billing and cost behavior.
All prices listed for Kimi K2.6 API pricing exclude applicable taxes. Taxes are automatically calculated at checkout based on the user's billing region and local tax requirements, ensuring accurate and compliant invoicing for each order.
To make Kimi K2.6 API pricing easier to understand, billing is calculated using a consistent token standard: all prices are quoted per 1M tokens (1,000,000 tokens), for both input and output.
This structure ensures transparent and predictable cost estimation across all Kimi API requests.
Kimi K2.6 also includes a caching mechanism that helps optimize usage costs. When working with repeated or similar inputs, cached input tokens are billed at a reduced rate, which lowers the overall cost under the Kimi API pricing model.
This makes Kimi K2.6 API pricing more cost-effective for production scenarios where prompts or contexts are frequently reused.
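To get a feel for how much caching can matter, the following Python sketch compares input costs with and without cache hits for a workload that resends the same context many times. It rests on a simplifying assumption that is not stated in the pricing details: the shared context is billed as a cache miss on the first request and as a cache hit on every later one, while request-specific tokens always miss. The function name `input_cost` and the workload numbers are hypothetical.

```python
# Prices in USD per 1M input tokens, taken from the pricing table.
CACHE_HIT_PRICE = 0.16
CACHE_MISS_PRICE = 0.95


def input_cost(shared_tokens: int, unique_tokens: int, requests: int, cached: bool) -> float:
    """Estimated pre-tax input cost in USD across `requests` calls.

    Assumption (illustrative only): with caching, the shared context misses
    once and then hits on all later requests; unique tokens always miss.
    """
    if requests <= 0:
        return 0.0
    if cached:
        # One miss for the first request, hits for the remaining ones.
        shared = shared_tokens * (CACHE_MISS_PRICE + (requests - 1) * CACHE_HIT_PRICE)
    else:
        shared = shared_tokens * requests * CACHE_MISS_PRICE
    unique = unique_tokens * requests * CACHE_MISS_PRICE
    return (shared + unique) / 1_000_000


# Hypothetical workload: a 50,000-token shared context, 500 new tokens per request, 100 requests.
without_cache = input_cost(50_000, 500, 100, cached=False)
with_cache = input_cost(50_000, 500, 100, cached=True)
print(f"without cache: ${without_cache:.4f}")
print(f"with cache:    ${with_cache:.4f}")
print(f"saved:         ${without_cache - with_cache:.4f}")
```

Under these assumptions the input cost falls from roughly $4.80 to roughly $0.89. Actual savings depend on how often requests really hit the cache.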
While there is no permanent Kimi API free tier for production usage, the pricing model is designed to remain flexible and scalable, allowing developers to control costs based on actual token consumption.
In addition to API-based usage pricing, Kimi offers tiered membership plans, so users can choose the tier that best matches their daily usage and scale requirements.
| Feature | Adagio | Moderato | Allegretto | Allegro | Vivace |
|---|---|---|---|---|---|
| Annual Billing (Effective Monthly) | $0 / month | $15 / month | $31 / month | $79 / month | $159 / month |
| Agent Usage | 6 | 60 | 150 | 360 | 720 |
| Concurrent Tasks | 1 task | 2 tasks | 2 tasks | 4 tasks | 4 tasks |
| Agent Priority Queue | × | 4× speed | 4× speed | 4× speed | 4× speed |
| Agent Swarm | × | × | 50 uses included | 120 uses included | 240 uses included |
| Concurrent Subagents | × | × | 4 subagents | 4 subagents | 8 subagents |
| Kimi Code | × | 1× credits | 5× credits | 15× credits | 30× credits |
| Kimi Claw | × | × | ✓ | ✓ | ✓ |
| Kimi Claw Android | × | × | ✓ | ✓ | ✓ |
| Kimi Claw (Mac ARM / PC) | × | × | ✓ | ✓ | ✓ |
| Group Chat with Claw | × | × | 10 chats | 10 chats | 10 chats |
| Professional Data Requests | 200 | 2,000 | 5,000 | 12,000 | 24,000 |
| Deploy Website with Database | × | ✓ | ✓ | ✓ | ✓ |
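
One way to compare the tiers programmatically is to encode a few rows of the table and pick the cheapest plan that meets a set of minimum requirements. The Python sketch below does this for agent usage, concurrent tasks, and Agent Swarm uses. The `Plan` class and the `cheapest_plan` and `annual_total` helpers are hypothetical names, and the annual total assumes the yearly charge is exactly 12 times the effective monthly price, which the table does not state explicitly.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    name: str
    monthly_usd: int       # effective monthly price under annual billing
    agent_usage: int
    concurrent_tasks: int
    swarm_uses: int        # 0 where the table shows "×"


# Values copied from the membership table.
PLANS = [
    Plan("Adagio", 0, 6, 1, 0),
    Plan("Moderato", 15, 60, 2, 0),
    Plan("Allegretto", 31, 150, 2, 50),
    Plan("Allegro", 79, 360, 4, 120),
    Plan("Vivace", 159, 720, 4, 240),
]


def cheapest_plan(min_agent_usage: int = 0, min_tasks: int = 1,
                  min_swarm_uses: int = 0) -> Optional[Plan]:
    """Return the lowest-priced plan meeting every minimum, or None if none does."""
    candidates = [
        p for p in PLANS
        if p.agent_usage >= min_agent_usage
        and p.concurrent_tasks >= min_tasks
        and p.swarm_uses >= min_swarm_uses
    ]
    return min(candidates, key=lambda p: p.monthly_usd, default=None)


def annual_total(plan: Plan) -> int:
    """Assumed yearly charge in USD: 12 times the effective monthly price."""
    return plan.monthly_usd * 12


# Hypothetical requirement: at least 100 agent uses and some Agent Swarm access.
choice = cheapest_plan(min_agent_usage=100, min_swarm_uses=1)
if choice is not None:
    print(choice.name, annual_total(choice))
```

For the hypothetical requirement shown, the sketch selects Allegretto. Features not encoded here, such as Kimi Code credits or Kimi Claw access, would need extra fields.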
Kimi K2.6 offers flexible pricing for both developers and everyday users. The token-based API pricing keeps costs transparent and predictable, with caching support to reduce expenses in high-volume or long-context workflows. For those who prefer structured access, the tiered membership plans scale from free to professional use, covering agent capabilities, concurrent tasks, and tools like Kimi Claw and Agent Swarm. Whether you're integrating via API or exploring Kimi's full feature set, there's a plan designed to match your workflow and budget.