* When using the OpenAI compatible interface, you can use existing OpenAI SDKs and tools by simply modifying the base_url parameter. The Anthropic native interface provides more complete functionality support.
* Our main model is k2sonnet-chat, an optimized conversational model that supports Chinese and English dialogue, code generation, and various other tasks.
Models & Pricing
The model prices listed in the table below are in units of "million tokens". A token is the smallest unit used by the model to represent natural language text, and can be a word, number, punctuation mark, etc. We will charge based on the total number of tokens input and output by the model.
Due to the widespread attention received by the new product launch, the current system load is high. To ensure service quality, k2sonnet-chat currently supports only 5 concurrent requests. We are actively coordinating resources to enhance user experience. Thank you for your understanding and support.
Model Details
Model
k2sonnet-chat
Model Version
k2sonnet-chat (Optimized Version)
Context Length
128K
Output Length
Default 4K, Max 32K
Json Output
Supported
Function Calling
Supported
Vision Capabilities
Supported
Thinking
Supported
Pricing
Model
k2sonnet-chat
Input Uncached (Million Tokens)
$1.5
Output (Million Tokens)
$5
Input Cached (Million Tokens)
$0.2
Billing Rules
Deduction = Token Consumption × Model Unit Price. The corresponding fees will be directly deducted from the recharged balance or gifted balance. When both recharged balance and gifted balance exist, gifted balance will be deducted first.
Resource Pack Billing
Uncached Input: Actual input tokens deducted from resource pack tokens at 1:1 ratio. Example: 10,000 input tokens deducts 10,000 tokens from resource pack
Cached Input: Actual input tokens divided by 7.5 deducted from resource pack. Example: 10,000 cached tokens deducts only 1,333.33 tokens from resource pack
Output: No discount, actual output tokens deducted from resource pack at 1:1 ratio. Example: 10,000 output tokens deducts 10,000 tokens from resource pack
Product prices may change, and K2Sonnet reserves the right to modify prices. Please recharge according to actual usage as needed and check this page regularly for the latest pricing information.
Technical Notice: To maximize model capabilities, K2Sonnet team has implemented third-party model capability fusion, simulated thinking abilities, and API context expansion capabilities at the API layer. By using K2Sonnet's services, you acknowledge and agree to these technical solutions.
Token Usage Calculation
A token is the basic unit used by the model to represent natural language text, and is also our billing unit. It can be intuitively understood as a "character" or "word". Typically, 1 Chinese word, 1 English word, 1 number or 1 symbol counts as 1 token.
Token Conversion Ratio
Generally, the conversion ratio between tokens and character count in models is approximately as follows:
Text Type
Conversion Ratio
Example
English Characters
1 English character ≈ 0.3 tokens
"Hello" ≈ 1.5 tokens
Chinese Characters
1 Chinese character ≈ 0.6 tokens
"你好" ≈ 1.2 tokens
Numbers and Symbols
1 number or symbol ≈ 1 token
"123!" ≈ 4 tokens
Note: Since different models have different tokenization methods, the conversion ratios may vary. The actual token count for each processing is based on the model's response, which you can view in the usage field of the returned results.