ByteDance Doubao (Doubao-Pro / Lite)
LLMby ByteDance
ByteDance's Doubao models are available through Volcano Engine (Ark platform). Known for aggressive pricing and generous free credits. Doubao-Pro offers strong performance at very low cost.
Registration & API Key Steps
Step 2
Register with your phone number and verify with SMS code.
Step 3
Complete real-name verification (required for Chinese cloud services).
Step 4
Navigate to Console > Volcano Ark > Model Services > Doubao.
Step 5
Activate the Doubao API service.
Step 6
Create an API key in the key management section.
Step 7
Create an "Endpoint" for the model you want to use.
Pricing
| Tier | Price | Features |
|---|---|---|
| Doubao-Lite-32K | ¥0.3 / ¥0.6 per 1M tokens | Input / Output (~$0.04/$0.08). Most affordable. |
| Doubao-Pro-32K | ¥0.8 / ¥2 per 1M tokens | Input / Output (~$0.11/$0.27). Best value for quality. |
| Doubao-Pro-256K | ¥5 / ¥9 per 1M tokens | Input / Output. Long context version. |
| Doubao-1.5-Pro-256K | ¥4 / ¥8 per 1M tokens | Input / Output. Latest model with enhanced reasoning. |
Application Tips
Tip 1
Doubao is one of the cheapest LLM APIs available — Lite model at ¥0.3/1M tokens.
Tip 2
Every model gets 500K free tokens monthly that never expire.
Tip 3
Volcano Engine also hosts DeepSeek, Llama, and other third-party models.
Tip 4
Cache hits reduce costs by up to 80%.
Tip 5
API uses endpoint-based routing — create endpoints before making API calls.
Tip 6
Doubao-Seed-Code is optimized for programming with 256K context.
China Access Solutions
Access Solution
Directly accessible in China. Volcano Engine is a domestic cloud platform by ByteDance. Supports Chinese phone, Alipay/WeChat payment.
Code Example
from volcenginesdkarkruntime import Ark
client = Ark(api_key="your-ark-api-key")
response = client.chat.completions.create(
model="your-endpoint-id", # e.g., "ep-20240xxx-xxx"
messages=[
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "请用简单的语言解释人工智能"}
]
)
print(response.choices[0].message.content)
# --- cURL example ---
# curl https://ark.cn-beijing.volces.com/api/v3/chat/completions \
# -H "Authorization: Bearer your-ark-api-key" \
# -H "Content-Type: application/json" \
# -d '{"model":"your-endpoint-id","messages":[{"role":"user","content":"你好"}]}'Rate Limits
| Tier | Limits |
|---|---|
| Default | Varies by model and endpoint configuration. Default: 120 RPM, 120K TPM. Can request increases through console. |
Recommended Use Cases
Related API Guides
OpenAI GPT-4o / GPT-4.1 / o3
OpenAI
OpenAI's flagship LLM family including GPT-4o for multimodal tasks, GPT-4.1 for long-context coding, and o3 for advanced reasoning. Industry-leading models with the largest developer ecosystem.
Anthropic Claude (Sonnet 4.5 / Opus 4.5)
Anthropic
Anthropic's Claude model family excels in nuanced reasoning, safety, and long-context tasks. Claude Sonnet 4.5 offers the best balance of cost and performance, while Opus 4.5 delivers frontier intelligence.
Google Gemini (2.5 Pro / 2.5 Flash)
Google's Gemini models offer a generous free tier, 1M token context window, and strong multimodal capabilities. Gemini 2.5 Pro leads in reasoning, while Flash models provide cost-effective alternatives.
Meta Llama 4 (Scout / Maverick)
Meta
Meta's open-source Llama 4 models are free to use and available through multiple cloud providers. Llama 4 Scout and Maverick offer competitive performance at extremely low cost through partner APIs.