Alibaba Qwen (Qwen-Max / Qwen-Plus)

LLM

by Alibaba Cloud

Alibaba's Qwen series excels at Chinese and multilingual tasks. The models are available through Alibaba Cloud's Bailian (DashScope) platform, and Qwen 2.5 is also released as open source for self-hosting.

Registration & API Key Steps

Step 1

Visit Alibaba Cloud Bailian Console and log in with your Alibaba Cloud account.

Step 2

If you don't have an account, register at aliyun.com (supports Chinese phone/ID).

Step 3

Complete real-name verification (required for Chinese cloud services).

Step 4

Agree to the Bailian service terms to activate the service.

Step 5

Navigate to API Key management and create a new key.

Step 6

Install the DashScope SDK or use the OpenAI-compatible endpoint.
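
As a minimal sketch of Steps 5 and 6, the snippet below builds a request for the OpenAI-compatible endpoint using only the Python standard library. It assumes the key from Step 5 has been exported in an environment variable named DASHSCOPE_API_KEY; that variable name and the helper names build_chat_request and send_chat_request are illustrative choices, not part of the guide.

```python
import json
import os
import urllib.request
from typing import Optional

# OpenAI-compatible base URL used elsewhere in this guide.
BASE_URL = "https://dashscope.aliyuncs.com/compatible-mode/v1"


def build_chat_request(
    prompt: str,
    model: str = "qwen-plus",
    api_key: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a chat completion request."""
    # Assumption: the key is stored in DASHSCOPE_API_KEY unless passed in.
    key = api_key or os.environ.get("DASHSCOPE_API_KEY")
    if not key:
        raise RuntimeError("Set DASHSCOPE_API_KEY or pass api_key explicitly")
    body = json.dumps(
        {"model": model, "messages": [{"role": "user", "content": prompt}]}
    ).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat_request(req: urllib.request.Request) -> str:
    """Send the request and return the assistant's reply text."""
    with urllib.request.urlopen(req, timeout=30) as resp:
        reply = json.load(resp)
    return reply["choices"][0]["message"]["content"]
```

Calling send_chat_request(build_chat_request("你好")) would issue a real, billable request, so the two steps are kept separate here to make the request easy to inspect first.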

Pricing

Tier | Price per 1M tokens (input / output) | Features
Qwen-Turbo | ¥2 / ¥6 (~$0.28 / $0.82) | Fast and affordable.
Qwen-Plus | ¥4 / ¥12 (~$0.55 / $1.65) | Balanced performance.
Qwen-Max | ¥40 / ¥120 (~$5.50 / $16.50) | Most capable model.
Qwen-Long | ¥0.5 / ¥2 | Optimized for long documents.
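
To make the pricing concrete, here is a rough cost estimator built from the prices listed above. Only "qwen-plus" appears as a model ID in this guide; the other lowercase IDs and the function name estimate_cost_cny are assumptions for illustration, and the sketch ignores free token allocations.

```python
from decimal import Decimal

# Prices in CNY per 1M tokens as (input, output), copied from the pricing list.
# Model IDs other than "qwen-plus" are assumed to follow the same naming pattern.
PRICES_CNY_PER_MILLION = {
    "qwen-turbo": (Decimal("2"), Decimal("6")),
    "qwen-plus": (Decimal("4"), Decimal("12")),
    "qwen-max": (Decimal("40"), Decimal("120")),
    "qwen-long": (Decimal("0.5"), Decimal("2")),
}


def estimate_cost_cny(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost in CNY of one workload from its token counts."""
    input_price, output_price = PRICES_CNY_PER_MILLION[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / Decimal(1_000_000)
```

For example, 1M input tokens plus 500K output tokens on Qwen-Plus comes to 4 + 6 = ¥10, while the same workload on Qwen-Max comes to 40 + 60 = ¥100.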

Application Tips

Tip 1

Qwen models are also available as open-source releases on Hugging Face for self-hosting.

Tip 2

The API supports the OpenAI-compatible format, so migrating from OpenAI is straightforward.

Tip 3

Qwen-Long is extremely cheap for long document processing (¥0.5/1M input tokens).

Tip 4

New users get generous free token allocations for each model.

Tip 5

Alibaba Cloud's international platform is also available at alibabacloud.com for global users.

Tip 6

Qwen 2.5 72B is one of the best open-source models and can be self-hosted.

China Access Solutions

Access Solution

Qwen is directly accessible in China, with native support for Chinese phone registration and Alipay payment. No VPN is needed. It is the best choice for Chinese-language applications.

Code Example

Python
from openai import OpenAI

client = OpenAI(
    api_key="sk-xxx",  # Bailian API key
    base_url="https://dashscope.aliyuncs.com/compatible-mode/v1"
)

response = client.chat.completions.create(
    model="qwen-plus",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "用中文解释量子计算的基本原理"}
    ]
)

print(response.choices[0].message.content)

# --- cURL example ---
# curl https://dashscope.aliyuncs.com/compatible-mode/v1/chat/completions \
#   -H "Authorization: Bearer sk-xxx" \
#   -H "Content-Type: application/json" \
#   -d '{"model":"qwen-plus","messages":[{"role":"user","content":"你好"}]}'

Rate Limits

Model | Requests per minute (RPM) | Tokens per minute (TPM)
Qwen-Turbo | 500 | 500K
Qwen-Plus | 200 | 200K
Qwen-Max | 120 | 120K

These are the default-tier limits. You can apply for higher limits.
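
One way to stay under the default RPM limits is to throttle on the client side. The following is a minimal sliding-window sketch, not an official SDK feature: the class name SlidingWindowLimiter and the lowercase model IDs in RPM_LIMITS are assumptions, and it tracks request counts only, not the TPM limits.

```python
import time
from collections import deque

# Default requests-per-minute limits from the list above (model IDs assumed).
RPM_LIMITS = {"qwen-turbo": 500, "qwen-plus": 200, "qwen-max": 120}


class SlidingWindowLimiter:
    """Allow at most max_requests calls within any window_seconds span."""

    def __init__(self, max_requests, window_seconds=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._clock = clock      # injectable for testing
        self._sleep = sleep      # injectable for testing
        self._timestamps = deque()

    def acquire(self):
        """Block until a request slot is free, then record the request."""
        while True:
            now = self._clock()
            # Drop timestamps that have aged out of the window.
            while (self._timestamps
                   and now - self._timestamps[0] >= self.window_seconds):
                self._timestamps.popleft()
            if len(self._timestamps) < self.max_requests:
                self._timestamps.append(now)
                return
            # Wait until the oldest recorded request leaves the window.
            self._sleep(self.window_seconds - (now - self._timestamps[0]))
```

A caller would create one limiter per model, for example SlidingWindowLimiter(RPM_LIMITS["qwen-plus"]), and call acquire() before each API request. Handling rate-limit error responses with retries is still advisable, since the server's accounting may differ from this local estimate.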

Recommended Use Cases

Chinese language tasks
Document analysis
Code generation
Enterprise applications
Multilingual translation

Last Updated: 2026-02-10