ElevenLabs
Voice Synthesisby ElevenLabs
Industry-leading AI voice synthesis and cloning platform with the most natural-sounding TTS and voice cloning capabilities.
Registration & API Key Steps
Install SDK
Install the official Python or TypeScript SDK.
Upgrade for Commercial Use
Free is non-commercial. Upgrade to Starter ($5/mo) for commercial rights.
Pricing
| Tier | Price | Features |
|---|---|---|
| Free | Free (10,000 credits/mo) | ~20,000 chars TTS/month, Pre-made voices, Non-commercial only |
| Starter | $5/month | 30,000 credits (~60K chars), Commercial license, Voice cloning |
| Creator | $22/month | 100,000 credits (~200K chars), Professional cloning, Priority support |
| Pro | $99/month | 500,000 credits, 44.1kHz PCM API, All voices & models |
| Scale | $330/month | 2M credits (~4M chars), Multi-seat, Low-latency TTS |
Application Tips
Free Tier Includes API
ElevenLabs includes API access even in the free tier. Great for testing.
Voice Cloning Tips
Provide 30+ minutes of clean audio for best cloning. Creator plan or higher required.
Streaming Support
Real-time streaming TTS available for conversational AI applications.
China Access Solutions
API Proxy
Access via proxy server in supported regions.
Direct Access
May be accessible directly in some regions of China.
Code Example
import { ElevenLabsClient } from 'elevenlabs';
import fs from 'fs';
const client = new ElevenLabsClient({ apiKey: process.env.ELEVENLABS_API_KEY });
async function generateSpeech() {
const audio = await client.textToSpeech.convert('21m00Tcm4TlvDq8ikWAM', {
text: 'Hello! Welcome to our AI-powered application.',
model_id: 'eleven_multilingual_v2',
voice_settings: { stability: 0.5, similarity_boost: 0.75 },
});
const chunks: Buffer[] = [];
for await (const chunk of audio) { chunks.push(Buffer.from(chunk)); }
fs.writeFileSync('output.mp3', Buffer.concat(chunks));
}
generateSpeech();Rate Limits
| Tier | Limits |
|---|---|
| Free | ~20K chars/month |
| Pro | ~1M chars/month, higher concurrency |
Recommended Use Cases
Related API Guides
OpenAI GPT-4o / GPT-4.1 / o3
OpenAI
OpenAI's flagship LLM family including GPT-4o for multimodal tasks, GPT-4.1 for long-context coding, and o3 for advanced reasoning. Industry-leading models with the largest developer ecosystem.
Anthropic Claude (Sonnet 4.5 / Opus 4.5)
Anthropic
Anthropic's Claude model family excels in nuanced reasoning, safety, and long-context tasks. Claude Sonnet 4.5 offers the best balance of cost and performance, while Opus 4.5 delivers frontier intelligence.
Google Gemini (2.5 Pro / 2.5 Flash)
Google's Gemini models offer a generous free tier, 1M token context window, and strong multimodal capabilities. Gemini 2.5 Pro leads in reasoning, while Flash models provide cost-effective alternatives.
Meta Llama 4 (Scout / Maverick)
Meta
Meta's open-source Llama 4 models are free to use and available through multiple cloud providers. Llama 4 Scout and Maverick offer competitive performance at extremely low cost through partner APIs.