W

Whisper

Speech Recognition

by OpenAI

OpenAI's automatic speech recognition model that transcribes and translates audio in multiple languages with high accuracy.

Registration & API Key Steps

1

Create OpenAI Account

Visit platform.openai.com and sign up. Same account for all OpenAI APIs.

Open link
2

Get API Key

Generate an API key from your account dashboard.

Open link
3

Use Free Credits or Add Payment

New accounts get $5 free credits (3 months). At $0.006/min, that is ~833 minutes of transcription.

Pricing

TierPriceFeatures
Whisper$0.006 / minuteHigh accuracy, Multi-language, Translation to English
GPT-4o Transcribe$0.006 / minuteBetter accuracy, Speaker diarization, Better punctuation
GPT-4o Mini Transcribe$0.003 / minute50% cheaper, Good accuracy, Best for bulk

Application Tips

Self-Host for High Volume

Whisper is open-source. Break-even at ~500 hours/month vs API.

Use GPT-4o Mini for Savings

GPT-4o Mini Transcribe costs 50% less at $0.003/min with good accuracy.

Max 25MB File Size

Supports mp3, mp4, wav, webm etc. Max 25MB. Split longer files.

China Access Solutions

API Proxy

Use third-party proxy services for Whisper API.

Self-Host Whisper

Deploy open-source Whisper locally. No API needed.

Code Example

JavaScript / TypeScript
import OpenAI from 'openai';
import fs from 'fs';

const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

async function transcribeAudio() {
  const transcription = await openai.audio.transcriptions.create({
    file: fs.createReadStream('audio.mp3'),
    model: 'whisper-1',
    language: 'en',
    response_format: 'verbose_json',
  });
  console.log('Text:', transcription.text);
}

transcribeAudio();

Rate Limits

TierLimits
Tier 150 RPM
Tier 3+500+ RPM

Recommended Use Cases

Meeting transcriptionPodcast transcriptionSubtitle generationVoice note conversionMulti-language audio processing
Last Updated: 2025-02

Related API Guides