AI Model API Directory
One-stop guide for 30+ mainstream AI model APIs. Step-by-step registration tutorials, pricing comparisons, free credits, code examples, and China access solutions.
OpenAI GPT-4o / GPT-4.1 / o3
OpenAIOpenAI's flagship LLM family including GPT-4o for multimodal tasks, GPT-4.1 for long-context coding, and o3 for advanced reasoning. Industry-leading models with the largest developer ecosystem.
Anthropic Claude (Sonnet 4.5 / Opus 4.5)
AnthropicAnthropic's Claude model family excels in nuanced reasoning, safety, and long-context tasks. Claude Sonnet 4.5 offers the best balance of cost and performance, while Opus 4.5 delivers frontier intelligence.
Google Gemini (2.5 Pro / 2.5 Flash)
GoogleGoogle's Gemini models offer a generous free tier, 1M token context window, and strong multimodal capabilities. Gemini 2.5 Pro leads in reasoning, while Flash models provide cost-effective alternatives.
Meta Llama 4 (Scout / Maverick)
MetaMeta's open-source Llama 4 models are free to use and available through multiple cloud providers. Llama 4 Scout and Maverick offer competitive performance at extremely low cost through partner APIs.
Mistral AI (Mistral Large / Small / Codestral)
Mistral AIFrench AI company offering efficient open and commercial models. Mistral Large for complex reasoning, Mistral Small for cost-effective tasks, and Codestral for code generation. Known for strong European data privacy.
DeepSeek (V3 / R1)
DeepSeekChinese AI lab offering extremely cost-effective models. DeepSeek-V3 for general tasks and DeepSeek-R1 for advanced reasoning. Known for breakthrough price-performance ratio and open-source availability.
Alibaba Qwen (Qwen-Max / Qwen-Plus)
Alibaba CloudAlibaba's Qwen series excels in Chinese and multilingual tasks. Available through Alibaba Cloud's Bailian (DashScope) platform. Qwen 2.5 is also open-source for self-hosting.
Baidu ERNIE (ERNIE 4.0 / Speed / Lite)
BaiduBaidu's ERNIE models are deeply integrated with the Qianfan platform. ERNIE 4.0 is the flagship model, while Speed and Lite models offer free tiers. Strong in Chinese language understanding.
ByteDance Doubao (Doubao-Pro / Lite)
ByteDanceByteDance's Doubao models are available through Volcano Engine (Ark platform). Known for aggressive pricing and generous free credits. Doubao-Pro offers strong performance at very low cost.
iFlytek SparkDesk (Spark 4.0 / Lite / Pro)
iFlytekiFlytek's SparkDesk models are competitive in Chinese language tasks and offer the permanently free Spark Lite API. Known for strong speech and NLP capabilities built on iFlytek's long AI history.
Zhipu AI GLM (GLM-4.7 / GLM-4-Flash)
Zhipu AIZhipu AI's GLM series offers strong Chinese-English bilingual capabilities. GLM-4-Flash is free, GLM-4.7 provides frontier performance. The platform also offers free vision, reasoning, and image generation models.
Moonshot AI Kimi (K2 / K2.5)
Moonshot AIMoonshot AI's Kimi models are known for long-context capabilities (256K) and OpenAI-compatible API. Kimi K2 is a trillion-parameter open-source model optimized for agent tasks.
MiniMax (M2 / Text-01 / Hailuo AI)
MiniMaxMiniMax is a pioneer in Asian LLMs offering multimodal capabilities across text, audio, video, image, and music. Known for natural voice synthesis and the consumer product Hailuo AI.
01.AI Yi (Yi-Large / Yi-Medium)
01.AI01.AI's Yi series models offer strong multilingual performance and competitive pricing. Yi-Large is the flagship model. OpenAI-compatible API makes migration easy. Open-source models available.
Cohere Command R+ / Embed / Rerank
CohereCohere specializes in enterprise AI with Command R+ for generation, Embed for embeddings, and Rerank for search optimization. Strong focus on RAG (Retrieval-Augmented Generation) workflows.
DALL-E 3
OpenAIOpenAI's most advanced image generation model, capable of creating highly detailed and accurate images from text descriptions with excellent prompt following.
Midjourney
Midjourney Inc.Industry-leading AI image generation known for stunning artistic quality. No official public API yet; access via Discord bot or third-party API services.
Stable Diffusion API
Stability AIStability AI's official API for Stable Diffusion models including SD 3.5, Stable Image Core, and Stable Image Ultra. Open-source models also available for self-hosting.
Imagen 3
GoogleGoogle's state-of-the-art image generation model, available through the Gemini API and Vertex AI. Excels in photorealistic images with SynthID watermarking.
Ideogram
Ideogram AIAI image generation specializing in accurate text rendering within images. Excellent for logos, posters, and designs requiring readable text.
Whisper
OpenAIOpenAI's automatic speech recognition model that transcribes and translates audio in multiple languages with high accuracy.
ElevenLabs
ElevenLabsIndustry-leading AI voice synthesis and cloning platform with the most natural-sounding TTS and voice cloning capabilities.
OpenAI TTS
OpenAIOpenAI's TTS API with natural-sounding voices, real-time streaming, and multiple quality tiers.
Suno AI
Suno Inc.AI music generation creating full songs with vocals, lyrics, and instrumentation from text prompts.
Runway Gen-3/Gen-4
Runway MLLeading AI video generation with Gen-3 and Gen-4 models. Text-to-video, image-to-video, and editing.
Pika Labs
PikaAI video generation platform with text-to-video and image-to-video capabilities. API access available through Fal.ai partnership.
Luma AI Dream Machine
Luma AIAI video and image generation platform with Dream Machine for video and Photon for images. Offers both web and API access.
GitHub Copilot
GitHub / MicrosoftAI-powered code completion and chat assistant integrated into IDEs. Supports extensions via MCP (Model Context Protocol).
Hugging Face Inference API
Hugging FaceAccess 800,000+ open-source AI models through a unified API. Supports text, image, audio, and more with pay-as-you-go pricing.
Replicate
ReplicateRun open-source AI models with a simple API. Pay-per-use pricing for thousands of models including image, video, audio, and text generation.