Skip to main content

Audio AI

Compare pricing for speech-to-text, text-to-speech, voice cloning, and audio processing APIs.

Speech-to-Text (STT)

VendorServicePrice/minFree Tier
OpenAIWhisper API$0.0061 hour free
AssemblyAISpeech-to-Text$0.05-$0.1510 hours
DeepgramNova-2$0.0043-$0.15200 mins
GoogleCloud Speech-to-Text$0.025-$0.1560 mins
AWS TranscribeStandard$0.024-$0.101 hour
MicrosoftAzure AI Speech$0.026-$0.101 hour
SpeechmaticsReal-time$0.04-$0.1230 mins
RevAI Transcription$0.05-$0.15Pay-per-use

Text-to-Speech (TTS)

VendorQualityPrice per 1M charsFree Tier
OpenAITTS-1$15.00$5 credit
OpenAITTS-1 HD$30.00$5 credit
ElevenLabsMultilingual v2$300.0010k chars
ElevenLabsLanguage$120.0010k chars
ElevenLabsEnglish$90.0010k chars
Google CloudWaveNet2$16.001M chars
Google CloudStandard$4.001M chars
AWS PollyNeural$16.005M chars
AWS PollyStandard$4.005M chars
AzureNeural$16.000.5M chars
Murf AIStudio$69/mo10k chars
Murf AIAPI$0.004/char10k chars
WellSaid LabsCreative$49/moPay-per-use
WellSaid LabsAPI$40/100k charsTrial
Natural ReaderPro$99/yrPay-per-use

Voice Cloning

VendorPlanPrice/moFeatures
ElevenLabsPro$33030 custom voices
ElevenLabsStarter$9910 custom voices
Resemble AIBuild$99Unlimited voices
Resemble AIScale$499+ API, custom voices
DescriptCreator$121 voice
DescriptPro$245 voices
ResembleBasicFree1 voice, limited use

Audio Intelligence

VendorServicePriceNotes
AssemblyAIAudio Intelligence$0.05-$0.15/minPII detection, topics
DeepgramAudio Intelligence$0.05/30 secTopics, entities
GoogleSpeech-to-Text + ML$0.10-$0.30/minAdvanced features
OpenAIWhisper + GPT-4$0.006 + token costTranscription + analysis

Cost Estimator

Transcription (per hour)

ServiceQualityCost
WhisperStandard$0.36
Deepgram Nova-2High$0.26
AssemblyAIStandard$1.50
RevHuman + AI$2.00
SpeechmaticsReal-time$2.40

TTS (per 100k chars)

ServiceQualityCost
Polly StandardBasic$0.40
Google StandardBasic$0.40
ElevenLabs MultilingualPremium$30.00
Murf APIPro$0.40
WellSaid APIPro$40.00

Real-Time Voice Agents

VendorPrice/minUse Case
ElevenLabs$0.30-$0.60Conversational AI
Daily$0.003-$0.005/secReal-time calls
Agora$0.99-$3.99/1000 minsVoIP
Twilio$0.001/AI agentVoice assistants

Best For

NeedRecommended
Cost efficiencyWhisper API
High accuracyAssemblyAI, Deepgram Nova
Real-timeGoogle Cloud Speech
Voice cloningElevenLabs
EnterpriseAWS Transcribe
MultilingualElevenLabs, Google WaveNet
Affordable TTSPolly, Murf
Avatars + AudioDescript

Free Tier Comparison

ServiceFree Offering
Whisper1 hour
Deepgram200 mins
AssemblyAI10 hours
ElevenLabs10k chars, 3 voices
Murf10k chars
Google Cloud1M chars