The Cheapest Text to Speech API in 2026 (With Real Numbers)
Side-by-side pricing for every major TTS API: ElevenLabs, OpenAI, Google, AWS, Azure, and Audexum. Real per-character costs, free tiers, and code.
If you bill by API call, run a high-volume pipeline, or just want to stop overpaying for speech synthesis, the pricing differences between providers are significant enough to matter. This post cuts through marketing language and gives you actual per-character costs, hidden limits, and a fair comparison so you can pick the right API for your budget.
How TTS APIs Actually Price Their Service
Most providers use one of three models:
- Per character — you pay for every character synthesized, including spaces and punctuation
- Per 1,000 characters — same thing, just expressed differently
- Per minute of audio — harder to compare directly; depends on your speaking rate (~800–1,000 chars/min at normal pace)
Free tiers are almost always "per month, resets on billing date" — not cumulative. If you don't use them, you lose them.
TTS API Pricing Comparison Table (2026)
| Provider | Free Tier | Entry Paid | €/1M chars | Notes |
|---|---|---|---|---|
| Audexum | 10K chars/mo | €4 / 100K | ~€8.00 | PAYG available; 43 voices, 33 langs |
| ElevenLabs | 10K chars/mo | $5 / 30K | ~$22–$330 | Price drops sharply at scale |
| OpenAI TTS | None | None | $15.00 | Standard model; HD at $30/1M |
| Google Cloud TTS | 1M chars/mo (WaveNet) | $4 / 1M | $4.00–$16.00 | Depends on voice tier |
| AWS Polly | 5M chars (12-mo trial) | $4 / 1M | $4.00 | Neural voices $16/1M |
| Azure TTS | 500K chars/mo (F0) | $1 / 500K | $4.00–$16.00 | Neural voices more expensive |
| Murf | None | $19/mo | Not per-char | Subscription only; 2M chars cap |
Key takeaway: At low volumes (under 100K chars/month), Audexum and ElevenLabs are competitive. Above 500K chars/month, Audexum's PAYG rate of €8/1M chars undercuts ElevenLabs and OpenAI — OpenAI's $15/1M is roughly 2× more. The hyperscalers (AWS/Azure standard tiers) are cheaper per character, but ship far fewer voices and languages and no ongoing free tier.
Free Tier Breakdown: What You Actually Get
"Free tier" marketing is often misleading. Here is what each provider actually gives you monthly:
- Audexum: 10,000 characters. That is roughly 7 minutes of synthesized audio at normal speaking pace. Sign up at audexum.com/signup — no credit card required.
- ElevenLabs: 10,000 characters on the free plan, but you are limited to 3 custom voices and cannot use the API commercially without a paid plan.
- Google Cloud TTS: 1 million characters/month for WaveNet voices — genuinely useful for prototyping, but requires a billing account attached.
- AWS Polly: 5 million characters for the first 12 months only (new accounts). After 12 months, it ends.
- Azure: 500,000 chars/month on the F0 tier, but limited to standard (non-neural) voices.
- OpenAI TTS: No free tier. You pay from character zero.
For ongoing free usage with no credit card required, Audexum and Google Cloud offer the most transparent options. Google's trial is more generous in volume but requires a billing account setup.
What Happens at Scale?
The pricing gap widens significantly at higher volumes. Here is a concrete example for a production pipeline processing 5 million characters per month:
| Provider | Monthly Cost (5M chars) |
|---|---|
| Audexum PAYG | ~€15 |
| Google Cloud WaveNet | $20 |
| AWS Polly Standard | $20 |
| Azure Neural | $80 |
| ElevenLabs Creator ($22/mo) | $22 + overages or $99/mo |
| OpenAI TTS | $75 |
At 5M chars/month, Audexum's PAYG tier costs roughly the same as AWS Polly standard voices, but includes 43 voices across 33 languages. The comparison becomes starker against OpenAI ($75) or Azure Neural ($80).
Making an API Call: Audexum REST Example
The API uses Bearer auth and returns audio as a binary stream:
curl -X POST https://audexum.com/api/synthesize \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"text": "The cheapest TTS API depends on your volume and voice requirements.",
"voice": "af_heart",
"format": "mp3"
}' \
--output output.mp3Replace YOUR_API_KEY with the key from your dashboard. Voice IDs are listed in the API docs.
Python Example
import requests
API_KEY = "your_api_key_here"
TEXT = "Lower your TTS costs without sacrificing voice quality."
response = requests.post(
"https://audexum.com/api/synthesize",
headers={
"Authorization": f"Bearer {API_KEY}",
"Content-Type": "application/json",
},
json={
"text": TEXT,
"voice": "af_heart",
"format": "mp3",
},
)
with open("output.mp3", "wb") as f:
f.write(response.content)
print(f"Characters used: {len(TEXT)}")Full API reference is at audexum.com/docs.
Hidden Costs to Watch For
Rate Limits
Free tiers often have tight concurrency limits. ElevenLabs free plan is single-threaded (one request at a time). AWS Polly free tier has the same default limits as paid, which is a genuine advantage. Audexum's limits are documented at audexum.com/docs — check your tier before building a production pipeline.
Voice Selection Restrictions
Several providers gate premium voices behind higher plans:
- ElevenLabs: cloning and professional voices require $22+/mo plans
- Azure: Neural voices cost 4× more than standard voices
- Google: WaveNet and Studio voices are priced separately
Audexum includes all 43 voices on every plan, including free. See pricing.
Audio Format Fees
Some APIs charge more for higher-quality formats. OpenAI's standard TTS is $15/1M; their HD model is $30/1M. Audexum includes MP3 and WAV output regardless of plan.
Which Provider Should You Choose?
- If you need zero budget to start: Google Cloud (1M chars free monthly) or Audexum (10K free, simplest setup).
- If you want predictable flat-rate pricing: Audexum at €8/1M PAYG (credits never expire), or AWS Polly Standard at $4/1M.
- If you need OpenAI ecosystem integration: OpenAI TTS is convenient but expensive — $15/1M with no free tier.
- If voice quality and cloning are your priority: ElevenLabs' voice quality is excellent but the cost at scale is a significant factor. Audexum vs ElevenLabs comparison covers this in detail.
- If you are building a Discord bot or game: The TTS API for Discord bots guide covers integration patterns with low-latency audio streaming.
Referral: Get Extra Free Characters
Audexum runs a referral program: every account gets a referral code at signup. When someone uses your code, both accounts receive +10,000 free characters. If you have a Discord community, newsletter, or developer audience, sharing your code is a straightforward way to extend your free tier before spending anything.
By Petar, founder of Audexum. Running a bootstrapped TTS SaaS gives you strong opinions about API pricing.