Free text to speech that says what free means.
Most free text to speech pages keep the catch below the fold. Ours fits on a receipt: about 10 premium minutes a month, unlimited generations on the open-weight Kokoro engine, and no card at the door.
- Premium engine minutes
- about 10 / mo
- Metered as
- 10,000 characters
- Kokoro generations
- Unlimited
- Card to start
- None
Converted by the house math: about 1,000 characters is about a minute of speech. The cap above is the one the generation route enforces, read from the same file.
Same line. Three deliveries.
The bracketed cue is an instruction the engine acts. Pick one and press play: each take is real Gemini Flash output, recorded unedited.
[whispering]Don't go in there... something moved.
Real Gemini Flash output, unedited. About 1,000 characters is a minute of audio.
Every limit we have, in one table.
Pages built on the word free tend to save this part for the checkout screen. Here it is at the top instead, next to exactly what paying adds.
| What | Free | Creator | Studio |
|---|---|---|---|
| Monthly price | $0, forever | $15 / mo | $49 / mo |
| Kokoro engine (open weight) | Unlimited, unmetered | Unlimited, unmetered | Unlimited, unmetered |
| Premium engine time | About 10 minutes (10,000 characters) | About 10 hours (600,000 characters) | About 30 hours (1,800,000 characters) |
| At the premium cap | Premium waits for the monthly reset; Kokoro keeps going | Same rule, a much bigger cap | Same rule, the biggest cap |
| Cloned voices you own | Not included | 1 owned cloned voice, exportable (coming soon) | 5 owned cloned voices (coming soon) |
The numbers come from the plan registry the billing code reads, not from a marketing document. The full cards live on pricing, and the unmetered lane has a page of its own.
An open-weight engine pays for the free lane.
Kokoro is an open-weight model: its weights are published, anyone can host it, and hosts competing on identical output have pushed the cost of serving a draft close to a rounding error. Counting those generations would buy us nothing, so we do not count them, on any plan.
The expressive premium engines bill real money for every character they speak, and that is precisely the part we meter. Your free premium minutes have a printed edge rather than an invisible one, and an upgrade buys more of the expensive thing, not permission to keep using the cheap one. That split is the entire economics of free text to speech here, and it is why the free lane has no expiry date.
The whole calculation, raw engine rates included, is written out in why our free tier has an unlimited engine.
Asked before signing up.
The questions a free tier should answer plainly, with the numbers it actually enforces.
Is it really free, or a trial in disguise?
Do I need a credit card to start?
What happens when I hit the monthly limit?
Can I use the free audio commercially?
Which engines does the free plan include?
Put your own script through it.
An email gets you in, the receipt above is the whole deal, and the first file you export is yours to keep.