cantari
Engine

Grok Voice

xAI's voice engine, with five personas. It reads text plainly and ignores [cues].

5 personas · English
Generated by Grok Voice
5 real voices · previews below
The numbers

Scores, speed, and the real rate.

The same trust rows as the open benchmark: a third-party quality score, our own measured latency, and the raw engine cost in the open.

Quality Elo*: 1197
Measured latency: 2444 ms (measured 2026-06-10)
Languages: 1 language (English)
Follows [cues]: No. Plain read, ignores cues.
Engine cost: $15/M in · $0 out (rate checked 2026-06-11)

The provider’s published rate when we last checked. Rates move, and when they do we update this row. It’s here so you can weigh our flat pricing against the raw cost underneath, instead of taking our word for it.
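To make the raw rate concrete, here is a minimal arithmetic sketch. It assumes "/M" means per one million billed input units; the page does not say whether the provider bills in characters or tokens, so the function name and parameters are hypothetical and the unit is left generic.

```python
# Hypothetical helper: estimates raw engine cost from the rate row above.
# Assumption: "$15/M in" is USD per one million billed input units
# (the unit itself is not stated on this page), and output is free ($0 out).

def engine_cost_usd(
    input_units: int,
    rate_in_per_million: float = 15.0,
    output_units: int = 0,
    rate_out_per_million: float = 0.0,
) -> float:
    """Return the estimated raw engine cost in USD."""
    cost_in = (input_units / 1_000_000) * rate_in_per_million
    cost_out = (output_units / 1_000_000) * rate_out_per_million
    return cost_in + cost_out


# A quarter of a million input units at $15/M.
print(engine_cost_usd(250_000))  # 3.75
```

Because the rate row is updated when the provider's price moves, the rates are passed as parameters rather than hard-coded into the calculation.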

Rights
Commercial use; outputs are yours

* Quality Elo from the Artificial Analysis Speech Arena, retrieved June 10, 2026. The rating is based on user votes in the arena; the top-rated model overall is Fun-Realtime-TTS at 1228.06.
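For a rough sense of what the gap between 1197 and 1228.06 means, the sketch below applies the classic Elo expected-score formula. This is an assumption: the arena may compute its ratings with a different model, so treat the result as an illustration of scale, not as the arena's own figure. The function name is hypothetical.

```python
# Classic Elo expected-score formula, used here only as an illustration.
# Assumption: the arena's ratings behave like standard 400-point-scale Elo.

def elo_expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A is preferred over B under the classic Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


top_model = 1228.06   # Fun-Realtime-TTS, per the footnote
grok_voice = 1197.0   # Grok Voice, per the Quality Elo row

p_top = elo_expected_score(top_model, grok_voice)
print(f"Top model preferred in about {p_top:.0%} of head-to-head votes")
```

Under that assumption, a gap of about 31 points corresponds to the top model being preferred in roughly 54% of direct comparisons, which is close to an even split.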

Latency is our own wall-clock time to full audio, measured 2026-06-10 on the same path that serves the studio. A measurement, not a server SLA.
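A wall-clock "time to full audio" measurement of this kind can be sketched as follows. This is not the harness used for the figure above. `synthesize` stands in for whatever call returns the complete audio, and `fake_synthesize` is a hypothetical stub so the example runs without any engine.

```python
import time
from typing import Callable, Tuple


def time_to_full_audio_ms(
    synthesize: Callable[[str], bytes], text: str
) -> Tuple[bytes, float]:
    """Time one synthesis call from request to the last audio byte."""
    start = time.perf_counter()
    audio = synthesize(text)  # blocks until the full audio is returned
    elapsed_ms = (time.perf_counter() - start) * 1000.0
    return audio, elapsed_ms


def fake_synthesize(text: str) -> bytes:
    """Hypothetical stand-in for an engine call: waits, then returns bytes."""
    time.sleep(0.05)
    return b"\x00" * len(text)


audio, ms = time_to_full_audio_ms(fake_synthesize, "hello")
print(f"{len(audio)} bytes in {ms:.0f} ms")
```

A single run like this includes network and queueing time on the caller's path, which is why the result is a measurement and not a guarantee.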

Voices

Every voice Grok Voice ships.

Press play to hear each voice: real output from this engine, recorded unedited. What you hear is what it produced.

Eve

Expressive lead persona

Ara

Calm and friendly

Rex

Assured, direct male

Sal

Laid back and neutral

Leo

Bold, projected read

Best for

Where Grok Voice earns its place.

Character and persona reads in English.

Hear it in the studio.

Open the studio and generate real audio through Grok Voice. No sign-up required to listen.