Skip to content
New · the open voice benchmark is liveRead it
cantari
Docs

Documentation

How the studio works, in plain language. Every guide describes the product as it runs today: real limits, real numbers, nothing aspirational.

Start here

Tools

Cantari Scribe

Platform

Trust & safety

Why Cantari Has No Children's Voices

No child stock voices, no cloning minors, no child-like presets. The five reasons: consent, privacy law, voiceprint law, replica law, and the abuse record.

Updated June 11, 2026
Voice cloning consent and the law

Why consent is the legal foundation of voice cloning: publicity rights, voiceprint statutes with real damages, GDPR biometrics, and what genuine permission looks like.

Updated June 11, 2026
Cloning celebrities, politicians, and other public figures

Famous voices are the most legally protected voices, not the least. The ELVIS Act, the FCC's robocall ruling, the New Hampshire deepfake fine, and where parody actually stands.

Updated June 11, 2026
Voices of the dead: who can say yes to a posthumous clone

Death does not end a voice's legal protection. Postmortem publicity rights, California's digital replica law, how estates actually license voices, and what we allow.

Updated June 11, 2026
Deepfake disclosure laws: when synthetic audio must say so

The labeling rules arriving for AI audio: the EU AI Act's transparency article, US state election-deepfake laws, platform policies, and where Cantari already stands.

Updated June 11, 2026
Voice cloning scams: what they sound like and how to protect your family

The family-emergency voice scam, the case that reached Congress, what the FTC and FBI advise, the passphrase habit worth adopting today, and what we do on our side.

Updated June 11, 2026
Voice banking: preserving your own voice, and consent done right

The best case for voice cloning: people banking their voices ahead of ALS, performers licensing replicas on their own terms, and how you will preserve a voice here when cloning ships (coming soon).

Updated June 11, 2026

Glossary

What is text to speech? TTS, explained simply

Text to speech (TTS) is software that turns written words into spoken audio. What modern neural engines actually do, and how to size a script in minutes.

Updated June 11, 2026
What is speech to text? Transcription, explained simply

Speech to text (STT) turns recorded speech into written text. How modern transcription models work, what limits their accuracy, and what they cannot do yet.

Updated June 11, 2026
What is a TTS cue? Bracketed emotion directions, explained

A cue is a stage direction in square brackets, like [whispering], that tells a voice engine how to deliver the next line. Which engines act them, and which ignore them.

Updated June 11, 2026
What is speakable text normalization?

Normalization rewrites numbers, dates, and abbreviations into the words a voice should actually say: Dr. into Doctor, 1982 into nineteen eighty-two. Why it matters for long-form audio.

Updated June 11, 2026
What is voice drift in AI narration?

Voice drift is the slow change in a synthetic narrator's tone, pacing, or energy across long-form audio. Why it happens, and how chaptered workflows keep hour seven sounding like hour one.

Updated June 11, 2026
What is TTS latency? Time to first byte vs full audio

TTS latency is how long an engine takes to speak. The number depends entirely on where you stop the clock: the first streamed byte, or the complete audio file.

Updated June 11, 2026
What is a Quality Elo score for AI voices?

Quality Elo is a listener-vote rating for voice engines, borrowed from chess: blind pairwise comparisons produce a score nobody can self-award. How to read one, with the attribution.

Updated June 11, 2026
What is voice cloning consent, and why does it matter?

Voice cloning consent is the speaker's explicit permission before their voice is cloned. What good consent practice looks like, and how it is enforced here.

Updated June 11, 2026
Dubbing vs subtitling: what is the difference?

Dubbing replaces the voice track in a new language; subtitling keeps the original audio and translates on screen. The honest trade-offs, and which one this studio does.

Updated June 11, 2026
What is zero data retention (ZDR)?

Zero data retention means an AI provider processes your request and keeps nothing: no stored prompts, no stored outputs, no training on your text. What ZDR covers, and what it does not.

Updated June 11, 2026

Looking for an API reference? There is no public API yet. API docs ship the day the API does.