Kokoro is an open-weight TTS model with 82 million parameters. As of January 31st, 2025, Kokoro was the most-liked TTS model and the most-liked TTS space on Hugging Face. This demo only showcases English, but you can directly use the model to access other languages.

Voice

Quality and availability vary by language

Hardware

GPU is usually faster, but has a usage quota

0.5 2

πŸ’‘ Customize pronunciation with Markdown link syntax and /slashes/ like [Kokoro](/kˈOkΙ™ΙΉO/) πŸ’¬ To adjust intonation, try punctuation ;:,.!?—…"()β€œβ€ or stress ˈ and ˌ ⬇️ Lower stress [1 level](-1) or [2 levels](-2) ⬆️ Raise stress 1 level [or](+2) 2 levels (only works on less stressed, usually short words)