Documentation Index
Fetch the complete documentation index at: https://mintlify.com/KittenML/KittenTTS/llms.txt
Use this file to discover all available pages before exploring further.
KittenTTS runs on CPU only. No GPU is required or supported.
Load a model
KittenTTS offers three model sizes. Choose based on your quality and speed requirements:Models are downloaded from Hugging Face on first use and cached locally. You can specify a custom cache directory:
Generate audio and save to file
Generate speech from text and save the result as a WAV file:
generate() returns a NumPy array at 24 kHz. You can use any audio library that accepts NumPy arrays, or use generate_to_file() to skip the intermediate step:Advanced usage
Next steps
How it works
Learn about the ONNX runtime and how KittenTTS processes text into audio.
Voices
Explore the available voices and how to choose the right one for your use case.
Models overview
Compare model sizes and understand the trade-offs between quality and speed.
API reference
Full reference for the
KittenTTS class, parameters, and return values.