Kitten TTS Nano 0.8 (MLX)

This is an MLX conversion of the KittenTTS Nano 0.8 model.

Source

  • Original model: KittenML/kitten-tts-nano-0.8-fp32
  • License: Apache-2.0

Usage

This model is meant to be used with the mlx-audio KittenTTS implementation.

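from mlx_audio.tts.utils import load_model
import numpy as np
import soundfile as sf

model = load_model("mlx-community/kitten-tts-nano-0.8")

# generate() yields one segment at a time. Collect every segment before
# writing, so that later segments do not overwrite earlier ones in out.wav.
segments = [
    np.array(out.audio)
    for out in model.generate("This model runs with MLX.", voice="Jasper")
]
sf.write("out.wav", np.concatenate(segments), model.sample_rate)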

Notes:

  • phonemizer (espeak backend) is required for generate().
  • Available voices: Bella, Jasper, Luna, Bruno, Rosie, Hugo, Kiki, Leo.
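The two notes above can be turned into a small pre-flight check that runs before the model is loaded. This is only a sketch: resolve_voice and phonemizer_installed are hypothetical helpers, not part of mlx-audio, and the case-insensitive matching assumes the capitalized spellings listed above are the ones generate() expects.

```python
import importlib.util

# Voice names exactly as listed in this model card.
VOICES = ("Bella", "Jasper", "Luna", "Bruno", "Rosie", "Hugo", "Kiki", "Leo")


def resolve_voice(name: str) -> str:
    """Return the listed spelling of a voice name, ignoring case and whitespace.

    Hypothetical helper: raises ValueError for names not in VOICES.
    """
    wanted = name.strip().lower()
    for voice in VOICES:
        if voice.lower() == wanted:
            return voice
    raise ValueError(f"Unknown voice {name!r}; expected one of: {', '.join(VOICES)}")


def phonemizer_installed() -> bool:
    """Report whether the phonemizer package can be imported.

    Hypothetical helper: it does not verify that the espeak backend
    itself is installed on the system.
    """
    return importlib.util.find_spec("phonemizer") is not None
```

The value returned by resolve_voice can be passed as the voice argument to generate(), and a False result from phonemizer_installed is a hint to install phonemizer before calling it.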

Conversion Details

  • Converted from ONNX weights.
  • Output shapes and durations match the reference ONNX model for test inputs.
  • Weights: Safetensors, 14.6M parameters, F32.