Kitten TTS Nano 0.8 (MLX)

This is an MLX conversion of the KittenTTS Nano 0.8 model.

Source

Original model: KittenML/kitten-tts-nano-0.8-fp32
License: Apache-2.0

Usage

This model is meant to be used with the mlx-audio KittenTTS implementation.

from mlx_audio.tts.utils import load_model
import soundfile as sf

model = load_model("mlx-community/kitten-tts-nano-0.8")

# The generator yields one segment at a time.
for out in model.generate("This model runs with MLX.", voice="Jasper"):
    sf.write("out.wav", out.audio, model.sample_rate)

Notes:

phonemizer (espeak backend) is required for generate().
Available voices: Bella, Jasper, Luna, Bruno, Rosie, Hugo, Kiki, Leo.

Conversion Details

Converted from ONNX weights.
Output shapes and durations match the reference ONNX model for test inputs.

Downloads last month: 72

Safetensors

Model size

14.6M params

Tensor type

F32

MLX

Hardware compatibility

Quantized

Collection including mlx-community/kitten-tts-nano-0.8

KittenTTS

Collection

All MLX conversions of KittenTTS (nano/micro/mini) across fp32, fp16, bf16, and 4/5/6/8-bit quantizations. • 21 items • Updated 12 days ago • 2