Running on Zero • IndexTTS 2 Demo • 733 • Generate expressive voice from text using audio reference
litagin/anime-whisper • Automatic Speech Recognition • 0.8B • Updated Nov 24, 2024 • 5.14k • 118