Audio, Speech & Music - a rocari Collection

rocari 's Collections

Image Generation

Audio, Speech & Music

Agents, Planning & Tools

Audio, Speech & Music

updated Jan 4, 2024

facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 76.4k • 982
openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 5.15M • • 5.73k
jonatasgrosman/whisper-large-pt-cv11

Automatic Speech Recognition • Updated Dec 22, 2022 • 23 • 16
openai/whisper-large-v2

Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 72.8k • 1.8k
Incremental FastPitch: Chunk-based High Quality Text to Speech

Paper • 2401.01755 • Published Jan 3, 2024 • 10