Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1.4
TFLOPS
7
78
627
Full Name
PRO
Gatozu35
Follow
mosaicboost's profile picture
Joseluir's profile picture
syanghugging's profile picture
12 followers
·
109 following
AI & ML interests
Text-to-Speech, Voice Conversion
Recent Activity
liked
a dataset
2 days ago
bosonai/AudioTokenBench
liked
a model
6 days ago
sarulab-speech/sidon-v0.1
reacted
to
fdaudens
's
post
with 🚀
6 days ago
AudioRAG is becoming real! Just built a demo with ColQwen-Omni that does semantic search on raw audio, no transcription needed. Drop in a podcast, ask your question, and it finds the exact chunks where it happens. You can also get a written answer. What’s exciting: it skips transcription, making it faster and better at capturing emotion, ambient sound, and tone, surfacing results text search would miss. - Demo: https://huggingface.co/spaces/fdaudens/colqwen-omni-demo - Blog post from ColQwen team: https://huggingface.co/blog/manu/colqwen-omni-omnimodal-retrieval
View all activity
Organizations
Gatozu35
's datasets
5
Sort: Recently updated
Gatozu35/bg3
Preview
•
Updated
May 8, 2024
•
14
Gatozu35/test-webdataset
Viewer
•
Updated
Apr 18, 2024
•
1
•
27
Gatozu35/test
Updated
Apr 13, 2024
•
3
Gatozu35/DNSMOS-TTS
Viewer
•
Updated
Jan 31, 2024
•
13.1k
•
27
•
1
Gatozu35/novelupdates
Updated
Mar 9, 2023
•
3