
Full Name PRO

Gatozu35

AI & ML interests

Text-to-Speech, Voice Conversion

Recent Activity

liked a dataset 2 days ago
bosonai/AudioTokenBench
liked a model 6 days ago
sarulab-speech/sidon-v0.1
reacted to fdaudens's post with 🚀 6 days ago
AudioRAG is becoming real! Just built a demo with ColQwen-Omni that does semantic search on raw audio, no transcription needed. Drop in a podcast, ask your question, and it finds the exact chunks where it happens. You can also get a written answer.

What’s exciting: it skips transcription, making it faster and better at capturing emotion, ambient sound, and tone, surfacing results text search would miss.

- Demo: https://huggingface.co/spaces/fdaudens/colqwen-omni-demo
- Blog post from ColQwen team: https://huggingface.co/blog/manu/colqwen-omni-omnimodal-retrieval

Organizations

Hugging Face Discord Community, AI Starter Pack

Gatozu35's datasets (5)

Gatozu35/bg3

Preview • Updated May 8, 2024 • 14

Gatozu35/test-webdataset

Viewer • Updated Apr 18, 2024 • 1 • 27

Gatozu35/test

Updated Apr 13, 2024 • 3

Gatozu35/DNSMOS-TTS

Viewer • Updated Jan 31, 2024 • 13.1k • 27 • 1

Gatozu35/novelupdates

Updated Mar 9, 2023 • 3