7 78 627

Full Name PRO

Gatozu35

AI & ML interests

Text-to-Speech, Voice Conversion

Recent Activity

liked a dataset about 22 hours ago

bosonai/AudioTokenBench

liked a model 5 days ago

sarulab-speech/sidon-v0.1

reacted to fdaudens's post with 🚀 5 days ago

AudioRAG is becoming real! Just built a demo with ColQwen-Omni that does semantic search on raw audio, no transcription needed. Drop in a podcast, ask your question, and it finds the exact chunks where it happens. You can also get a written answer. What’s exciting: it skips transcription, making it faster and better at capturing emotion, ambient sound, and tone, surfacing results text search would miss. - Demo: https://huggingface.co/spaces/fdaudens/colqwen-omni-demo - Blog post from ColQwen team: https://huggingface.co/blog/manu/colqwen-omni-omnimodal-retrieval

View all activity

Organizations

liked a dataset about 22 hours ago

bosonai/AudioTokenBench

Viewer • Updated 3 days ago • 3.15k • 45 • 2

liked a model 5 days ago

sarulab-speech/sidon-v0.1

Updated 6 days ago • 4

reacted to fdaudens's post with 🚀 5 days ago

Post

2081

AudioRAG is becoming real! Just built a demo with ColQwen-Omni that does semantic search on raw audio, no transcription needed.

Drop in a podcast, ask your question, and it finds the exact chunks where it happens. You can also get a written answer.

What’s exciting: it skips transcription, making it faster and better at capturing emotion, ambient sound, and tone, surfacing results text search would miss.

- Demo: fdaudens/colqwen-omni-demo
- Blog post from ColQwen team: https://huggingface.co/blog/manu/colqwen-omni-omnimodal-retrieval

1 reply

reacted to fdaudens's post with 🚀 7 days ago

Post

2081

AudioRAG is becoming real! Just built a demo with ColQwen-Omni that does semantic search on raw audio, no transcription needed.

Drop in a podcast, ask your question, and it finds the exact chunks where it happens. You can also get a written answer.

What’s exciting: it skips transcription, making it faster and better at capturing emotion, ambient sound, and tone, surfacing results text search would miss.

- Demo: fdaudens/colqwen-omni-demo
- Blog post from ColQwen team: https://huggingface.co/blog/manu/colqwen-omni-omnimodal-retrieval