Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sonal Kumar's picture
6 2

Sonal Kumar

sonalkum
Csplk's profile picture John6666's profile picture nishitanand's profile picture
ยท

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago
MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence
authored a paper about 2 months ago
Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge
authored a paper about 2 months ago
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
View all activity

Organizations

ZeroGPU Explorers's profile picture JSALT25-AuGI's profile picture Gamma Lab's profile picture

Papers 15

arxiv:2508.13992
arxiv:2507.08128
arxiv:2505.07365
arxiv:2503.03983

spaces 3

Configuration error
13

Synthio Stable Audio Open

๐Ÿ“š

Stable audio open model from Synthio paper.

Oct 26, 2024
Running on Zero
17

GAMA

๐ŸŒ

Generate text based on audio input and questions

Jul 25, 2024
Running on Zero
5

GAMA-IT

๐Ÿ†

Describe audio with questions

Jul 25, 2024

models 3

sonalkum/GAMA

Updated Jun 26

sonalkum/synthio-t5

Updated Oct 26, 2024 โ€ข 1

sonalkum/synthio-stable-audio-open

Updated Oct 19, 2024 โ€ข 3

datasets 1

sonalkum/AudioSkills-Llama3

Viewer โ€ข Updated Aug 18 โ€ข 704k โ€ข 26
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs