tsinghua-ee/SALMONN

Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
SALMONN/resource/audio_demo
  • 5 contributors
History: 3 commits
Changli
chore: release v1
0bf5005 almost 2 years ago
  • duck.wav
    640 kB
    chore: release v1 almost 2 years ago
  • excitement.wav
    40.4 kB
    chore: release v1 almost 2 years ago
  • gunshots.wav
    320 kB
    chore: release v1 almost 2 years ago
  • mountain.wav
    79.1 kB
    chore: release v1 almost 2 years ago
  • music.wav
    639 kB
    chore: release v1 almost 2 years ago