Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
47
Follow
Electronic Engineering @Tsinghua University
45
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
3
eaf3298
SALMONN
407 MB
6 contributors
History:
7 commits
Changli
Upload salmonn_v1.pth
eaf3298
about 2 years ago
resource
Upload 20 files
about 2 years ago
.gitattributes
Safe
1.58 kB
Upload 20 files
about 2 years ago
README.md
3.86 kB
Update README.md
about 2 years ago
salmonn_v1.pth
Safe
pickle
Detected Pickle imports (4)
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"torch.LongStorage"
What is a pickle import?
400 MB
xet
Upload salmonn_v1.pth
about 2 years ago