tsinghua-ee/SALMONN

Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
SALMONN
404 MB
  • 6 contributors
History: 31 commits
Changli
Update README.md
3cc7208 about 2 years ago
  • beats
    chore: release v1 about 2 years ago
  • other_third-party_licenses
    chore: release v1 about 2 years ago
  • qformer
    chore: release v1 about 2 years ago
  • resource
    chore: release v1 about 2 years ago
  • .gitattributes
    56 Bytes
    chore: release v1 about 2 years ago
  • .gitignore
    3.1 kB
    chore: release v1 about 2 years ago
  • LICENSE
    11.3 kB
    chore: release v1 about 2 years ago
  • README.md
    6.05 kB
    Update README.md about 2 years ago
  • cli_inference.py
    1.98 kB
    chore: add lora alpha about 2 years ago
  • model.py
    9.79 kB
    chore: release v1 about 2 years ago
  • requirements.txt
    160 Bytes
    Create requirements.txt about 2 years ago
  • salmonn_v1.pth

    Detected Pickle imports (4)

    • "torch._utils._rebuild_tensor_v2",
    • "torch.FloatStorage",
    • "collections.OrderedDict",
    • "torch.LongStorage"

    400 MB
    xet
    Upload salmonn_v1.pth about 2 years ago
  • web_demo.py
    7.32 kB
    chore: change sac prompot about 2 years ago
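
The pickle imports listed for salmonn_v1.pth above are the globals that unpickling the checkpoint would import. As a rough illustration of how such a list can be produced without executing the pickle, the sketch below walks the opcode stream with the standard-library `pickletools` module. The function name `list_pickle_imports` is hypothetical, and resolving `STACK_GLOBAL` from the two most recent string arguments is a simplifying heuristic (it ignores strings fetched back from the memo). This is not the scanner the Hub itself uses.

```python
import collections
import io
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return 'module.name' globals referenced by a pickle stream, without unpickling it.

    Hypothetical helper for illustration only.
    """
    imports = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: pickletools reports the argument as "module name"
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as strings beforehand.
            # Heuristic: assume they are the two most recent string arguments.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


if __name__ == "__main__":
    payload = pickle.dumps(collections.OrderedDict(a=1), protocol=4)
    print(sorted(list_pickle_imports(payload)))
```

To apply this to a checkpoint saved in PyTorch's zip-based format, one would first read the pickle member (typically named `data.pkl`) out of the archive with `zipfile` and pass its bytes to the function. Whether salmonn_v1.pth uses that layout is an assumption that is not confirmed by this listing.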