Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nguyenvulebinh
/
AVSRCocktail

Automatic Speech Recognition
Transformers
Safetensors
PyTorch
English
avhubert_avsr
audio-visual-speech-recognition
multimodal
speech-recognition
lip-reading
cocktail-party
noise-robust
av-hubert
transformer
audio
video
english
lrs2
voxceleb2
ctc
attention
beam-search
multi-speaker
noisy-speech
Model card Files Files and versions
xet
Community
AVSRCocktail
1.72 GB
  • 1 contributor
History: 2 commits
nguyenvulebinh's picture
nguyenvulebinh
Upload AVHubertAVSR
67bfcfe verified 4 months ago
  • .gitattributes
    1.52 kB
    initial commit 4 months ago
  • README.md
    5.17 kB
    Upload AVHubertAVSR 4 months ago
  • config.json
    4.44 kB
    Upload AVHubertAVSR 4 months ago
  • model.safetensors
    1.72 GB
    xet
    Upload AVHubertAVSR 4 months ago