lucio
/

xls-r-uzbek-cv8

Automatic Speech Recognition

Generated from Trainer

hf-asr-leaderboard

mozilla-foundation/common_voice_8_0

robust-speech-event

Model card Files Files and versions

Metrics Training metrics Community

lucio commited on Feb 8, 2022

Commit

c7bc7d9

·

1 Parent(s): 007afbd

update model card

Files changed (1) hide show

README.md +30 -11

README.md CHANGED Viewed

@@ -6,17 +6,29 @@ tags:
 - automatic-speech-recognition
 - mozilla-foundation/common_voice_8_0
 - generated_from_trainer
 datasets:
-- common_voice
 model-index:
-- name: xls-r-uzbek-cv8
-  results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# xls-r-uzbek-cv8
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - UZ dataset.
 It achieves the following results on the evaluation set:
@@ -26,17 +38,24 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
-## Training procedure
 ### Training hyperparameters

 - automatic-speech-recognition
 - mozilla-foundation/common_voice_8_0
 - generated_from_trainer
+- robust-speech-event
 datasets:
+- mozilla-foundation/common_voice_8_0
 model-index:
+- name: XLS-R-300M Uzbek CV8
+  results:
+  - task:
+      name: Automatic Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: Common Voice 8
+      type: mozilla-foundation/common_voice_8_0
+      args: uz
+    metrics:
+       - name: Test WER (no LM)
+         type: wer
+         value: 32.88
+       - name: Test CER (no LM)
+         type: cer
+         value: 6.53
 ---
+# XLS-R-300M Uzbek CV8
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - UZ dataset.
 It achieves the following results on the evaluation set:
 ## Model description
+For a description of the model architecture, see [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m)
+The model vocabulary consists of the [Modern Latin alphabet for Uzbek](https://en.wikipedia.org/wiki/Uzbek_alphabet), with punctuation removed.
+Note that the characters <‘> and <’> do not count as punctuation, as <‘> modifies <o> and <g>, and <’> indicates the glottal stop or a long vowel.
 ## Intended uses & limitations
+This model is expected to be of some utility for low-fidelity use cases such as:
+- Draft video captions
+- Indexing of recorded broadcasts
+The model is not reliable enough to use as a substitute for live captions for accessibility purposes, and it should not be used in a manner that would infringe the privacy of any of the contributors to the Common Voice dataset nor any other speakers.
 ## Training and evaluation data
+The 50% of the `train` common voice official split was used as training data. The 50% of the official `dev` split was used as validation data, and the full `test` set was used for final evaluation.
+The kenlm language model was compiled from the target sentences of the train + other datasets.
 ### Training hyperparameters