lucio/xls-r-uzbek-cv8

Automatic Speech Recognition

Generated from Trainer

hf-asr-leaderboard

mozilla-foundation/common_voice_8_0

robust-speech-event

lucio committed on Feb 8, 2022

Commit e7b488d · 1 Parent(s): f5da949

update model card

Files changed (1)

README.md +7 -1

@@ -26,6 +26,12 @@ model-index:
        - name: Test CER (no LM)
          type: cer
          value: 6.53
+       - name: Test WER (with LM)
+         type: wer
+         value: 15.065
+       - name: Test CER (with LM)
+         type: cer
+         value: 3.077
 ---
 # XLS-R-300M Uzbek CV8
@@ -53,7 +59,7 @@ The model is not reliable enough to use as a substitute for live captions for ac
 ## Training and evaluation data
-The 50% of the `train` common voice official split was used as training data. The 50% of the official `dev` split was used as validation data, and the full `test` set was used for final evaluation.
+The 50% of the `train` common voice official split was used as training data. The 50% of the official `dev` split was used as validation data, and the full `test` set was used for final evaluation of the model without LM, while the model with LM was evaluated only on 500 examples from the `test` set.
 The kenlm language model was compiled from the target sentences of the train + other datasets.
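
For readers comparing the WER and CER values in this diff, the following is a minimal sketch of how such corpus-level error rates are commonly computed: total edit distance divided by total reference length, reported as a percentage. It is not the author's evaluation script. The helper names `edit_distance` and `error_rate` and the sample sentences are hypothetical, and the sketch assumes no text normalization and counts spaces as characters for CER.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (words or characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(references, hypotheses, unit="word"):
    """Corpus-level error rate in percent: unit="word" gives WER, "char" gives CER."""
    errors = 0
    total = 0
    for ref, hyp in zip(references, hypotheses):
        ref_units = ref.split() if unit == "word" else list(ref)
        hyp_units = hyp.split() if unit == "word" else list(hyp)
        errors += edit_distance(ref_units, hyp_units)
        total += len(ref_units)
    return 100.0 * errors / total


# Hypothetical transcripts, not taken from Common Voice.
refs = ["a b c d"]
hyps = ["a b x d"]
wer = error_rate(refs, hyps, unit="word")
cer = error_rate(refs, hyps, unit="char")
```

Because the errors are pooled over the whole evaluation set, a score measured on 500 examples and a score measured on the full `test` set are estimates over different samples, so the with-LM and no-LM figures are not strictly comparable.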