update model card
Browse files
README.md
CHANGED
|
@@ -26,6 +26,12 @@ model-index:
|
|
| 26 |
- name: Test CER (no LM)
|
| 27 |
type: cer
|
| 28 |
value: 6.53
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 29 |
---
|
| 30 |
|
| 31 |
# XLS-R-300M Uzbek CV8
|
|
@@ -53,7 +59,7 @@ The model is not reliable enough to use as a substitute for live captions for ac
|
|
| 53 |
|
| 54 |
## Training and evaluation data
|
| 55 |
|
| 56 |
-
The 50% of the `train` common voice official split was used as training data. The 50% of the official `dev` split was used as validation data, and the full `test` set was used for final evaluation.
|
| 57 |
|
| 58 |
The kenlm language model was compiled from the target sentences of the train + other datasets.
|
| 59 |
|
|
|
|
| 26 |
- name: Test CER (no LM)
|
| 27 |
type: cer
|
| 28 |
value: 6.53
|
| 29 |
+
- name: Test WER (with LM)
|
| 30 |
+
type: wer
|
| 31 |
+
value: 15.065
|
| 32 |
+
- name: Test CER (with LM)
|
| 33 |
+
type: cer
|
| 34 |
+
value: 3.077
|
| 35 |
---
|
| 36 |
|
| 37 |
# XLS-R-300M Uzbek CV8
|
|
|
|
| 59 |
|
| 60 |
## Training and evaluation data
|
| 61 |
|
| 62 |
+
The 50% of the `train` common voice official split was used as training data. The 50% of the official `dev` split was used as validation data, and the full `test` set was used for final evaluation of the model without LM, while the model with LM was evaluated only on 500 examples from the `test` set.
|
| 63 |
|
| 64 |
The kenlm language model was compiled from the target sentences of the train + other datasets.
|
| 65 |
|