Update README.md
Browse files
README.md
CHANGED
|
@@ -14,3 +14,21 @@ It is a small pretrained model that can do text generation. Very useful for algo
|
|
| 14 |
Special thanks to the original author [OuteAI](https://huggingface.co/OuteAI/) for the hard work and contribution.
|
| 15 |
|
| 16 |
**This repo is just a backup for myself. If you find this model useful, consider using the original repo instead.**
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14 |
Special thanks to the original author [OuteAI](https://huggingface.co/OuteAI/) for the hard work and contribution.
|
| 15 |
|
| 16 |
**This repo is just a backup for myself. If you find this model useful, consider using the original repo instead.**
|
| 17 |
+
|
| 18 |
+
## Evaluation
|
| 19 |
+
|
| 20 |
+
```bash
|
| 21 |
+
lm_eval --model hf \
|
| 22 |
+
--model_args pretrained=yujiepan/microllama-0.06B,max_length=4096,dtype="<dtype>" \
|
| 23 |
+
--tasks wikitext \
|
| 24 |
+
--device cuda:0 \
|
| 25 |
+
--batch_size 1
|
| 26 |
+
```
|
| 27 |
+
|
| 28 |
+
| Model dtype | Word perplexity |
|
| 29 |
+
| ----------- | --------------- |
|
| 30 |
+
| FP32 | 97.3325 |
|
| 31 |
+
| BF16 | 97.2494 |
|
| 32 |
+
| FP16 | 97.3342 |
|
| 33 |
+
|
| 34 |
+
Tested on A100 with `lm-eval==0.4.7`.
|