yujiepan
/

microllama-0.06B

Text Generation

Model card Files Files and versions

yujiepan commited on May 16

Commit

8544322

·

verified ·

1 Parent(s): dd35ccc

Update README.md

Files changed (1) hide show

README.md +18 -0

README.md CHANGED Viewed

@@ -14,3 +14,21 @@ It is a small pretrained model that can do text generation. Very useful for algo
 Special thanks to the original author [OuteAI](https://huggingface.co/OuteAI/) for the hard work and contribution.
 **This repo is just a backup for myself. If you find this model useful, consider using the original repo instead.**

 Special thanks to the original author [OuteAI](https://huggingface.co/OuteAI/) for the hard work and contribution.
 **This repo is just a backup for myself. If you find this model useful, consider using the original repo instead.**
+## Evaluation
+```bash
+lm_eval --model hf \
+  --model_args pretrained=yujiepan/microllama-0.06B,max_length=4096,dtype="<dtype>" \
+  --tasks wikitext \
+  --device cuda:0 \
+  --batch_size 1
+```
+| Model dtype | Word perplexity |
+| ----------- | --------------- |
+| FP32        | 97.3325         |
+| BF16        | 97.2494         |
+| FP16        | 97.3342         |
+Tested on A100 with `lm-eval==0.4.7`.