Update README.md
README.md CHANGED
@@ -19,4 +19,27 @@ This autoregressive model belongs to a series of rather small language models tr
 | Attention heads | 8 | 8 | 16 | 16 |
 | Embedding size | 128 | 128 | 256 | 256 |
 | Context size | 128 | 128 | 256 | 256 |
-| Vocab size | 16k | 16k | 16k | 16k |
+| Vocab size | 16k | 16k | 16k | 16k |
+
+
+If you use this model in your research, please cite the following publication:
+
+```
+@inproceedings{bunzeck-zarriess-2024-fifty,
+title = "Fifty shapes of {BL}i{MP}: syntactic learning curves in language models are not uniform, but sometimes unruly",
+author = "Bunzeck, Bastian and
+Zarrie{\ss}, Sina",
+editor = "Qiu, Amy and
+Noble, Bill and
+Pagmar, David and
+Maraev, Vladislav and
+Ilinykh, Nikolai",
+booktitle = "Proceedings of the 2024 CLASP Conference on Multimodality and Interaction in Language Learning",
+month = oct,
+year = "2024",
+address = "Gothenburg, Sweden",
+publisher = "Association for Computational Linguistics",
+url = "https://aclanthology.org/2024.clasp-1.7",
+pages = "39--55",
+}
+```