Update README.md
Browse files
README.md
CHANGED
|
@@ -11,7 +11,7 @@ language:
|
|
| 11 |
|
| 12 |
|
| 13 |
The UCCIX-Llama2-13B Large Language Model (LLM) is an Irish-English bilingual model, capables of understanding both languages and outperforms much larger models on Irish language tasks.
|
| 14 |
-
The model is based on Llama 2-13B, with vocabulary expansion to include native Irish tokens, and additional continued pre-training on our collection of ~
|
| 15 |
|
| 16 |
UCCIX is a pioneering effort on the development of first-ever open-source Irish-based LLM. You can find more details at: https://arxiv.org/abs/2405.13010
|
| 17 |
|
|
|
|
| 11 |
|
| 12 |
|
| 13 |
The UCCIX-Llama2-13B Large Language Model (LLM) is an Irish-English bilingual model, capables of understanding both languages and outperforms much larger models on Irish language tasks.
|
| 14 |
+
The model is based on Llama 2-13B, with vocabulary expansion to include native Irish tokens, and additional continued pre-training on our collection of ~520M Irish tokens (available at https://huggingface.co/datasets/ReliableAI/Irish-Text-Collection).
|
| 15 |
|
| 16 |
UCCIX is a pioneering effort on the development of first-ever open-source Irish-based LLM. You can find more details at: https://arxiv.org/abs/2405.13010
|
| 17 |
|