Update README.md
README.md CHANGED
@@ -11,6 +11,8 @@ WARNING! Load tokenizer as AutoTokenizer.from_pretrained(model_path, use_fast=Tr
 
 Up to 60% faster generation and 35% faster training (on identical Russian text sequences!) with HF, because of the different tokenizer.
 
+Paper: Tikhomirov M., Chernyshev D. Impact of Tokenization on LLaMa Russian Adaptation // arXiv preprint arXiv:2312.02598. – 2023.
+
 ## Training procedure
 
 ruadapt mistral was trained on the Saiga corpora.
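For reference, a minimal sketch of following the README's warning in practice: load the tokenizer with `use_fast=True` and compare the token count of a Russian sentence against a base Mistral tokenizer. The repository path and base model id below are assumptions for illustration, not taken from this commit.

```python
# Minimal sketch (assumed paths/ids, not from this commit): load the adapted
# tokenizer with use_fast=True as the README warns, and compare how many
# tokens a Russian sentence needs versus a base Mistral tokenizer.
from transformers import AutoTokenizer

ADAPTED_MODEL_PATH = "path/to/ruadapt_mistral"    # placeholder local path or repo id
BASE_MODEL_ID = "mistralai/Mistral-7B-v0.1"       # assumed base model for comparison

# The README's warning: always pass use_fast=True for this tokenizer.
ruadapt_tok = AutoTokenizer.from_pretrained(ADAPTED_MODEL_PATH, use_fast=True)
base_tok = AutoTokenizer.from_pretrained(BASE_MODEL_ID)

# An example Russian sentence to tokenize with both tokenizers.
text = "Пример русского предложения для сравнения токенизации."

ruadapt_ids = ruadapt_tok(text)["input_ids"]
base_ids = base_tok(text)["input_ids"]

# Fewer tokens per Russian sequence is what underlies the faster
# generation/training numbers claimed in the README.
print(f"ruadapt tokens: {len(ruadapt_ids)}")
print(f"base Mistral tokens: {len(base_ids)}")
```

This only illustrates the tokenizer-length argument behind the speed claim; the exact figures depend on the text and on generation/training settings.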