Update README.md
Browse files
README.md
CHANGED
|
@@ -290,7 +290,7 @@ The fine-tuning process was divided into two stages:
|
|
| 290 |
- Supervised fine-tuning (SFT) using the [TucanoBR/Tucano-SFT](https://huggingface.co/datasets/TucanoBR/Tucano-SFT), a concatenation of three different instruction tuning datasets ([`cnmoro/GPT4-500k-Augmented-PTBR-Clean`](https://huggingface.co/datasets/cnmoro/GPT4-500k-Augmented-PTBR-Clean), [`rhaymison/orca-math-portuguese-64k`](https://huggingface.co/datasets/rhaymison/orca-math-portuguese-64k), [`nicholasKluge/instruct-aira-dataset-v3`](https://huggingface.co/datasets/nicholasKluge/instruct-aira-dataset-v3)).
|
| 291 |
- Direct Preference Optimization (DPO) using the [nicholasKluge/reward-aira-dataset](https://huggingface.co/datasets/nicholasKluge/reward-aira-dataset).
|
| 292 |
|
| 293 |
-
Read our preprint [here](https://arxiv.org/abs/
|
| 294 |
|
| 295 |
## Details
|
| 296 |
|
|
@@ -414,11 +414,14 @@ To evaluate the `Instruct` versions of our models, we used [AlpacaEval](https://
|
|
| 414 |
## Cite as 🤗
|
| 415 |
|
| 416 |
```latex
|
| 417 |
-
@misc{
|
| 418 |
-
|
| 419 |
-
|
| 420 |
-
|
| 421 |
-
|
|
|
|
|
|
|
|
|
|
| 422 |
}
|
| 423 |
```
|
| 424 |
|
|
|
|
| 290 |
- Supervised fine-tuning (SFT) using the [TucanoBR/Tucano-SFT](https://huggingface.co/datasets/TucanoBR/Tucano-SFT), a concatenation of three different instruction tuning datasets ([`cnmoro/GPT4-500k-Augmented-PTBR-Clean`](https://huggingface.co/datasets/cnmoro/GPT4-500k-Augmented-PTBR-Clean), [`rhaymison/orca-math-portuguese-64k`](https://huggingface.co/datasets/rhaymison/orca-math-portuguese-64k), [`nicholasKluge/instruct-aira-dataset-v3`](https://huggingface.co/datasets/nicholasKluge/instruct-aira-dataset-v3)).
|
| 291 |
- Direct Preference Optimization (DPO) using the [nicholasKluge/reward-aira-dataset](https://huggingface.co/datasets/nicholasKluge/reward-aira-dataset).
|
| 292 |
|
| 293 |
+
Read our preprint [here](https://arxiv.org/abs/2411.07854).
|
| 294 |
|
| 295 |
## Details
|
| 296 |
|
|
|
|
| 414 |
## Cite as 🤗
|
| 415 |
|
| 416 |
```latex
|
| 417 |
+
@misc{correa2024tucanoadvancingneuraltext,
|
| 418 |
+
title={{Tucano: Advancing Neural Text Generation for Portuguese}},
|
| 419 |
+
author={Corr{\^e}a, Nicholas Kluge and Sen, Aniket and Falk, Sophia and Fatimah, Shiza},
|
| 420 |
+
year={2024},
|
| 421 |
+
eprint={2411.07854},
|
| 422 |
+
archivePrefix={arXiv},
|
| 423 |
+
primaryClass={cs.CL},
|
| 424 |
+
url={https://arxiv.org/abs/2411.07854},
|
| 425 |
}
|
| 426 |
```
|
| 427 |
|