Update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,7 @@ base_model:
|
|
10 |
# TESS 2 v0.1 Base
|
11 |
|
12 |
This model is the diffusion adapted TESS 2. This model is a simplex-based diffusion model adapted from Mistral v0.1 7B, further trained on Dolma 1.7.
|
13 |
-
For more details, please check out our paper [TESS-2: A Large-Scale, Generalist Diffusion Language Model](https://
|
14 |
This is the model based on Mistral v0.1.
|
15 |
|
16 |
**This is the diffusion-adapted base model, which has not yet undergone instruction tuning. We recommend further tuning this model on your dataset of interest, or checking out the [instruction tuned version](https://huggingface.co/hamishivi/tess2).**
|
|
|
10 |
# TESS 2 v0.1 Base
|
11 |
|
12 |
This model is the diffusion adapted TESS 2. This model is a simplex-based diffusion model adapted from Mistral v0.1 7B, further trained on Dolma 1.7.
|
13 |
+
For more details, please check out our paper [TESS-2: A Large-Scale, Generalist Diffusion Language Model](https://arxiv.org/abs/2502.13917).
|
14 |
This is the model based on Mistral v0.1.
|
15 |
|
16 |
**This is the diffusion-adapted base model, which has not yet undergone instruction tuning. We recommend further tuning this model on your dataset of interest, or checking out the [instruction tuned version](https://huggingface.co/hamishivi/tess2).**
|