update readme
README.md CHANGED

    @@ -14,6 +14,8 @@ inference:
     
     This is a DenseFormer implementation of Mistral-7B-v0.1. The details about DenseFormer are in the [paper](https://arxiv.org/abs/2402.02622).
     
    +You will need to use `trust_remote_code=True` to load this model.
    +
     This model WILL require additional pretraining to see the improvements indicated by the DenseFormer paper. Per the paper:
     
     ```
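The line added in this commit says the model must be loaded with `trust_remote_code=True`. A minimal sketch of what that could look like with the `transformers` Auto classes follows. The diff does not name the repository or the model class, so `repo_id` is a placeholder and the use of `AutoModelForCausalLM` is an assumption based on Mistral-7B being a causal language model.

```python
def load_kwargs() -> dict:
    """Keyword arguments shared by the tokenizer and model loaders."""
    # trust_remote_code=True allows transformers to import the custom
    # modeling code that ships inside the model repository.
    return {"trust_remote_code": True}


def load_denseformer(repo_id: str):
    """Load the tokenizer and model from a Hub repo with custom model code.

    repo_id is hypothetical here: the diff does not state the repository name.
    """
    # Imported lazily so load_kwargs() can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id, **load_kwargs())
    # Assumption: the repo registers its model class for AutoModelForCausalLM.
    model = AutoModelForCausalLM.from_pretrained(repo_id, **load_kwargs())
    return tokenizer, model
```

Usage would be along the lines of `tokenizer, model = load_denseformer("<namespace>/<repo-name>")`, with the placeholder replaced by the actual repository ID. Because the flag runs Python code downloaded from the repository, it is worth reading that code before enabling it.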