Update README.md
Browse files
README.md
CHANGED
|
@@ -25,9 +25,6 @@ This is the official model from the publication "Fine-Tuning with Divergent Chai
|
|
| 25 |
> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer. Adding DCoT data to instruction tuning allows models to improve performance through self-correction.
|
| 26 |
|
| 27 |
|
| 28 |
-
Stay tuned for the release of the paper!
|
| 29 |
-
|
| 30 |
-
|
| 31 |
# Load the Model
|
| 32 |
```
|
| 33 |
from peft import LoraConfig, PeftModel
|
|
|
|
| 25 |
> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer. Adding DCoT data to instruction tuning allows models to improve performance through self-correction.
|
| 26 |
|
| 27 |
|
|
|
|
|
|
|
|
|
|
| 28 |
# Load the Model
|
| 29 |
```
|
| 30 |
from peft import LoraConfig, PeftModel
|