Update README.md
README.md
@@ -22,7 +22,7 @@ widget:
 
 This is the official model from the publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models" (arXiv, 2024).
 
-> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer
+> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer. Adding DCoT data to instruction tuning allows models to improve performance through self-correction.
 
 
 Stay tuned for the release of the paper!