haritzpuerto
/

LLaMA2-70B-dcot

Text Generation

Model card Files Files and versions

haritzpuerto commited on Jun 26, 2024

Commit

5671ade

·

verified ·

1 Parent(s): d303dbf

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ widget:
 This is the official model from the publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models" (arXiv, 2024).
-> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer and adding DCoT data to instruction tuning allows models to improve performance through self-correction.
 Stay tuned for the release of the paper!

 This is the official model from the publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models" (arXiv, 2024).
+> TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer. Adding DCoT data to instruction tuning allows models to improve performance through self-correction.
 Stay tuned for the release of the paper!