haritzpuerto/LLaMA2-70B-dcot

Text Generation


haritzpuerto committed on Jul 16, 2024

Commit 40594d6 (verified) · 1 Parent(s): 6b21a3d

Update README.md

Files changed (1)

README.md +0 -3


@@ -25,9 +25,6 @@ This is the official model from the publication "Fine-Tuning with Divergent Chai
 > TLDR: Divergent Chain of Thought (DCoT) consists of requiring models to generate multiple CoTs before choosing an answer. Adding DCoT data to instruction tuning allows models to improve performance through self-correction.
-Stay tuned for the release of the paper!
 # Load the Model
 ```
 from peft import LoraConfig, PeftModel
