Update README.md

README.md (changed):

We train all models using LoRA with the PEFT library. The main parameters are:
| optim | paged_adamw_32bit |
| lr_scheduler_type | constant |

Please check Appendix B of the paper for more details.
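
For orientation, here is a minimal sketch of how these parameters slot into a PEFT + `transformers` training setup. Only `optim` and `lr_scheduler_type` come from the table above; the base model name, LoRA rank, and alpha are illustrative placeholders, not the paper's actual configuration (see Appendix B for that).

```python
# Minimal sketch: wiring the table's optimizer settings into a PEFT LoRA run.
# NOTE: the model name, r, and lora_alpha below are placeholders, not the paper's values.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, TrainingArguments

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")  # placeholder base model

peft_config = LoraConfig(
    r=16,                   # placeholder LoRA rank
    lora_alpha=32,          # placeholder scaling factor
    task_type="CAUSAL_LM",  # causal language modeling adapter
)
model = get_peft_model(model, peft_config)

training_args = TrainingArguments(
    output_dir="./lora-out",
    optim="paged_adamw_32bit",     # from the parameter table (requires bitsandbytes)
    lr_scheduler_type="constant",  # from the parameter table
)
# training_args would then be passed to a transformers.Trainer along with the dataset.
```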

# Cite

If you find our work useful, please consider citing it:

```
@misc{puerto2024dcot,
      title={Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models},
      author={Haritz Puerto and Tilek Chubakov and Xiaodan Zhu and Harish Tayyar Madabushi and Iryna Gurevych},
      year={2024},
      eprint={2407.03181},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2407.03181},
}
```