weixinchen/GRATH-selftruth


weixinchen committed on Jul 17, 2024

Commit 86f2c5f · verified · 1 Parent(s): e1e257e

Update README.md

Files changed (1)

README.md +6 -0

@@ -1,6 +1,12 @@
 ---
 library_name: peft
 ---
 ## Training procedure

 ---
 library_name: peft
 ---
+This is a self-truthified model proposed in the paper [GRATH: Gradual Self-Truthifying for Large Language Models](https://arxiv.org/abs/2401.12292).
+Note: DPO was applied to this model once. The DPO reference model is the current base model (i.e., the pretrained base model).
 ## Training procedure
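
The note added in this commit says DPO was applied once, with the pretrained base model as the reference model. As a rough illustration of where the reference model enters the objective, here is a minimal sketch of the standard per-pair DPO loss in plain Python. It is not taken from the GRATH code: the function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions made for this example, and the inputs are assumed to be summed log-probabilities of the chosen and rejected responses.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-pair DPO loss (illustrative sketch; names and beta are assumptions).

    Each argument is the log-probability of a full response under either the
    policy being trained or the frozen reference model.
    """
    # How much more (or less) likely each response is under the policy
    # than under the reference model.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin between the chosen and rejected responses.
    margin = beta * (chosen_logratio - rejected_logratio)

    # -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy is initialized from the reference model, as in a single DPO round starting from the pretrained base model, both log-ratios are zero at the first step and the loss equals `log(2)`. It decreases as the policy raises the chosen response's likelihood relative to the rejected one.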