DataSoul/ALMA-7B-R-gguf

DataSoul committed on Jan 29, 2024

Commit e18188f · verified · 1 parent: b3f96cc

Update README.md

Files changed (1)

README.md +7 -1

@@ -4,8 +4,14 @@ I just made a gguf file for my own use, and then share it, please support the or
 ---
 This repo contains GGUF format model files for  **[haoranxu/ALMA-7B-R](https://huggingface.co/haoranxu/ALMA-7B-R)**
 ---
-license: mit
 ---
 **[ALMA-R](https://arxiv.org/abs/2401.08417)** builds upon [ALMA models](https://arxiv.org/abs/2309.11674), with further LoRA fine-tuning with our proposed **Contrastive Preference Optimization (CPO)** as opposed to the Supervised Fine-tuning used in ALMA. CPO fine-tuning requires our [triplet preference data](https://huggingface.co/datasets/haoranxu/ALMA-R-Preference) for preference learning. ALMA-R now can matches or even exceeds GPT-4 or WMT winners!
 ```
 @misc{xu2024contrastive,

 ---
 This repo contains GGUF format model files for  **[haoranxu/ALMA-7B-R](https://huggingface.co/haoranxu/ALMA-7B-R)**
 ---
 ---
+---
+---
+---
+the original model card:
+---
+license: mit
 **[ALMA-R](https://arxiv.org/abs/2401.08417)** builds upon [ALMA models](https://arxiv.org/abs/2309.11674), with further LoRA fine-tuning with our proposed **Contrastive Preference Optimization (CPO)** as opposed to the Supervised Fine-tuning used in ALMA. CPO fine-tuning requires our [triplet preference data](https://huggingface.co/datasets/haoranxu/ALMA-R-Preference) for preference learning. ALMA-R now can matches or even exceeds GPT-4 or WMT winners!
 ```
 @misc{xu2024contrastive,