h2m
/

mhm-7b-v1.3-DPO-1

Text Generation

text-generation-inference

Model card Files Files and versions

Update README.md

#1

by Kquant03 - opened Jan 21, 2024

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +6 -3

README.md CHANGED Viewed

@@ -1,10 +1,13 @@
 ---
 license: apache-2.0
 ---
-h2m/mhm-7b-v1.3-DPO-1
-This is a DPO fine tuned mhm-7b-v1.3 on Intel/orca_dpo_pairs
-Model based on mistral. created using dare_ties and models from openllm leaderboard. Mixed 7 models into 1. 3 times merging.
 Just an experiment.

 ---
 license: apache-2.0
+language:
+- en
 ---
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/6589d7e6586088fd2784a12c/ORVjYrpzyfKfP4ByOQnpQ.jpeg)
+A DPO fine tuned [mhm-7b-v1.3](https://huggingface.co/h2m/mhm-7b-v1.3) on [Intel/orca_dpo_pairs](https://huggingface.co/datasets/Intel/orca_dpo_pairs)
+Based upon mistral. Created using [dare_ties](https://github.com/cg123/mergekit) and models from openllm leaderboard. Over 3 merges involving 7 different models, this was the result.
 Just an experiment.