Released models trained by Selective DPO.
-
glorgao/SelectiveDPO-Gemma2-9B-SFT-UFBinarized
Text Generation • 9B • Updated -
glorgao/SelectiveDPO-Llama3-8B-SFT-UFBinarized
Text Generation • 8B • Updated • 1 • 1 -
glorgao/SelectiveDPO-Qwen2.5-7B-SFT-UFBinarized
Text Generation • 7B • Updated • 76 • 1 -
glorgao/SelectiveDPO-Mistral-7B-SFT-UFBinarized
Text Generation • 7B • Updated