safe-llm-finetune/llama-3.2-1b-it-translation-full-lr5e-05-bs8 Text Generation • 1B • Updated Jun 27 • 11
JayHyeon/llama-BDPO_5e-7-1ep_0alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 22
JayHyeon/llama-DPO_5e-7-1ep_0alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 16
JayHyeon/llama-IRPO_5e-7-1ep_1alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 17
JayHyeon/llama-DPOP_5e-7-1ep_0alp_0.5bdpo_lam_5dpop_lam Text Generation • 1B • Updated about 1 month ago • 16
safe-llm-finetune/llama-3.2-1b-it-translation-dpo-lr5e-05-bs8 Text Generation • 1B • Updated Jun 27 • 7
safe-llm-finetune/llama-3.2-1b-it-translation-full-lr5e-06-bs8 Text Generation • 1B • Updated Jun 27 • 7
safe-llm-finetune/llama-3.2-1b-it-translation-dpo-lr1e-05-bs8 Text Generation • 1B • Updated Jun 27 • 13
safe-llm-finetune/llama-3.2-1b-it-translation-dpo-lr5e-06-bs8 Text Generation • 1B • Updated Jun 27 • 10
JayHyeon/llama-DPO_1e-6-1ep_0alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated 30 days ago • 15
JayHyeon/llama-DPOP_1e-6-1ep_0alp_0.5bdpo_lam_5dpop_lam Text Generation • 1B • Updated 30 days ago • 17
JayHyeon/llama-BDPO_1e-6-1ep_0alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 16
JayHyeon/llama-IRPO_1e-6-1ep_1alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 16
JayHyeon/llama-IRPO_1e-6-2ep_1alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 7
JayHyeon/llama-DPO_1e-6-2ep_0alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 8
JayHyeon/llama-BDPO_1e-6-2ep_0alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 13
JayHyeon/llama-DPOP_1e-6-2ep_0alp_0.5bdpo_lam_5dpop_lam Text Generation • 1B • Updated about 1 month ago • 7
JayHyeon/llama-BDPO_1e-7-1ep_0alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 16
JayHyeon/llama-IRPO_1e-7-1ep_1alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 16
JayHyeon/llama-DPOP_1e-7-1ep_0alp_0.5bdpo_lam_5dpop_lam Text Generation • 1B • Updated about 1 month ago • 15
JayHyeon/llama-DPO_1e-7-1ep_0alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 18
JayHyeon/llama-BDPO_2e-7-1ep_0alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 16
JayHyeon/llama-IRPO_2e-7-1ep_1alp_0.5bdpo_lam_0dpop_lam Text Generation • 1B • Updated about 1 month ago • 16