safe-llm-finetune/llama-3.2-1b-it-codeUltraFeedback-fullFT • Text Generation • 1B • Updated Jun 21 • 13
safe-llm-finetune/llama-3.2-1b-it-codeUltraFeedback-fullFT-lr5e-5-bs8 • Text Generation • 1B • Updated Jun 22 • 6
safe-llm-finetune/llama-3.2-1b-it-codeUltraFeedback-DPO-lr5e-6-bs8 • Text Generation • 1B • Updated Jun 22 • 6