Red Teaming Alignment Evals
AIPlans/Qwen-HHH-Cipher-Eng • Text Generation • 0.5B • Updated Jun 14 • 4
AIPlans/Qwen-HHH-Sans-Eng • Text Generation • 0.5B • Updated Jun 11 • 11
AIPlans/Qwen3-HHH-Cipher-Eng • Text Generation • 0.6B • Updated Jun 15 • 3
AIPlans/Ethics_Commonsense • Preview • Updated Jun 21 • 10
Model Diffing
AIPlans/qwen3-8b-dpo-hh-rlhf • Updated 27 days ago
AIPlans/qwen3-8b-ipo-hh-rlhf • Text Generation • Updated 13 days ago • 1