AI & ML interests
None yet
Organizations
None yet
kapilw25/llama3-8b-pku-GRPO-Instruct-SFT-Instruct
Updated
kapilw25/llama3-8b-pku-PPO-Instruct-SFT-Instruct
Updated
kapilw25/llama3-8b-pku-PPO-NoInstruct-SFT-NoInstruct
Updated
kapilw25/llama3-8b-pku-GRPO-NoInstruct-SFT-NoInstruct
Updated
kapilw25/llama3-8b-pku-CITA-Instruct-DPO-Instruct
Updated
kapilw25/llama3-8b-pku-DPO-Instruct-SFT-Instruct
Updated
kapilw25/llama3-8b-pku-CITA-NoInstruct-DPO-NoInstruct
Updated
kapilw25/llama3-8b-pku-DPO-NoInstruct-SFT-NoInstruct
Updated
kapilw25/llama3-8b-pku-SFT-Instruct-Baseline-NoInstruct
Updated
kapilw25/llama3-8b-pku-SFT-NoInstruct-Baseline-NoInstruct
Updated