plaguss/Qwen2.5-0.5B-Math-Shepherd-PRM-token-0.1 • Token Classification • 0.5B • Updated Dec 4, 2024 • 5
qgallouedec/Qwen2-0.5B-Reward-Math-Sheperd-KN-fix-cast • Token Classification • 0.5B • Updated Dec 9, 2024 • 2
hzy/Qwen2.5-Math-7B-Instruct-PRM-Modified-math_shepherd • Token Classification • 7B • Updated Mar 10 • 1
alothomas/Qwen2.5-0.5B-PRM-RAD-balanced-150k-LastStepOnly • Token Classification • 0.5B • Updated Sep 24 • 5