ReasoningMila/ServiceNowAI_R1_Distill_SFT_with_problems_and_responses Viewer • Updated May 22 • 1.68M • 35
ReasoningMila/ServiceNowAI_R1_Distill_SFT_with_problems_and_responses Viewer • Updated May 22 • 1.68M • 35
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers Paper • 2505.04842 • Published May 7 • 12
Leveraging recent advances in Pre-Trained Language Models forEye-Tracking Prediction Paper • 2110.04475 • Published Oct 9, 2021
When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning Paper • 2504.01005 • Published Apr 1 • 16
ReasoningMila/syn_qs_and_soln_cleaned_0_and_less20_multiple_soln_per_qs_1937545 Viewer • Updated Mar 23 • 1.94M • 9
ReasoningMila/syn_qs_and_soln_cleaned_0_and_less20_multiple_soln_per_qs_1937545 Viewer • Updated Mar 23 • 1.94M • 9
ReasoningMila/syn_qs_and_soln_cleaned_0_and_less20_1_soln_per_qs_131845 Viewer • Updated Mar 23 • 132k • 11
ReasoningMila/syn_qs_and_soln_cleaned_0_and_less20_1_soln_per_qs_131845 Viewer • Updated Mar 23 • 132k • 11