weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition • Updated 22 days ago • 5k • 40
weqweasdas/from_default_filtered_openr1_with_scores_filtered_05_and_filtered_allwrong • Updated Sep 18 • 25k • 22
weqweasdas/dapo_and_openr1_can_be_evaluated_by_daporm_deduplicate_with_scores • Updated Sep 16 • 34.1k • 13
weqweasdas/dapo_and_openr1_can_be_evaluated_by_daporm_deduplicate • Updated Sep 15 • 34.1k • 18
weqweasdas/test_rm_from_default_filtered_openr_math_verify_scores_and_dapo_scores • Updated Sep 15 • 93.7k • 19
weqweasdas/test_rm_from_default_filtered_openr_math_verify_scores • Updated Sep 15 • 93.7k • 21
weqweasdas/from_default_filtered_openr1_with_scores_filtered_0125_but_not_all_wrong • Updated Sep 13 • 13.3k • 10
weqweasdas/from_default_filtered_openr1_with_scores_filtered_025 • Updated Sep 13 • 45.5k • 16
weqweasdas/from_default_filtered_openr1_with_scores_filtered_0125 • Updated Sep 13 • 37.8k • 10