MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning (Paper • 2507.16812 • Published 7 days ago • 48)
RefineX: Learning to Refine Pre-training Data at Scale from Expert-Guided Programs (Paper • 2507.03253 • Published 26 days ago • 18)
Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective (Paper • 2406.14023 • Published Jun 20, 2024 • 1)