NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation Paper β’ 2504.13055 β’ Published Apr 17 β’ 19
SCITAT: A Question Answering Benchmark for Scientific Tables and Text Covering Diverse Reasoning Types Paper β’ 2412.11757 β’ Published Dec 16, 2024
Efficient Process Reward Model Training via Active Learning Paper β’ 2504.10559 β’ Published Apr 14 β’ 13
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Paper β’ 2407.13623 β’ Published Jul 18, 2024 β’ 57
RegMix: Data Mixture as Regression for Language Model Pre-training Paper β’ 2407.01492 β’ Published Jul 1, 2024 β’ 41
Sailor: Open Language Models for South-East Asia Paper β’ 2404.03608 β’ Published Apr 4, 2024 β’ 21
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning Paper β’ 2304.07995 β’ Published Apr 17, 2023 β’ 3
MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing Paper β’ 2212.13492 β’ Published Dec 27, 2022 β’ 2