SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond Paper • 2505.19641 • Published May 26 • 67
Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging Paper • 2505.05464 • Published May 8 • 11
Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas Paper • 2503.01773 • Published Mar 3
FELM: Benchmarking Factuality Evaluation of Large Language Models Paper • 2310.00741 • Published Oct 1, 2023
Evaluating Factual Consistency of Summaries with Large Language Models Paper • 2305.14069 • Published May 23, 2023
Composing Parameter-Efficient Modules with Arithmetic Operations Paper • 2306.14870 • Published Jun 26, 2023 • 3
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios Paper • 2307.13528 • Published Jul 25, 2023 • 1
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18 • 19
SkyLadder: Better and Faster Pretraining via Context Window Scheduling Paper • 2503.15450 • Published Mar 19 • 12
HARE: HumAn pRiors, a key to small language model Efficiency Paper • 2406.11410 • Published Jun 17, 2024 • 39