Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning Paper • 2507.17512 • Published 6 days ago • 33
REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once Paper • 2507.10541 • Published 15 days ago • 28
CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges Paper • 2504.19093 • Published Apr 27 • 17
A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis Paper • 2504.12322 • Published Apr 11 • 28
Heimdall: test-time scaling on the generative verification Paper • 2504.10337 • Published Apr 14 • 33
FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding Paper • 2504.09925 • Published Apr 14 • 38
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published Apr 14 • 279
BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning Paper • 2402.17810 • Published Feb 27, 2024 • 1
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation Paper • 2504.02782 • Published Apr 3 • 58
LEMMA: Learning from Errors for MatheMatical Advancement in LLMs Paper • 2503.17439 • Published Mar 21 • 15
MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion Paper • 2503.16212 • Published Mar 20 • 25
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer Paper • 2503.14891 • Published Mar 19 • 22
BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations Paper • 2310.07276 • Published Oct 11, 2023 • 5
3D-MolT5: Towards Unified 3D Molecule-Text Modeling with 3D Molecular Tokenization Paper • 2406.05797 • Published Jun 9, 2024 • 1
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Paper • 2406.11931 • Published Jun 17, 2024 • 65
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4 Paper • 2311.07361 • Published Nov 13, 2023 • 14