Scaling Agent Learning via Experience Synthesis Paper • 2511.03773 • Published Nov 5, 2025 • 81
Scaling Agent Learning via Experience Synthesis Paper • 2511.03773 • Published Nov 5, 2025 • 81
MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning Paper • 2505.24871 • Published May 30, 2025 • 23
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models Paper • 2410.02740 • Published Oct 3, 2024 • 54
Introducing v0.5 of the AI Safety Benchmark from MLCommons Paper • 2404.12241 • Published Apr 18, 2024 • 13
Interpolation for Robust Learning: Data Augmentation on Geodesics Paper • 2302.02092 • Published Feb 4, 2023 • 1
Asymmetry in Low-Rank Adapters of Foundation Models Paper • 2402.16842 • Published Feb 26, 2024 • 2
Asymmetry in Low-Rank Adapters of Foundation Models Paper • 2402.16842 • Published Feb 26, 2024 • 2