V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning Paper • 2506.09985 • Published Jun 11, 2025 • 28
CLUTRR: A Diagnostic Benchmark for Inductive Reasoning from Text Paper • 1908.06177 • Published Aug 16, 2019
Learning an Unreferenced Metric for Online Dialogue Evaluation Paper • 2005.00583 • Published May 1, 2020
How sensitive are translation systems to extra contexts? Mitigating gender bias in Neural Machine Translation models through relevant contexts Paper • 2205.10762 • Published May 22, 2022
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning Paper • 2412.14164 • Published Dec 18, 2024 • 4
Efficient Tool Use with Chain-of-Abstraction Reasoning Paper • 2401.17464 • Published Jan 30, 2024 • 21