Language models scale reliably with over-training and on downstream tasks Paper • 2403.08540 • Published Mar 13, 2024 • 15
pix2gestalt: Amodal Segmentation by Synthesizing Wholes Paper • 2401.14398 • Published Jan 25, 2024 • 10
Understanding Video Transformers via Universal Concept Discovery Paper • 2401.10831 • Published Jan 19, 2024 • 8