MIRA: Multimodal Iterative Reasoning Agent for Image Editing Paper • 2511.21087 • Published 4 days ago • 9
VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation Paper • 2308.14710 • Published Aug 28, 2023
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting Paper • 2503.08677 • Published Mar 11 • 29
MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models Paper • 2505.19415 • Published May 26 • 2
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models Paper • 2510.05034 • Published Oct 6 • 48