Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing Paper • 2510.19808 • Published 12 days ago • 28
ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding Paper • 2507.14533 • Published Jul 19 • 5
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding Paper • 2510.06308 • Published 27 days ago • 52
Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis Paper • 2510.15710 • Published 17 days ago • 5
PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published 14 days ago • 61
PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published 14 days ago • 61
PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published 14 days ago • 61 • 3
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision Paper • 2504.04903 • Published Apr 7
Factuality Matters: When Image Generation and Editing Meet Structured Visuals Paper • 2510.05091 • Published 28 days ago • 18
A Comparative Study of Image Restoration Networks for General Backbone Network Design Paper • 2310.11881 • Published Oct 18, 2023