A Preliminary Study for GPT-4o on Image Restoration
Abstract
GPT-4o, while generating visually appealing images, often lacks structural fidelity and serves as a visual prior to enhance dehazing, derainning, and low-light enhancement tasks in image restoration.
OpenAI's GPT-4o model, integrating multi-modal inputs and outputs within an autoregressive architecture, has demonstrated unprecedented performance in image generation. In this work, we investigate its potential impact on the image restoration community. We present the first systematic evaluation of GPT-4o across diverse restoration tasks. Our experiments reveal that, although restoration outputs from GPT-4o are visually appealing, they often suffer from pixel-level structural fidelity when compared to ground-truth images. Common issues are variations in image proportions, shifts in object positions and quantities, and changes in viewpoint.To address it, taking image dehazing, derainning, and low-light enhancement as representative case studies, we show that GPT-4o's outputs can serve as powerful visual priors, substantially enhancing the performance of existing dehazing networks. It offers practical guidelines and a baseline framework to facilitate the integration of GPT-4o into future image restoration pipelines. We hope the study on GPT-4o image restoration will accelerate innovation in the broader field of image generation areas. To support further research, we will release GPT-4o-restored images from over 10 widely used image restoration datasets.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Beyond Degradation Redundancy: Contrastive Prompt Learning for All-in-One Image Restoration (2025)
- DPMambaIR:All-in-One Image Restoration via Degradation-Aware Prompt State Space Model (2025)
- UniFlowRestore: A General Video Restoration Framework via Flow Matching and Prompt Guidance (2025)
- GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation (2025)
- ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration (2025)
- Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution (2025)
- Beyond Degradation Conditions: All-in-One Image Restoration via HOG Transformers (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 1
Spaces citing this paper 0
No Space linking this paper