RegionGPT: Towards Region Understanding Vision Language Model Paper • 2403.02330 • Published Mar 4, 2024 • 2
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots Paper • 2405.07990 • Published May 13, 2024 • 21
EGC: Image Generation and Classification via a Diffusion Energy-Based Model Paper • 2304.02012 • Published Apr 4, 2023 • 1
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths Paper • 2305.18295 • Published May 29, 2023 • 8