Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor Paper • 2507.07106 • Published Jul 9 • 1