UniFusion: Vision-Language Model as Unified Encoder in Image Generation Paper • 2510.12789 • Published Oct 14 • 18
Qwen/Qwen3-VL-235B-A22B-Instruct Image-Text-to-Text • 236B • Updated about 16 hours ago • 71k • 316
Qwen/Qwen3-VL-235B-A22B-Thinking Image-Text-to-Text • 236B • Updated about 16 hours ago • 11.4k • 327