DreamPolisher: Towards High-Quality Text-to-3D Generation via Geometric Diffusion Paper • 2403.17237 • Published Mar 25, 2024 • 11
Olympus: A Universal Task Router for Computer Vision Tasks Paper • 2412.09612 • Published Dec 12, 2024 • 4
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering Paper • 2206.01201 • Published Jun 2, 2022
Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding Paper • 2203.08481 • Published Mar 16, 2022
AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition Paper • 2112.14238 • Published Dec 28, 2021
IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation Paper • 2506.03150 • Published Jun 3 • 21