-
Neural-Driven Image Editing
Paper • 2507.05397 • Published • 26 -
π^3: Scalable Permutation-Equivariant Visual Geometry Learning
Paper • 2507.13347 • Published • 58 -
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second
Paper • 2507.10065 • Published • 23 -
From One to More: Contextual Part Latents for 3D Generation
Paper • 2507.08772 • Published • 20
Xuejian Rong
xrong
·
AI & ML interests
None yet
Recent Activity
updated
a collection
8 days ago
inbox
updated
a collection
8 days ago
inbox
updated
a collection
8 days ago
inbox
Organizations
None yet
Multi-Modal
Depth
-
DepthFM: Fast Monocular Depth Estimation with Flow Matching
Paper • 2403.13788 • Published • 17 -
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Paper • 2406.01493 • Published • 23 -
NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices
Paper • 2408.10161 • Published • 15 -
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Paper • 2409.02095 • Published • 37
3DGS
-
CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting
Paper • 2404.09458 • Published • 7 -
FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering
Paper • 2408.12894 • Published • 6 -
Towards Realistic Example-based Modeling via 3D Gaussian Stitching
Paper • 2408.15708 • Published • 8 -
3D Reconstruction with Spatial Memory
Paper • 2408.16061 • Published • 15
VLM
Editing
Foundation
-
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper • 2409.18124 • Published • 34 -
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Paper • 2409.18125 • Published • 35 -
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Paper • 2410.11795 • Published • 18 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90
Outpainting
AnyRes
-
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
Paper • 2403.10516 • Published • 16 -
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
Paper • 2403.12963 • Published • 8 -
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Paper • 2408.11001 • Published • 12
Inpainting
-
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paper • 2404.18212 • Published • 30 -
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Paper • 2407.16982 • Published • 43 -
NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices
Paper • 2408.10161 • Published • 15
Misc
Human
inbox
-
Neural-Driven Image Editing
Paper • 2507.05397 • Published • 26 -
π^3: Scalable Permutation-Equivariant Visual Geometry Learning
Paper • 2507.13347 • Published • 58 -
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second
Paper • 2507.10065 • Published • 23 -
From One to More: Contextual Part Latents for 3D Generation
Paper • 2507.08772 • Published • 20
Foundation
-
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper • 2409.18124 • Published • 34 -
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Paper • 2409.18125 • Published • 35 -
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Paper • 2410.11795 • Published • 18 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90
Multi-Modal
Outpainting
Depth
-
DepthFM: Fast Monocular Depth Estimation with Flow Matching
Paper • 2403.13788 • Published • 17 -
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Paper • 2406.01493 • Published • 23 -
NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices
Paper • 2408.10161 • Published • 15 -
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Paper • 2409.02095 • Published • 37
AnyRes
-
FeatUp: A Model-Agnostic Framework for Features at Any Resolution
Paper • 2403.10516 • Published • 16 -
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
Paper • 2403.12963 • Published • 8 -
MegaFusion: Extend Diffusion Models towards Higher-resolution Image Generation without Further Tuning
Paper • 2408.11001 • Published • 12
3DGS
-
CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting
Paper • 2404.09458 • Published • 7 -
FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering
Paper • 2408.12894 • Published • 6 -
Towards Realistic Example-based Modeling via 3D Gaussian Stitching
Paper • 2408.15708 • Published • 8 -
3D Reconstruction with Spatial Memory
Paper • 2408.16061 • Published • 15
Inpainting
-
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
Paper • 2404.18212 • Published • 30 -
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Paper • 2407.16982 • Published • 43 -
NeuFlow v2: High-Efficiency Optical Flow Estimation on Edge Devices
Paper • 2408.10161 • Published • 15
VLM
Misc
Editing
Human