Sparc3D: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling
Abstract
Sparc3D combines sparse deformable marching cubes with a sparse convolutional VAE for high-fidelity 3D object synthesis and generation, improving detail preservation and efficiency.
High-fidelity 3D object synthesis remains significantly more challenging than 2D image generation due to the unstructured nature of mesh data and the cubic complexity of dense volumetric grids. Existing two-stage pipelines, which compress meshes with a VAE (using either 2D or 3D supervision) and then sample with latent diffusion, often suffer from severe detail loss caused by inefficient representations and modality mismatches introduced in the VAE. We introduce Sparc3D, a unified framework that combines a sparse deformable marching cubes representation, Sparcubes, with a novel encoder, Sparconv-VAE. Sparcubes converts raw meshes into high-resolution (1024^3) surfaces with arbitrary topology by scattering signed distance and deformation fields onto a sparse cube, allowing differentiable optimization. Sparconv-VAE is the first modality-consistent variational autoencoder built entirely upon sparse convolutional networks, enabling efficient and near-lossless 3D reconstruction suitable for high-resolution generative modeling through latent diffusion. Sparc3D achieves state-of-the-art reconstruction fidelity on challenging inputs, including open surfaces, disconnected components, and intricate geometry. It preserves fine-grained shape details, reduces training and inference cost, and integrates naturally with latent diffusion models for scalable, high-resolution 3D generation.
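The contrast between the cubic cost of dense grids and a sparse, surface-aligned representation can be illustrated with a small sketch. This is not the Sparcubes algorithm: it only stores signed distance values for voxels within a narrow band around an analytic sphere, and the function names, the sphere radius, and the band width are assumptions chosen for illustration. It shows that the number of active voxels grows roughly with the square of the resolution, while a dense grid grows with its cube.

```python
import numpy as np

def sphere_sdf(points, radius=0.4):
    """Signed distance from each point to a sphere centered at the origin."""
    return np.linalg.norm(points, axis=-1) - radius

def sparse_narrow_band(resolution, band_voxels=1.5):
    """Return integer coordinates and SDF values of voxels near the surface.

    Hypothetical helper: keeps only voxels whose center lies within
    `band_voxels` voxel widths of the zero level set.
    """
    # Voxel centers of a regular grid covering [-0.5, 0.5]^3.
    axis = (np.arange(resolution) + 0.5) / resolution - 0.5
    grid = np.stack(np.meshgrid(axis, axis, axis, indexing="ij"), axis=-1)
    sdf = sphere_sdf(grid)
    voxel_size = 1.0 / resolution
    mask = np.abs(sdf) < band_voxels * voxel_size
    coords = np.argwhere(mask)   # (K, 3) integer voxel indices
    values = sdf[mask]           # (K,) signed distances, same order as coords
    return coords, values

for res in (32, 64):
    coords, values = sparse_narrow_band(res)
    dense = res ** 3
    print(f"{res}^3 grid: dense={dense}, active={len(coords)}, "
          f"fraction={len(coords) / dense:.4f}")
```

The sketch builds the dense grid first only to keep the code short; a practical pipeline at 1024^3 would have to find the active voxels without ever allocating the full volume.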
Community
2D to 3D image transformation service available on your platform. Unfortunately, I've encountered an issue where the service fails to initialize properly and does not complete the transformation process.