SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images

Abstract

Recent advances in Neural Radiance Fields (NeRFs) treat the problem of novel view synthesis as Sparse Radiance Field (SRF) optimization using sparse voxels for efficient and fast rendering (Plenoxels, InstantNGP). In order to leverage machine learning and adoption of SRFs as a 3D representation, we present SPARF, a large-scale ShapeNet-based synthetic dataset for novel view synthesis consisting of ~ 17 million images rendered from nearly 40,000 shapes at high resolution (400 X 400 pixels). The dataset is orders of magnitude larger than existing synthetic datasets for novel view synthesis and includes more than one million 3D-optimized radiance fields with multiple voxel resolutions. Furthermore, we propose a novel pipeline (SuRFNet) that learns to generate sparse voxel radiance fields from only few views. This is done by using the densely collected SPARF dataset and 3D sparse convolutions. SuRFNet employs partial SRFs from few/one images and a specialized SRF loss to learn to generate high-quality sparse voxel radiance fields that can be rendered from novel views. Our approach achieves state-of-the-art results in the task of unconstrained novel view synthesis based on few views on ShapeNet as compared to recent baselines.

Video

Multi-Resolution Sparse Radiance Fields

sparf is a large dataset of Sparse Radiance Fields (SRFs) on multiple voxel resolutions: 32³, 128³, and 512³.

32³ Voxels 128³ Voxels 512³ Voxels

SPARF: Posed Multi-View Dataset

sparf is a large high resolution posed multi-view dataset, compared to other shapes datasets.

SuRFNet: Learning to Generate Sparse Radiance Fields

The SuRFNet pipelin consists of learning sparse convolution network conditioned on partial SRFs. The pipeline utilize a combination of losses on the genrated SRFs and a perceptual loss on the rendered images from the output SRF.

Mesh Extraction from SRFs

Since Sparse Radiance Fields (SRFs) are explicit 3D structures, obtaining high quality 3D meshes is straightforward.

Novel Views Synthesis from a Single View

SuRFNet naturally enables novel views Synthesis from a single view by cnoditioning the output SRF on the partial SRFS based on the few input views. This piepline achieves state-of-the-art on novel views synthesis on unseen shapes compared to recent baselines (PixelNeRF, and VisionNeRF).

BibTeX

@InProceedings{Hamdi_2023_ICCV,
        author    = {Hamdi, Abdullah and Ghanem, Bernard and Nie{\ss}sner, Matthias},
        title     = {SPARF: Large-Scale Learning of 3D Sparse Radiance Fields from Few Input Images},
        booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops},
        month     = {October},
        year      = {2023},
        pages     = {2930-2940}
    }