
# NerfBaselines
NerfBaselines is a framework for evaluating and comparing existing NeRF and 3DGS methods. Currently, most official implementations use different dataset loaders, evaluation protocols, and metrics, which makes benchmarking difficult. This project therefore provides a unified interface for running and evaluating methods on different datasets in a consistent way using the same metrics. Instead of reimplementing the methods, we use the official implementations and wrap them so that they can all be run easily through the same interface.
Please visit the project page to see the results of implemented methods on dataset benchmarks.
Project Page + Results | Paper
## Getting started
Start by installing the `nerfbaselines` pip package on your host system:
```bash
pip install nerfbaselines
```
Now you can use the `nerfbaselines` CLI to interact with NerfBaselines.
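To get a quick overview of the available commands, you can ask the CLI for help:
```bash
nerfbaselines --help
```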
The next step is to choose the backend that will be used to install and run the individual methods. At the moment, the following backends are implemented:
- `docker`: Offers good isolation; requires `docker` (with the NVIDIA Container Toolkit) to be installed and the user to have access to it (i.e., to be in the `docker` user group).
- `apptainer`: Similar level of isolation as `docker`, but does not require the user to have privileged access.
- `conda` (default): Does not require docker/apptainer to be installed, but does not offer the same level of isolation, and some methods require additional dependencies to be installed. Also, some methods are not implemented for this backend because they rely on dependencies not available on `conda`.
- `python`: Runs everything directly in the current environment. Everything needs to be installed in the environment for this backend to work.

The backend can be selected with the `--backend <backend>` argument or via the `NERFBASELINES_BACKEND` environment variable, as shown in the sketch below.
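For example (the method name `gaussian-splatting` is illustrative here; see the Training section for how to list the registered methods):
```bash
# Select the backend for a single command via the CLI flag...
nerfbaselines train --method gaussian-splatting --data external://mipnerf360/garden --backend docker

# ...or set it once for the whole shell session via the environment variable.
export NERFBASELINES_BACKEND=docker
nerfbaselines train --method gaussian-splatting --data external://mipnerf360/garden
```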
## Downloading data
For some datasets, e.g., Mip-NeRF 360, NerfStudio, Blender, or Tanks and Temples, the data can be downloaded automatically. You can either specify the argument `--data external://dataset/scene` during training, or download the dataset beforehand by running `nerfbaselines download-dataset dataset/scene`.
Examples:
```bash
# Downloads the garden scene to the cache folder.
nerfbaselines download-dataset mipnerf360/garden

# Downloads all nerfstudio scenes to the cache folder.
nerfbaselines download-dataset nerfstudio

# Downloads the kitchen scene to the folder `kitchen`.
nerfbaselines download-dataset mipnerf360/kitchen -o kitchen
```
## Training
To start the training, use the `nerfbaselines train --method <method> --data <data>` command. Use the `--help` argument to learn about all implemented methods and supported features.
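A minimal training run could look like the following (assuming the illustrative method identifier `gaussian-splatting`; `--help` lists the actual ones):
```bash
# Train on an automatically downloaded scene...
nerfbaselines train --method gaussian-splatting --data external://mipnerf360/garden

# ...or on a local copy, e.g., the kitchen folder downloaded above.
nerfbaselines train --method gaussian-splatting --data kitchen
```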
## Rendering
The `nerfbaselines render --checkpoint <checkpoint>` command can be used to render images from a trained checkpoint. Again, use `--help` to learn about the arguments.
In order to render a camera trajectory (e.g., created using the interactive viewer), use the following command:
```bash
nerfbaselines render-trajectory --checkpoint <checkpoint> --trajectory <trajectory> --output <output.mp4>
```
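For instance, a concrete invocation might look like this (the checkpoint path and trajectory file name are illustrative; the trajectory itself can be created in the interactive viewer):
```bash
# Render a fly-through video from a saved camera trajectory.
nerfbaselines render-trajectory \
    --checkpoint output/garden/checkpoint \
    --trajectory trajectory.json \
    --output garden.mp4
```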
## Interactive viewer
Given a trained checkpoint, the interactive viewer can be launched as follows:
```bash
nerfbaselines viewer --checkpoint <checkpoint> --data <dataset>
```
Even though the `--data <dataset>` argument is optional, it is recommended: the camera poses are used to perform gravity alignment and rescaling for a better viewing experience, and it also enables visualizing the input camera frustums. A concrete example follows below.
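For example, with the illustrative paths from the previous sections:
```bash
# Launch the viewer on a trained checkpoint, passing the training data
# to enable gravity alignment and camera-frustum visualization.
nerfbaselines viewer --checkpoint output/garden/checkpoint --data external://mipnerf360/garden
```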
## Results
In this section, we present the results of the implemented methods on standard benchmark datasets. For detailed results, visit the project page: https://jkulhanek.com/nerfbaselines
### Mip-NeRF 360
Mip-NeRF 360 is a collection of four indoor and five outdoor object-centric scenes. The camera trajectory is an orbit around the object with fixed elevation and radius, and every n-th frame of the trajectory is held out as a test view. Detailed results are available on the project page: https://jkulhanek.com/nerfbaselines/mipnerf360
Method | PSNR | SSIM | LPIPS (VGG) | Time | GPU mem. |
---|---|---|---|---|---|
Zip-NeRF | 28.553 | 0.829 | 0.218 | 5h 30m 20s | 26.8 GB |
Mip-NeRF 360 | 27.681 | 0.792 | 0.272 | 30h 14m 36s | 33.6 GB |
Mip-Splatting | 27.492 | 0.815 | 0.258 | 25m 37s | 11.0 GB |
Gaussian Splatting | 27.434 | 0.814 | 0.257 | 23m 25s | 11.1 GB |
Gaussian Opacity Fields | 27.421 | 0.826 | 0.234 | 1h 3m 54s | 28.4 GB |
NerfStudio | 26.388 | 0.731 | 0.343 | 19m 30s | 5.9 GB |
Instant NGP | 25.507 | 0.684 | 0.398 | 3m 54s | 7.8 GB |
### Blender
Blender (nerf-synthetic) is a synthetic dataset used to benchmark NeRF methods. It consists of 8 scenes, each showing an object placed on a white background, with cameras placed on a hemisphere around the object. Detailed results are available on the project page: https://jkulhanek.com/nerfbaselines/blender
Method | PSNR | SSIM | LPIPS (VGG) | Time | GPU mem. |
---|---|---|---|---|---|
Zip-NeRF | 33.670 | 0.973 | 0.036 | 5h 21m 57s | 26.2 GB |
Gaussian Opacity Fields | 33.451 | 0.969 | 0.038 | 18m 26s | 3.1 GB |
Mip-Splatting | 33.330 | 0.969 | 0.039 | 6m 49s | 2.7 GB |
Gaussian Splatting | 33.308 | 0.969 | 0.037 | 6m 6s | 3.1 GB |
TensoRF | 33.172 | 0.963 | 0.051 | 10m 47s | 16.4 GB |
K-Planes | 32.265 | 0.961 | 0.062 | 23m 58s | 4.6 GB |
Instant NGP | 32.198 | 0.959 | 0.055 | 2m 23s | 2.6 GB |
Tetra-NeRF | 31.951 | 0.957 | 0.056 | 6h 53m 20s | 29.6 GB |
Mip-NeRF 360 | 30.345 | 0.951 | 0.060 | 3h 29m 39s | 114.8 GB |
NerfStudio | 29.191 | 0.941 | 0.095 | 9m 38s | 3.6 GB |
NeRF | 28.723 | 0.936 | 0.092 | 23h 26m 30s | 10.2 GB |
### Tanks and Temples
Tanks and Temples is a benchmark for image-based 3D reconstruction. The benchmark sequences were acquired outside the lab, in realistic conditions. Ground-truth data was captured using an industrial laser scanner. The benchmark includes both outdoor scenes and indoor environments. The dataset is split into three subsets: training, intermediate, and advanced. Detailed results are available on the project page: https://jkulhanek.com/nerfbaselines/tanksandtemples
Method | PSNR | SSIM | LPIPS | Time | GPU mem. |
---|---|---|---|---|---|
Zip-NeRF | 24.628 | 0.840 | 0.131 | 5h 44m 9s | 26.6 GB |
Mip-Splatting | 23.930 | 0.833 | 0.166 | 15m 56s | 7.3 GB |
Gaussian Splatting | 23.827 | 0.831 | 0.165 | 13m 48s | 6.9 GB |
Gaussian Opacity Fields | 22.395 | 0.825 | 0.172 | - | - |
NerfStudio | 22.043 | 0.743 | 0.270 | 19m 27s | 3.7 GB |
Instant NGP | 21.623 | 0.712 | 0.340 | 4m 27s | 4.1 GB |
## Reproducing results
The following table tracks the status of reproducing the official results of each method on each dataset:
Method | Mip-NeRF 360 | Blender | NerfStudio | Tanks and Temples | LLFF |
---|---|---|---|---|---|
NerfStudio | 🥇 gold | 🥇 gold | ❔ | 🥇 gold | ❌ |
Instant-NGP | 🥇 gold | 🥇 gold | 🥇 gold | 🥇 gold | ❌ |
Gaussian Splatting | 🥇 gold | 🥇 gold | ❌ | 🥇 gold | ❌ |
Mip-Splatting | 🥇 gold | 🥇 gold | ❌ | 🥇 gold | ❌ |
Gaussian Opacity Fields | 🥇 gold | 🥇 gold | ❌ | 🥇 gold | ❌ |
Tetra-NeRF | 🥈 silver | 🥈 silver | ❔ | ❔ | ❌ |
Mip-NeRF 360 | 🥇 gold | 🥇 gold | ❔ | ❔ | ❌ |
Zip-NeRF | 🥇 gold | 🥇 gold | 🥇 gold | 🥇 gold | ❌ |
CamP | ❔ | ❔ | ❔ | ❔ | ❌ |
TensoRF | ❌ | 🥇 gold | ❔ | ❔ | 🥇 gold |
NeRF | ❔ | 🥇 gold | ❔ | ❔ | ❔ |
## Implementation status
Methods:
- [x] NerfStudio (Nerfacto)
- [x] Instant-NGP
- [x] Gaussian Splatting
- [x] Mip-Splatting
- [x] Gaussian Opacity Fields
- [x] Tetra-NeRF
- [x] Mip-NeRF 360
- [x] Zip-NeRF
- [x] CamP
- [x] TensoRF
- [x] K-Planes
- [ ] Nerf-W (open source reimplementation)
- [ ] NeRF on-the-go
- [ ] TRIPS
- [ ] Mip-NeRF
- [ ] NeRF
Datasets/features:
- [x] Mip-NeRF 360 dataset
- [x] Blender dataset
- [x] any COLMAP dataset
- [x] any NerfStudio dataset
- [x] LLFF dataset
- [x] Tanks and Temples dataset
- [x] Photo Tourism dataset and evaluation protocol
- [x] Bundler dataset format
- [x] automatic dataset download
- [x] interactive viewer and trajectory editor
- [x] undistorting images for methods that do not support complex camera models (Gaussian Splatting)
- [x] logging to tensorboard, wandb
- [ ] HDR images support
- [ ] RAW images support
## Contributing
Contributions are very much welcome. Please open a PR with a dataset/method/feature that you want to contribute. The goal of this project is to slowly expand by implementing more and more methods.
## Citation
If you use this project in your research, please cite the following paper:
```bibtex
@article{kulhanek2024nerfbaselines,
  title={NerfBaselines: Consistent and Reproducible Evaluation of Novel View Synthesis Methods},
  author={Jonas Kulhanek and Torsten Sattler},
  year={2024},
  journal={arXiv},
}
```
## License
This project is licensed under the MIT license. Each implemented method is licensed under the license provided by the authors of the method. For the currently implemented methods, the following licenses apply:
- NerfStudio: Apache 2.0
- Instant-NGP: custom, research purposes only
- Gaussian-Splatting: custom, research purposes only
- Mip-Splatting: custom, research purposes only
- Gaussian Opacity Fields: custom, research purposes only
- Tetra-NeRF: MIT, Apache 2.0
- Mip-NeRF 360: Apache 2.0
- Zip-NeRF: Apache 2.0
- CamP: Apache 2.0
## Acknowledgements
A big thanks to the authors of all implemented methods for the great work they have done. We would also like to thank the authors of NerfStudio, especially Brent Yi, for viser - a great framework powering the viewer. This work was supported by the Czech Science Foundation (GAČR) EXPRO (grant no. 23-07973X), the Grant Agency of the Czech Technical University in Prague (grant no. SGS24/095/OHK3/2T/13), and by the Ministry of Education, Youth and Sports of the Czech Republic through the e-INFRA CZ (ID:90254).