research-article

Open access

NeRF: representing scenes as neural radiance fields for view synthesis

Authors:

Ben Mildenhall,

Pratul P. Srinivasan,

Matthew Tancik,

Jonathan T. Barron,

Ravi Ramamoorthi,

Ren Ng

Communications of the ACM, Volume 65, Issue 1

Pages 99 - 106

https://doi.org/10.1145/3503250

Published: 17 December 2021

Abstract

We present a method that achieves state-of-the-art results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views. Our algorithm represents a scene using a fully connected (nonconvolutional) deep network, whose input is a single continuous 5D coordinate (spatial location (x, y, z) and viewing direction (θ, ϕ)) and whose output is the volume density and view-dependent emitted radiance at that spatial location. We synthesize views by querying 5D coordinates along camera rays and use classic volume rendering techniques to project the output colors and densities into an image. Because volume rendering is naturally differentiable, the only input required to optimize our representation is a set of images with known camera poses. We describe how to effectively optimize neural radiance fields to render photorealistic novel views of scenes with complicated geometry and appearance, and demonstrate results that outperform prior work on neural rendering and view synthesis.
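The rendering step described in the abstract can be illustrated with a minimal sketch. It assumes one common discretization of classic volume rendering (alpha compositing of evenly spaced samples along a ray); the abstract itself does not give the formula. The function names, the spherical-angle convention, the midpoint sampling, and `toy_field` (a hand-written stand-in for the trained network) are all hypothetical choices made for illustration.

```python
import math

def direction_from_angles(theta, phi):
    """Unit viewing direction from (theta, phi).

    Assumed convention: theta is the polar angle from +z, phi the azimuth.
    """
    return (math.sin(theta) * math.cos(phi),
            math.sin(theta) * math.sin(phi),
            math.cos(theta))

def render_ray(field, origin, direction, near, far, n_samples=64):
    """Composite a colour along one camera ray.

    `field(point, direction)` must return (density, (r, g, b)).
    Returns the accumulated colour and the remaining transmittance.
    """
    delta = (far - near) / n_samples           # spacing between samples
    transmittance = 1.0                        # fraction of light not yet absorbed
    color = [0.0, 0.0, 0.0]
    for i in range(n_samples):
        t = near + (i + 0.5) * delta           # midpoint of the i-th interval
        point = tuple(o + t * d for o, d in zip(origin, direction))
        sigma, rgb = field(point, direction)   # query the 5D scene function
        alpha = 1.0 - math.exp(-sigma * delta) # opacity of this interval
        weight = transmittance * alpha
        color = [c + weight * v for c, v in zip(color, rgb)]
        transmittance *= 1.0 - alpha
    return tuple(color), transmittance

def toy_field(point, direction):
    """Hypothetical scene: a uniformly dense red sphere of radius 1 at the origin."""
    inside = sum(p * p for p in point) < 1.0
    return (5.0 if inside else 0.0), (1.0, 0.0, 0.0)

if __name__ == "__main__":
    view = direction_from_angles(0.0, 0.0)     # looking along +z
    print(render_ray(toy_field, (0.0, 0.0, -3.0), view, 0.0, 6.0))
```

Every operation in `render_ray` is a smooth function of the densities and colours returned by `field`, which is why a representation of this kind can in principle be optimized by gradient descent on the difference between rendered and observed pixels. In the method the abstract describes, `field` would be the fully connected network rather than a fixed function.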

Cited By

Zhu W, Chen X, Jiang L (2025). PV-LaP: Multi-sensor fusion for 3D Scene Understanding in intelligent transportation systems. Signal Processing 227 (109749). Online publication date: Feb-2025.
https://doi.org/10.1016/j.sigpro.2024.109749
Zhang Y, Li T, Wei Z, Qu Y (2025). Optimization of sparse camera array arrangement using grid-based method. Optics Communications 574 (131137). Online publication date: Jan-2025.
https://doi.org/10.1016/j.optcom.2024.131137
Hu Q, Wei X, Cheng R, Xu H, Cai Y, Yin Y, He W (2025). Visual localization of robotic end effector via fusion of 3D Gaussian Splatting and heuristic optimization algorithm. Measurement 242 (116195). Online publication date: Jan-2025.
https://doi.org/10.1016/j.measurement.2024.116195

Index Terms

NeRF: representing scenes as neural radiance fields for view synthesis
1. Computing methodologies
  1. Computer graphics
    1. Image manipulation
      1. Image-based rendering

Information & Contributors

Information

Published In

Communications of the ACM Volume 65, Issue 1

January 2022

106 pages

ISSN: 0001-0782

EISSN: 1557-7317

DOI: 10.1145/3507640

Editor:
Andrew A. Chien
Association for Computing Machinery, New York, NY

Copyright © 2021 Owner/Author.

This work is licensed under a Creative Commons Attribution International 4.0 License.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 December 2021

Published in CACM Volume 65, Issue 1

Qualifiers

Research-article
Research
Refereed

Bibliometrics & Citations

Bibliometrics

Article Metrics

Total Citations: 2,004
Total Downloads: 63,593
Downloads (Last 12 months): 22,865
Downloads (Last 6 weeks): 2,985

Reflects downloads up to 18 Nov 2024

Citations

Cited By

Chi P, Wang Z, Liao H, Li T, Wu X, Zhang Q (2025). Towards new-generation of intelligent welding manufacturing: A systematic review on 3D vision measurement and path planning of humanoid welding robots. Measurement 242 (116065). Online publication date: Jan-2025.
https://doi.org/10.1016/j.measurement.2024.116065
Zhou J, Liang T, Zhang D, Liu S, Wang J, Wu E (2025). WaterHE-NeRF: Water-ray matching neural radiance fields for underwater scene reconstruction. Information Fusion 115 (102770). Online publication date: Mar-2025.
https://doi.org/10.1016/j.inffus.2024.102770
Liu D, Wang Z, Chen P (2025). DSEM-NeRF: Multimodal feature fusion and global–local attention for enhanced 3D scene reconstruction. Information Fusion 115 (102752). Online publication date: Mar-2025.
https://doi.org/10.1016/j.inffus.2024.102752
Xu H, Chen J, Meng S, Wang Y, Chau L (2025). A survey on occupancy perception for autonomous driving: The information fusion perspective. Information Fusion 114 (102671). Online publication date: Feb-2025.
https://doi.org/10.1016/j.inffus.2024.102671
Balado J, Garozzo R, Winiwarter L, Tilon S (2025). A systematic literature review of low-cost 3D mapping solutions. Information Fusion 114 (102656). Online publication date: Feb-2025.
https://doi.org/10.1016/j.inffus.2024.102656
Wang W, An L, Zhou M, Han G (2025). Neighborhood transformer for sparse-view X-ray 3D foot reconstruction. Biomedical Signal Processing and Control 100 (107082). Online publication date: Feb-2025.
https://doi.org/10.1016/j.bspc.2024.107082
Lian H, Li X, Qu Y, Du J, Meng Z, Liu J, Chen L (2025). Bayesian uncertainty analysis for underwater 3D reconstruction with neural radiance fields. Applied Mathematical Modelling 138 (115806). Online publication date: Feb-2025.
https://doi.org/10.1016/j.apm.2024.115806
