research-article

Single image HDR reconstruction using a CNN with masked features and perceptual loss

Authors:

Marcel Santana Santos,

Nima Khademi KalantariAuthors Info & Claims

ACM Transactions on Graphics (TOG), Volume 39, Issue 4

Article No.: 80, Pages 80:1 - 80:10

https://doi.org/10.1145/3386569.3392403

Published: 12 August 2020 Publication History

Abstract

Digital cameras can only capture a limited range of real-world scenes' luminance, producing images with saturated pixels. Existing single image high dynamic range (HDR) reconstruction methods attempt to expand the range of luminance, but are not able to hallucinate plausible textures, producing results with artifacts in the saturated areas. In this paper, we present a novel learning-based approach to reconstruct an HDR image by recovering the saturated pixels of an input LDR image in a visually pleasing way. Previous deep learning-based methods apply the same convolutional filters on wellexposed and saturated pixels, creating ambiguity during training and leading to checkerboard and halo artifacts. To overcome this problem, we propose a feature masking mechanism that reduces the contribution of the features from the saturated areas. Moreover, we adapt the VGG-based perceptual loss function to our application to be able to synthesize visually pleasing textures. Since the number of HDR images for training is limited, we propose to train our system in two stages. Specifically, we first train our system on a large number of images for image inpainting task and then fine-tune it on HDR reconstruction. Since most of the HDR examples contain smooth regions that are simple to reconstruct, we propose a sampling strategy to select challenging training patches during the HDR fine-tuning stage. We demonstrate through experimental results that our approach can reconstruct visually pleasing HDR results, better than the current state of the art on a wide range of scenes.

Supplemental Material

ZIP File

Supplemental files.

Download
319.08 MB

References

[1]

Francesco Banterle, Patrick Ledda, Kurt Debattista, and Alan Chalmers. 2006. Inverse tone mapping. In Proceedings of the 4th International Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia. ACM, 349--356.

Digital Library

[2]

David Bau, Hendrik Strobelt, William Peebles, Jonas Wulff, Bolei Zhou, Jun-Yan Zhu, and Antonio Torralba. 2019. Semantic Photo Manipulation with a Generative Image Prior. ACM Transactions on Graphics (Proceedings of ACM SIGGRAPH) 38, 4 (2019).

[3]

Cambodge Bist, Rémi Cozot, Gérard Madec, and Xavier Ducloux. 2017. Tone expansion using lighting style aesthetics. Computers & Graphics 62 (2017), 77--86.

[4]

Paul Debevec. 2005. A median cut algorithm for light probe sampling. In ACM SIGGRAPH 2005 Posters. ACM, 66.

Digital Library

[5]

PE Debevec and J Malik. 1997. Recovering high dynamic range images. In Proceeding of the SPIE: Image Sensors, Vol. 3965. 392--401.

[6]

Piotr Didyk, Rafal Mantiuk, Matthias Hein, and Hans-Peter Seidel. 2008. Enhancement of bright video features for HDR displays. In Computer Graphics Forum, Vol. 27. Wiley Online Library, 1265--1274.

[7]

Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. 2015. Image super-resolution using deep convolutional networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 38, 2 (2015), 295--307.

Digital Library

[8]

Frédo Durand and Julie Dorsey. 2002. Fast bilateral filtering for the display of high-dynamic-range images. In Proceedings of the 29th Annual Conference on Computer Graphics and Interactive Techniques. 257--266.

Digital Library

[9]

Gabriel Eilertsen, Joel Kronander, Gyorgy Denes, Rafał K Mantiuk, and Jonas Unger. 2017. HDR image reconstruction from a single exposure using deep CNNs. ACM Transactions on Graphics (TOG) 36, 6 (2017), 178.

Digital Library

[10]

Gabriel Eilertsen, RafałMantiuk, and Jonas Unger. 2019. Single-frame Regularization for Temporally Stable CNNs. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]

Yuki Endo, Yoshihiro Kanamori, and Jun Mitani. 2017. Deep reverse tone mapping. ACM Transactions on Graphics (TOG) 36, 6 (2017), 177--1.

Digital Library

[12]

Leon A Gatys, Alexander S Ecker, and Matthias Bethge. 2015. A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576 (2015).

[13]

Leon A Gatys, Alexander S Ecker, and Matthias Bethge. 2016. Image style transfer using convolutional neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2414--2423.

[14]

Xavier Glorot and Yoshua Bengio. 2010. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS). 249--256.

[15]

Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in Neural Information Processing Systems (NeurIPS). 2672--2680.

[16]

Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R Scott, and Larry S Davis. 2019. FiNet: Compatible and Diverse Fashion Image Inpainting. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 4481--4491.

[17]

Samuel W Hasinoff, Dillon Sharlet, Ryan Geiss, Andrew Adams, Jonathan T Barron, Florian Kainz, Jiawen Chen, and Marc Levoy. 2016. Burst photography for high dynamic range and low-light imaging on mobile cameras. ACM Transactions on Graphics (TOG) 35, 6 (2016), 192.

Digital Library

[18]

Geoffrey E Hinton and Ruslan R Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. Science 313, 5786 (2006), 504--507.

[19]

Jun Hu, Orazio Gallo, Kari Pulli, and Xiaobai Sun. 2013. HDR deghosting: How to deal with saturation?. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1163--1170.

Digital Library

[20]

Nima Khademi Kalantari and Ravi Ramamoorthi. 2017. Deep high dynamic range imaging of dynamic scenes. ACM Transactions on Graphics (TOG) 36, 4 (2017), 144--1.

Digital Library

[21]

Sing Bing Kang, Matthew Uyttendaele, Simon Winder, and Richard Szeliski. 2003. High dynamic range video. In ACM Transactions on Graphics (TOG), Vol. 22. ACM, 319--325.

Digital Library

[22]

Soo Ye Kim, Jihyong Oh, and Munchurl Kim. 2019. Jsi-gan: Gan-based joint super-resolution and inverse tone-mapping with pixel-wise task-specific filters for UHD HDR video. arXiv preprint arXiv:1909.04391 (2019).

[23]

Diederick P Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In International Conference on Learning Representations (ICLR).

[24]

Rafael P Kovaleski and Manuel M Oliveira. 2014. High-quality reverse tone mapping for a wide range of exposures. In 2014 27th SIBGRAPI Conference on Graphics, Patterns and Images. IEEE, 49--56.

Digital Library

[25]

Hayden Landis. 2002. Production-ready global illumination. SIGGRAPH Course Notes 16, 2002 (2002), 11.

[26]

Siyeong Lee, Gwon Hwan An, and Suk-Ju Kang. 2018a. Deep chain hdri: Reconstructing a high dynamic range image from a single low dynamic range image. IEEE Access 6 (2018), 49913--49924.

[27]

Siyeong Lee, Gwon Hwan An, and Suk-Ju Kang. 2018b. Deep Recursive HDRI: Inverse Tone Mapping using Generative Adversarial Networks. In Proceedings of the European Conference on Computer Vision (ECCV). 596--611.

Digital Library

[28]

Guilin Liu, Fitsum A Reda, Kevin J Shih, Ting-Chun Wang, Andrew Tao, and Bryan Catanzaro. 2018. Image inpainting for irregular holes using partial convolutions. In Proceedings of the European Conference on Computer Vision (ECCV). 85--100.

Digital Library

[29]

Gonzalo Luzardo, Jan Aelterman, Hiep Luong, Wilfried Philips, Daniel Ochoa, and Sven Rousseaux. 2018. Fully-Automatic Inverse Tone Mapping Preserving the Content Creator's Artistic Intentions. In 2018 Picture Coding Symposium (PCS). IEEE, 199--203.

[30]

Andrew L Maas, Awni Y Hannun, and Andrew Y Ng. 2013. Rectifier nonlinearities improve neural network acoustic models. In Proceedings of International Conference on Machine Learning (ICML), Vol. 30. 3.

[31]

Rafat Mantiuk, Kil Joong Kim, Allan G Rempel, and Wolfgang Heidrich. 2011. HDR-VDP-2: A calibrated visual metric for visibility and quality predictions in all luminance conditions. ACM Transactions on Graphics (TOG) 30, 4 (2011), 40.

Digital Library

[32]

Demetris Marnerides, Thomas Bashford-Rogers, Jonathan Hatchett, and Kurt Debattista. 2018. ExpandNet: A deep convolutional neural network for high dynamic range expansion from low dynamic range content. In Computer Graphics Forum, Vol. 37. Wiley Online Library, 37--49.

[33]

Morgan McGuire, Wojciech Matusik, Hanspeter Pfister, Billy Chen, John F Hughes, and Shree K Nayar. 2007. Optical splitting trees for high-precision monocular imaging. IEEE Computer Graphics and Applications 27, 2 (2007), 32--42.

Digital Library

[34]

Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning (ICML). 807--814.

Digital Library

[35]

Shiyu Ning, Hongteng Xu, Li Song, Rong Xie, and Wenjun Zhang. 2018. Learning an inverse tone mapping network with a generative adversarial regularizer. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1383--1387.

Digital Library

[36]

Tae-Hyun Oh, Joon-Young Lee, Yu-Wing Tai, and In So Kweon. 2014. Robust high dynamic range imaging by rank minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 6 (2014), 1219--1232.

Digital Library

[37]

Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems (NeurIPS). 8024--8035.

[38]

Allan G Rempel, Matthew Trentacoste, Helge Seetzen, H David Young, Wolfgang Heidrich, Lorne Whitehead, and Greg Ward. 2007. Ldr2hdr: on-the-fly reverse tone mapping of legacy video and photographs. In ACM Transactions on Graphics (TOG), Vol. 26. ACM, 39.

Digital Library

[39]

Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-NET: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-assisted Intervention. Springer, 234--241.

[40]

Pradeep Sen, Nima Khademi Kalantari, Maziar Yaesoubi, Soheil Darabi, Dan B Goldman, and Eli Shechtman. 2012. Robust patch-based HDR reconstruction of dynamic scenes. ACM Transactions on Graphics (TOG) 31, 6 (2012), 203--1.

Digital Library

[41]

Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations (ICLR).

[42]

Michael D Tocci, Chris Kiser, Nora Tocci, and Pradeep Sen. 2011. A versatile HDR video production system. In ACM Transactions on Graphics (TOG), Vol. 30. ACM, 41.

Digital Library

[43]

Lvdi Wang, Li-Yi Wei, Kun Zhou, Baining Guo, and Heung-Yeung Shum. 2007. High dynamic range image hallucination. In Proceedings of the 18th Eurographics Conference on Rendering Techniques. Eurographics Association, 321--326.

Digital Library

[44]

Shangzhe Wu, Jiarui Xu, Yu-Wing Tai, and Chi-Keung Tang. 2018. Deep high dynamic range imaging with large foreground motions. In Proceedings of the European Conference on Computer Vision (ECCV). 117--132.

Digital Library

[45]

Yucheng Xu, Shiyu Ning, Rong Xie, and Li Song. 2019. Gan Based Multi-Exposure Inverse Tone Mapping. In 2019 IEEE International Conference on Image Processing (ICIP). IEEE, 1--5.

[46]

Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, and Hao Li. 2017. Highresolution image inpainting using multi-scale neural patch synthesis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6721--6729.

[47]

Xin Yang, Ke Xu, Yibing Song, Qiang Zhang, Xiaopeng Wei, and Rynson WH Lau. 2018. Image correction via deep reciprocating HDR transformation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1798--1807.

[48]

Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, and Thomas S Huang. 2019. Free-form image inpainting with gated convolution. In Proceedings of the IEEE International Conference on Computer Vision. 4471--4480.

[49]

Richard Zhang, Phillip Isola, and Alexei A Efros. 2016. Colorful image colorization. In European Conference on Computer Vision (ECCV). Springer, 649--666.

[50]

Bolei Zhou, Agata Lapedriza, Jianxiong Xiao, Antonio Torralba, and Aude Oliva. 2014. Learning deep features for scene recognition using places database. In Advances in Neural Information Processing Systems (NeurIPS). 487--495.

Cited By

Wang ZHe G(2024)Multi-Frame Content-Aware Mapping Network for Standard-Dynamic-Range to High-Dynamic-Range Television Artifact RemovalSensors10.3390/s2401029924:1(299)Online publication date: 4-Jan-2024
https://doi.org/10.3390/s24010299
Li XCheng SZeng ZZhao CFan C(2024)ERS-HDRI: Event-Based Remote Sensing HDR ImagingRemote Sensing10.3390/rs1603043716:3(437)Online publication date: 23-Jan-2024
https://doi.org/10.3390/rs16030437
Liu YTian YWang SZhang XKwong S(2024)Overview of High-Dynamic-Range Image Quality AssessmentJournal of Imaging10.3390/jimaging1010024310:10(243)Online publication date: 27-Sep-2024
https://doi.org/10.3390/jimaging10100243
Show More Cited By

Index Terms

Single image HDR reconstruction using a CNN with masked features and perceptual loss
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Image and video acquisition
        Computational photography

Recommendations

Deep high dynamic range imaging of dynamic scenes

Producing a high dynamic range (HDR) image from a set of images with different exposures is a challenging process for dynamic scenes. A category of existing techniques first register the input images to a reference image and then merge the aligned ...
HDR image reconstruction from a single exposure using deep CNNs

Camera sensors can only capture a limited range of luminance simultaneously, and in order to create high dynamic range (HDR) images a set of different exposures are typically combined. In this paper we address the problem of predicting information that ...
Perceptual Image Dehazing Based on Generative Adversarial Learning
Advances in Multimedia Information Processing – PCM 2018
Abstract
Convolutional Neural Networks (CNN) based single image dehazing methods have recently gained much attention. However, as they heavily rely on synthetic haze images, existing CNN-based dehazing methods have limitations in achieving visually ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics

ACM Transactions on Graphics Volume 39, Issue 4

August 2020

1732 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/3386569

Editor:
Szymon Rusinkiewicz
Princeton University

Issue’s Table of Contents

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 August 2020

Published in TOG Volume 39, Issue 4

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Conselho Nacional de Desenvolvimento Científico e Tecnológico
TAMU T3

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

101
Total Citations
View Citations
660
Total Downloads

Downloads (Last 12 months)60
Downloads (Last 6 weeks)11

Reflects downloads up to 21 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Wang ZHe G(2024)Multi-Frame Content-Aware Mapping Network for Standard-Dynamic-Range to High-Dynamic-Range Television Artifact RemovalSensors10.3390/s2401029924:1(299)Online publication date: 4-Jan-2024
https://doi.org/10.3390/s24010299
Li XCheng SZeng ZZhao CFan C(2024)ERS-HDRI: Event-Based Remote Sensing HDR ImagingRemote Sensing10.3390/rs1603043716:3(437)Online publication date: 23-Jan-2024
https://doi.org/10.3390/rs16030437
Liu YTian YWang SZhang XKwong S(2024)Overview of High-Dynamic-Range Image Quality AssessmentJournal of Imaging10.3390/jimaging1010024310:10(243)Online publication date: 27-Sep-2024
https://doi.org/10.3390/jimaging10100243
Tominaga SHoriuchi T(2024)High Dynamic Range Image Reconstruction from Saturated Images of Metallic ObjectsJournal of Imaging10.3390/jimaging1004009210:4(92)Online publication date: 15-Apr-2024
https://doi.org/10.3390/jimaging10040092
Lin HLin YLin WChang C(2024)Reconstructing High Dynamic Range Image from a Single Low Dynamic Range Image Using Histogram LearningApplied Sciences10.3390/app1421984714:21(9847)Online publication date: 28-Oct-2024
https://doi.org/10.3390/app14219847
Luzardo GKumcu AAelterman JLuong HOchoa DPhilips W(2024)A Display-Adaptive Pipeline for Dynamic Range Expansion of Standard Dynamic Range Video ContentApplied Sciences10.3390/app1410408114:10(4081)Online publication date: 11-May-2024
https://doi.org/10.3390/app14104081
Banterle FMarnerides DBashford-rogers TDebattista K(2024)Self-supervised High Dynamic Range Imaging: What Can Be Learned from a Single 8-bit Video?ACM Transactions on Graphics10.1145/364857043:2(1-16)Online publication date: 23-Mar-2024
https://dl.acm.org/doi/10.1145/3648570
Wang CWolski KKerbl BSerrano ABemana MSeidel HMyszkowski KLeimkühler T(2024)Cinematic Gaussians: Real‐Time HDR Radiance Fields with Depth of FieldComputer Graphics Forum10.1111/cgf.1521443:7Online publication date: 7-Nov-2024
https://doi.org/10.1111/cgf.15214
Zhang HZou XLu GChen LSong LZhang W(2024)EffiHDR: An Efficient Framework for HDRTV Reconstruction and Enhancement in UHD SystemsIEEE Transactions on Broadcasting10.1109/TBC.2023.334565770:2(620-636)Online publication date: Jun-2024
https://doi.org/10.1109/TBC.2023.3345657
Barua HKrishnasamy GWong KDhall AStefanov K(2024)Histohdr-Net: Histogram Equalization for Single LDR to HDR Image Translation2024 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP51287.2024.10648020(2730-2736)Online publication date: 27-Oct-2024
https://doi.org/10.1109/ICIP51287.2024.10648020
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Issue’s Table of Contents