Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Single image HDR reconstruction using a CNN with masked features and perceptual loss

Published: 12 August 2020 Publication History

Abstract

Digital cameras can only capture a limited range of real-world scenes' luminance, producing images with saturated pixels. Existing single image high dynamic range (HDR) reconstruction methods attempt to expand the range of luminance, but are not able to hallucinate plausible textures, producing results with artifacts in the saturated areas. In this paper, we present a novel learning-based approach to reconstruct an HDR image by recovering the saturated pixels of an input LDR image in a visually pleasing way. Previous deep learning-based methods apply the same convolutional filters on wellexposed and saturated pixels, creating ambiguity during training and leading to checkerboard and halo artifacts. To overcome this problem, we propose a feature masking mechanism that reduces the contribution of the features from the saturated areas. Moreover, we adapt the VGG-based perceptual loss function to our application to be able to synthesize visually pleasing textures. Since the number of HDR images for training is limited, we propose to train our system in two stages. Specifically, we first train our system on a large number of images for image inpainting task and then fine-tune it on HDR reconstruction. Since most of the HDR examples contain smooth regions that are simple to reconstruct, we propose a sampling strategy to select challenging training patches during the HDR fine-tuning stage. We demonstrate through experimental results that our approach can reconstruct visually pleasing HDR results, better than the current state of the art on a wide range of scenes.

Supplemental Material

ZIP File
Supplemental files.

References

[1]
Francesco Banterle, Patrick Ledda, Kurt Debattista, and Alan Chalmers. 2006. Inverse tone mapping. In Proceedings of the 4th International Conference on Computer Graphics and Interactive Techniques in Australasia and Southeast Asia. ACM, 349--356.
[2]
David Bau, Hendrik Strobelt, William Peebles, Jonas Wulff, Bolei Zhou, Jun-Yan Zhu, and Antonio Torralba. 2019. Semantic Photo Manipulation with a Generative Image Prior. ACM Transactions on Graphics (Proceedings of ACM SIGGRAPH) 38, 4 (2019).
[3]
Cambodge Bist, Rémi Cozot, Gérard Madec, and Xavier Ducloux. 2017. Tone expansion using lighting style aesthetics. Computers & Graphics 62 (2017), 77--86.
[4]
Paul Debevec. 2005. A median cut algorithm for light probe sampling. In ACM SIGGRAPH 2005 Posters. ACM, 66.
[5]
PE Debevec and J Malik. 1997. Recovering high dynamic range images. In Proceeding of the SPIE: Image Sensors, Vol. 3965. 392--401.
[6]
Piotr Didyk, Rafal Mantiuk, Matthias Hein, and Hans-Peter Seidel. 2008. Enhancement of bright video features for HDR displays. In Computer Graphics Forum, Vol. 27. Wiley Online Library, 1265--1274.
[7]
Chao Dong, Chen Change Loy, Kaiming He, and Xiaoou Tang. 2015. Image super-resolution using deep convolutional networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 38, 2 (2015), 295--307.
[8]
Frédo Durand and Julie Dorsey. 2002. Fast bilateral filtering for the display of high-dynamic-range images. In Proceedings of the 29th Annual Conference on Computer Graphics and Interactive Techniques. 257--266.
[9]
Gabriel Eilertsen, Joel Kronander, Gyorgy Denes, Rafał K Mantiuk, and Jonas Unger. 2017. HDR image reconstruction from a single exposure using deep CNNs. ACM Transactions on Graphics (TOG) 36, 6 (2017), 178.
[10]
Gabriel Eilertsen, RafałMantiuk, and Jonas Unger. 2019. Single-frame Regularization for Temporally Stable CNNs. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[11]
Yuki Endo, Yoshihiro Kanamori, and Jun Mitani. 2017. Deep reverse tone mapping. ACM Transactions on Graphics (TOG) 36, 6 (2017), 177--1.
[12]
Leon A Gatys, Alexander S Ecker, and Matthias Bethge. 2015. A neural algorithm of artistic style. arXiv preprint arXiv:1508.06576 (2015).
[13]
Leon A Gatys, Alexander S Ecker, and Matthias Bethge. 2016. Image style transfer using convolutional neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2414--2423.
[14]
Xavier Glorot and Yoshua Bengio. 2010. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS). 249--256.
[15]
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in Neural Information Processing Systems (NeurIPS). 2672--2680.
[16]
Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R Scott, and Larry S Davis. 2019. FiNet: Compatible and Diverse Fashion Image Inpainting. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 4481--4491.
[17]
Samuel W Hasinoff, Dillon Sharlet, Ryan Geiss, Andrew Adams, Jonathan T Barron, Florian Kainz, Jiawen Chen, and Marc Levoy. 2016. Burst photography for high dynamic range and low-light imaging on mobile cameras. ACM Transactions on Graphics (TOG) 35, 6 (2016), 192.
[18]
Geoffrey E Hinton and Ruslan R Salakhutdinov. 2006. Reducing the dimensionality of data with neural networks. Science 313, 5786 (2006), 504--507.
[19]
Jun Hu, Orazio Gallo, Kari Pulli, and Xiaobai Sun. 2013. HDR deghosting: How to deal with saturation?. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1163--1170.
[20]
Nima Khademi Kalantari and Ravi Ramamoorthi. 2017. Deep high dynamic range imaging of dynamic scenes. ACM Transactions on Graphics (TOG) 36, 4 (2017), 144--1.
[21]
Sing Bing Kang, Matthew Uyttendaele, Simon Winder, and Richard Szeliski. 2003. High dynamic range video. In ACM Transactions on Graphics (TOG), Vol. 22. ACM, 319--325.
[22]
Soo Ye Kim, Jihyong Oh, and Munchurl Kim. 2019. Jsi-gan: Gan-based joint super-resolution and inverse tone-mapping with pixel-wise task-specific filters for UHD HDR video. arXiv preprint arXiv:1909.04391 (2019).
[23]
Diederick P Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In International Conference on Learning Representations (ICLR).
[24]
Rafael P Kovaleski and Manuel M Oliveira. 2014. High-quality reverse tone mapping for a wide range of exposures. In 2014 27th SIBGRAPI Conference on Graphics, Patterns and Images. IEEE, 49--56.
[25]
Hayden Landis. 2002. Production-ready global illumination. SIGGRAPH Course Notes 16, 2002 (2002), 11.
[26]
Siyeong Lee, Gwon Hwan An, and Suk-Ju Kang. 2018a. Deep chain hdri: Reconstructing a high dynamic range image from a single low dynamic range image. IEEE Access 6 (2018), 49913--49924.
[27]
Siyeong Lee, Gwon Hwan An, and Suk-Ju Kang. 2018b. Deep Recursive HDRI: Inverse Tone Mapping using Generative Adversarial Networks. In Proceedings of the European Conference on Computer Vision (ECCV). 596--611.
[28]
Guilin Liu, Fitsum A Reda, Kevin J Shih, Ting-Chun Wang, Andrew Tao, and Bryan Catanzaro. 2018. Image inpainting for irregular holes using partial convolutions. In Proceedings of the European Conference on Computer Vision (ECCV). 85--100.
[29]
Gonzalo Luzardo, Jan Aelterman, Hiep Luong, Wilfried Philips, Daniel Ochoa, and Sven Rousseaux. 2018. Fully-Automatic Inverse Tone Mapping Preserving the Content Creator's Artistic Intentions. In 2018 Picture Coding Symposium (PCS). IEEE, 199--203.
[30]
Andrew L Maas, Awni Y Hannun, and Andrew Y Ng. 2013. Rectifier nonlinearities improve neural network acoustic models. In Proceedings of International Conference on Machine Learning (ICML), Vol. 30. 3.
[31]
Rafat Mantiuk, Kil Joong Kim, Allan G Rempel, and Wolfgang Heidrich. 2011. HDR-VDP-2: A calibrated visual metric for visibility and quality predictions in all luminance conditions. ACM Transactions on Graphics (TOG) 30, 4 (2011), 40.
[32]
Demetris Marnerides, Thomas Bashford-Rogers, Jonathan Hatchett, and Kurt Debattista. 2018. ExpandNet: A deep convolutional neural network for high dynamic range expansion from low dynamic range content. In Computer Graphics Forum, Vol. 37. Wiley Online Library, 37--49.
[33]
Morgan McGuire, Wojciech Matusik, Hanspeter Pfister, Billy Chen, John F Hughes, and Shree K Nayar. 2007. Optical splitting trees for high-precision monocular imaging. IEEE Computer Graphics and Applications 27, 2 (2007), 32--42.
[34]
Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve restricted boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning (ICML). 807--814.
[35]
Shiyu Ning, Hongteng Xu, Li Song, Rong Xie, and Wenjun Zhang. 2018. Learning an inverse tone mapping network with a generative adversarial regularizer. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 1383--1387.
[36]
Tae-Hyun Oh, Joon-Young Lee, Yu-Wing Tai, and In So Kweon. 2014. Robust high dynamic range imaging by rank minimization. IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 6 (2014), 1219--1232.
[37]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems (NeurIPS). 8024--8035.
[38]
Allan G Rempel, Matthew Trentacoste, Helge Seetzen, H David Young, Wolfgang Heidrich, Lorne Whitehead, and Greg Ward. 2007. Ldr2hdr: on-the-fly reverse tone mapping of legacy video and photographs. In ACM Transactions on Graphics (TOG), Vol. 26. ACM, 39.
[39]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-NET: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-assisted Intervention. Springer, 234--241.
[40]
Pradeep Sen, Nima Khademi Kalantari, Maziar Yaesoubi, Soheil Darabi, Dan B Goldman, and Eli Shechtman. 2012. Robust patch-based HDR reconstruction of dynamic scenes. ACM Transactions on Graphics (TOG) 31, 6 (2012), 203--1.
[41]
Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations (ICLR).
[42]
Michael D Tocci, Chris Kiser, Nora Tocci, and Pradeep Sen. 2011. A versatile HDR video production system. In ACM Transactions on Graphics (TOG), Vol. 30. ACM, 41.
[43]
Lvdi Wang, Li-Yi Wei, Kun Zhou, Baining Guo, and Heung-Yeung Shum. 2007. High dynamic range image hallucination. In Proceedings of the 18th Eurographics Conference on Rendering Techniques. Eurographics Association, 321--326.
[44]
Shangzhe Wu, Jiarui Xu, Yu-Wing Tai, and Chi-Keung Tang. 2018. Deep high dynamic range imaging with large foreground motions. In Proceedings of the European Conference on Computer Vision (ECCV). 117--132.
[45]
Yucheng Xu, Shiyu Ning, Rong Xie, and Li Song. 2019. Gan Based Multi-Exposure Inverse Tone Mapping. In 2019 IEEE International Conference on Image Processing (ICIP). IEEE, 1--5.
[46]
Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, and Hao Li. 2017. Highresolution image inpainting using multi-scale neural patch synthesis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 6721--6729.
[47]
Xin Yang, Ke Xu, Yibing Song, Qiang Zhang, Xiaopeng Wei, and Rynson WH Lau. 2018. Image correction via deep reciprocating HDR transformation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1798--1807.
[48]
Jiahui Yu, Zhe Lin, Jimei Yang, Xiaohui Shen, Xin Lu, and Thomas S Huang. 2019. Free-form image inpainting with gated convolution. In Proceedings of the IEEE International Conference on Computer Vision. 4471--4480.
[49]
Richard Zhang, Phillip Isola, and Alexei A Efros. 2016. Colorful image colorization. In European Conference on Computer Vision (ECCV). Springer, 649--666.
[50]
Bolei Zhou, Agata Lapedriza, Jianxiong Xiao, Antonio Torralba, and Aude Oliva. 2014. Learning deep features for scene recognition using places database. In Advances in Neural Information Processing Systems (NeurIPS). 487--495.

Cited By

View all
  • (2024)Multi-Frame Content-Aware Mapping Network for Standard-Dynamic-Range to High-Dynamic-Range Television Artifact RemovalSensors10.3390/s2401029924:1(299)Online publication date: 4-Jan-2024
  • (2024)ERS-HDRI: Event-Based Remote Sensing HDR ImagingRemote Sensing10.3390/rs1603043716:3(437)Online publication date: 23-Jan-2024
  • (2024)Overview of High-Dynamic-Range Image Quality AssessmentJournal of Imaging10.3390/jimaging1010024310:10(243)Online publication date: 27-Sep-2024
  • Show More Cited By

Index Terms

  1. Single image HDR reconstruction using a CNN with masked features and perceptual loss

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Graphics
    ACM Transactions on Graphics  Volume 39, Issue 4
    August 2020
    1732 pages
    ISSN:0730-0301
    EISSN:1557-7368
    DOI:10.1145/3386569
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 August 2020
    Published in TOG Volume 39, Issue 4

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. convolutional neural network
    2. feature masking
    3. high dynamic range imaging
    4. perceptual loss

    Qualifiers

    • Research-article

    Funding Sources

    • Conselho Nacional de Desenvolvimento Científico e Tecnológico
    • TAMU T3

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)60
    • Downloads (Last 6 weeks)11
    Reflects downloads up to 21 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Multi-Frame Content-Aware Mapping Network for Standard-Dynamic-Range to High-Dynamic-Range Television Artifact RemovalSensors10.3390/s2401029924:1(299)Online publication date: 4-Jan-2024
    • (2024)ERS-HDRI: Event-Based Remote Sensing HDR ImagingRemote Sensing10.3390/rs1603043716:3(437)Online publication date: 23-Jan-2024
    • (2024)Overview of High-Dynamic-Range Image Quality AssessmentJournal of Imaging10.3390/jimaging1010024310:10(243)Online publication date: 27-Sep-2024
    • (2024)High Dynamic Range Image Reconstruction from Saturated Images of Metallic ObjectsJournal of Imaging10.3390/jimaging1004009210:4(92)Online publication date: 15-Apr-2024
    • (2024)Reconstructing High Dynamic Range Image from a Single Low Dynamic Range Image Using Histogram LearningApplied Sciences10.3390/app1421984714:21(9847)Online publication date: 28-Oct-2024
    • (2024)A Display-Adaptive Pipeline for Dynamic Range Expansion of Standard Dynamic Range Video ContentApplied Sciences10.3390/app1410408114:10(4081)Online publication date: 11-May-2024
    • (2024)Self-supervised High Dynamic Range Imaging: What Can Be Learned from a Single 8-bit Video?ACM Transactions on Graphics10.1145/364857043:2(1-16)Online publication date: 23-Mar-2024
    • (2024)Cinematic Gaussians: Real‐Time HDR Radiance Fields with Depth of FieldComputer Graphics Forum10.1111/cgf.1521443:7Online publication date: 7-Nov-2024
    • (2024)EffiHDR: An Efficient Framework for HDRTV Reconstruction and Enhancement in UHD SystemsIEEE Transactions on Broadcasting10.1109/TBC.2023.334565770:2(620-636)Online publication date: Jun-2024
    • (2024)Histohdr-Net: Histogram Equalization for Single LDR to HDR Image Translation2024 IEEE International Conference on Image Processing (ICIP)10.1109/ICIP51287.2024.10648020(2730-2736)Online publication date: 27-Oct-2024
    • Show More Cited By

    View Options

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media