Towards High-Resolution Specular Highlight Detection

Gang Fu¹,
Qing Zhang²,
Lei Zhu^3,4,
Qifeng Lin⁵,
Yihao Wang¹,
Siyuan Fan¹ &
…
Chunxia Xiao¹

987 Accesses
4 Citations
1 Altmetric
Explore all metrics

Abstract

Specular highlight detection is an essential task with various applications in computer vision. This paper aims to detect specular highlights in single high-resolution images using deep learning while avoiding excessive GPU memory consumption. To achieve this, we present a high-resolution specular highlight detection dataset with manual annotations of specular highlights. Given our dataset, we propose a patch-level bidirectional refinement network for high-resolution specular highlight detection. The main idea is to utilize both the pathway from small-scale patch to large-scale patch and its reverse pathway to progressively refine the detection results of adjacent-scale specular highlight patches. Moreover, based on our detection network, we propose a modified inpainting framework for specular highlight removal as an application. Lastly, we provide ten potential research directions for specular highlight detection, inspiring researchers for further study.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 12

Orhlr-net: one-stage residual learning network for joint single-image specular highlight detection and removal

Article 24 August 2024

Document Specular Highlight Removal with Coarse-to-Fine Strategy

ET-HDR: An Efficient Two-Stage Network for Specular Highlight Detection and Removal

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Notes

https://www.flickr.com.
https://www.pinterest.com.
https://images.google.com.
https://www.bing.com/images.
We use the same term specular residual as in (Shi et al., 2017) to represent the remaining component obtained by subtracting the diffuse reflection component from an observed image.

References

Akashi, Y., & Okatani, T. (2015). Separation of reflection components by sparse non-negative matrix factorization. Computer Vision and Image Understanding, 100(146), 77–85.
Google Scholar
Angelopoulou, E. (2007) Specular highlight detection based on the fresnel reflection coefficient, In Proceedings of the IEEE International Conference on Computer Vision (pp. 1–8).
Bajcsy, R., Lee, S. W., & Leonardis, A. (1990). Color image segmentation with detection of highlights and local illumination induced by inter-reflections. In Proceedings of the IEEE International Conference on Pattern Recognition (pp. 785–790).
Barron, J. T. & Tsai, Y.-T. (2017) Fast Fourier color constancy. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 886–894)
Bell, S., Bala, K., & Snavely, N. (2014). Intrinsic images in the wild. Transactions on Graphics, 33(4), 159.
Article Google Scholar
Borji, A., Cheng, M.-M., Jiang, H., & Li, J. (2015). Salient object detection: A benchmark. IEEE Transactions on Image Processing, 24(12), 5706–5722.
Article MathSciNet Google Scholar
Brelstaff, G., & Blake, A. (1988). Detecting specular reflections using Lambertian constraints. In Proceedings of the IEEE International Conference on Computer Vision (pp. 297–302).
Chen, W., Jiang, Z., Wang, Z., Cui, K., & Qian, X. (2019). Collaborative global-local networks for memory-efficient segmentation of ultra-high resolution images. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 8924–8933).
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F. & Adam, H. (2018a). Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European Conference on Computer Vision (pp. 801–818).
Chen, L., Zhu, Y., Papandreou, G., Schroff, F. & Adam, H. (2018b). Encoder-decoder with atrous separable convolution for semantic image segmentation. In Proceedings of the European Conference on Computer Vision (pp. 833–851).
Cheng, H. K., Chung, J., Tai, Y.-W., & Tang, C.-K. (2020). CascadePSP: Toward class-agnostic and very high-resolution segmentation via global and local refinement. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 8890–8899).
Cheng, M.-M., Mitra, N. J., Huang, X., Torr, P. H., & Hu, S.-M. (2014). Global contrast based salient region detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(3), 569–582.
Article Google Scholar
Croitoru, I., Bogolin, S.-V., & Leordeanu, M. (2019). Unsupervised learning of foreground object segmentation. International Journal of Computer Vision, 127, 1279–1302.
Article Google Scholar
Fan, D.-P., Cheng, M.-M., Liu, Y., Li, T., & Borji, A. (2017). Structure-measure: A new way to evaluate foreground maps, In Proceedings of the IEEE International Conference on Computer Vision (pp. 4548–4557).
Fan, D.-P., et al. (2018). Enhanced-alignment measure for binary foreground map evaluation. arXiv preprint arXiv:1805.10421
Fu, G., Zhang, Q., Lin, Q., Zhu, L., & Xiao, C. (2020). Learning to detect spec1ular highlights from real-world images. In Proceedings of the ACM International Conference on Multimedia (pp. 1873–1881).
Fu, G., Zhang, Q., Zhu, L., Li, P., & Xiao, C. (2021). A multi-task network for joint specular highlight detection and removal. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 7752–7761).
Grosse, R., Johnson, M. K., Adelson, E. H., & Freeman, W. T. (2009). Ground truth dataset and baseline evaluations for intrinsic image algorithms. In Proceedings of the IEEE International Conference on Computer Vision (pp. 2335–2342).
He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770–778).
Hou, Q., Cheng, M.-M., Hu, X., Borji, A., Tu, Z., & Torr, P. (2019). Deeply supervised salient object detection with short connections. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(4), 815–828.
Article Google Scholar
Hoyer, P. O. (2004). Non-negative matrix factorization with sparseness constraints. Journal of Machine Learning Research, 5(9), 1457–1469.
MathSciNet Google Scholar
Hu, J., Shen, L., & Sun, G. (2018). Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 7132–7141).
Hu, X., Zhu, L., Fu, C.-W., Qin, J., & Heng, P.-A. (2018). Direction-aware spatial context features for shadow detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 7454–7462).
Hu, X., Fu, C.-W., Zhu, L., Qin, J., & Heng, P.-A. (2019). Direction-aware spatial context features for shadow detection and removal. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(11), 2795–2808.
Article Google Scholar
Huynh, C., Tran, A. T., Luu, K., & Hoai, M. (2021). Progressive semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 16755–16764).
Kim, H., Jin, H., Hadap, S., & Kweon, I. (2013). Specular reflection separation using dark channel prior. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1460–1467).
Kingma, D. P., & Ba, J. (2014). Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980
Lang, C., Feng, J., Feng, S., Wang, J., & Yan, S. (2016). Dual low-rank pursuit: Learning salient features for saliency detection. IEEE Transactions on Neural Networks and Learning Systems, 27(6), 1190–1200.
Article MathSciNet Google Scholar
Li, Z., & Snavely, N. (2018). Learning intrinsic image decomposition from watching the world. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 9039–9048).
Li, R., et al. (2019). Specular reflections removal for endoscopic image sequences with adaptive-rpca decomposition. IEEE Transactions on Medical Imaging, 39(2), 328–340.
Article Google Scholar
Lin, T.-Y., et al. (2017). Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 2117–2125).
Lin, P., et al. (2020). Graph-guided architecture search for real-time semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4203–4212).
Liu, Y., Li, Y., You, S., & Lu, F. (2020). Unsupervised learning for intrinsic image decomposition from a single image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3248–3257).
Liu, G., et al. (2018). Image inpainting for irregular holes using partial convolutions. In Proceedings of the European Conference on Computer Vision (pp. 85–100).
Murmann, L., Gharbi, M., Aittala, M., & Durand, F. (2019). A dataset of multi-illumination images in the wild. In Proceedings of the IEEE International Conference on Computer Vision (pp. 4080–4089).
Netz, A., & Osadchy, M. (2012). Recognition using specular highlights. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(3), 639–652.
Article Google Scholar
Osadchy, M., Jacobs, D. W., & Ramamoorthi, R. (2003). Using specularities for recognition. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1512–1519).
Pan, X., Shi, J., Luo, P., Wang, X., & Tang, X. (2018). Spatial as deep: Spatial CNN for traffic scene understanding. In Proceedings of the AAAI Conference on Artificial Intelligence (pp. 7276–7283).
Park, J. B., & Kak, A. C. (2003). A truncated least squares approach to the detection of specular highlights in color images. In Proceedings of the IEEE International Conference on Robotics and Automation (pp. 1397–1403).
Phong, B. T. (1975). Illumination for computer generated pictures. Communications of the ACM, 18(6), 311–317.
Article Google Scholar
Qin, X., et al. (2019). BASNet: Boundary-aware salient object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 7479–7489).
Shafer, S. A. (1985). Using color to separate reflection components. Color Research and Application, 10(4), 210–218.
Article Google Scholar
Shen, L., Tan, P., & Lin, S. (2008). Intrinsic image decomposition with non-local texture cues. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1–7).
Shen, J., Yang, X., Jia, Y. & Li, X., (2011). Intrinsic images using optimization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3481–3487).
Shen, H.-L., & Zheng, Z.-H. (2013). Real-time highlight removal using intensity ratio. Applied Optics, 52(19), 4483–4493.
Article Google Scholar
Shi, J., Dong, Y., Su, H., & Yu, S. X. (2017). Learning non-Lambertian object intrinsics across shapenet categories. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1685–1290).
Sun, Z., Cao, S., Yang, Y., & Kitani, K. (2020). Rethinking Transformer-based set prediction for object detection. arXiv preprint arXiv:2011.10881
Tan, R. T., & Ikeuchi, K. (2005). Separating reflection components of textured surfaces using a single image. IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(2), 178–193.
Article Google Scholar
Tian, Q., & Clark, J. J. (2013). Real-time specularity detection using unnormalized wiener entropy. In Proceedings of the IEEE International Conference on Computer and Robot Vision (pp. 356–363).
Van Gansbeke, W., Vandenhende, S., Georgoulis, S., & Van Gool, L. (2021). Unsupervised semantic segmentation by contrasting object mask proposals. In Proceedings of the IEEE International Conference on Computer Vision (pp. 10052–10062).
Wang, T., Hu, X., Wang, Q., Heng, P.-A., & Fu, C.-W. (2020). Instance shadow detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1880–1889).
Wang, Y., Zhang, W., Wang, L., Liu, T., & Lu, H. (2022). Multi-source uncertainty mining for deep unsupervised saliency detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 11727–11736).
Wang, R., et al. (2019). Underexposed photo enhancement using deep illumination estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 6849–6857).
Wu, T., et al. (2020). Patch proposal network for fast semantic segmentation of high-resolution images. In Proceedings of the AAAI Conference on Artificial Intelligence (pp. 12402–12409).
Wu, Z., et al. (2021). Single-image specular highlight removal via real-world dataset construction. IEEE Transactions on Multimedia, 24, 3782–3793.
Article Google Scholar
Xie, C., et al. (2022). Pyramid grafting network for one-stage high resolution saliency detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 11717–11726).
Yang, Y., & Soatto, S. (2020). FDA: Fourier domain adaptation for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4085–4095).
Yang, Q., Tang, J., & Ahuja, N. (2015). Efficient and robust specular highlight removal. IEEE Transactions on Pattern Analysis and Machine Intelligence, 37(6), 1304–1311.
Article Google Scholar
Zeng, Y., Zhang, P., Zhang, J., Lin, Z., & Lu, H. (2019). Towards high-resolution salient object detection. In Proceedings of the IEEE International Conference on Computer Vision (pp. 7234–7243).
Zhang, P., Liu, W., Zeng, Y., Lei, Y., & Lu, H. (2021). Looking for the detail and context devils: High-resolution salient object detection. IEEE Transactions on Image Processing, 30, 3204-3216.
Article Google Scholar
Zhang, L., Yan, Q., Liu, Z., Zou, H., & Xiao, C. (2017). Illumination decomposition for photograph with multiple light sources. IEEE Transactions on Image Processing, 26(9), 4114–4127.
Zhang, W., Zhao, X., Morvan, J.-M., & Chen, L. (2018). Improving shadow suppression for illumination robust face recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(3), 611–624.
Article Google Scholar
Zheng, Q., Qiao, X., Cao, Y., & Lau, R. W. (2019). Distraction-aware shadow detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 5167–5176).
Zhou, P., Price, B., Cohen, S., Wilensky, G., & Davis, L. S. (2020). DeepStrip: High-resolution boundary refinement. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 10558–1350).
Zhou, H., Xie, X., Lai, J.-H., Chen, Z., & Yang, L. (2020). Interactive two-stream decoder for accurate and fast saliency detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 9141–9150).
Zhu, L., et al. (2018). Bidirectional feature pyramid network with recurrent attention residual modules for shadow detection. In Proceedings of the European Conference on Computer Vision (pp. 122–137).

Download references

Acknowledgements

This work is partially supported by the National Natural Science Foundation of China under Grant (No. 61972298), CAAI-Huawei MindSpore Open Fund, and the Research Program for Young and Middle-Aged Teachers of Fujian Province under Grant (No. JAT210036).

Author information

Authors and Affiliations

School of Computer Science, Wuhan University, Wuhan, 430072, Hubei, China
Gang Fu, Yihao Wang, Siyuan Fan & Chunxia Xiao
School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou, 510006, Guangdong, China
Qing Zhang
ROAS Thrust, The Hong Kong University of Science and Technology (Guangzhou), Guangzhou, 511400, Guangdong, China
Lei Zhu
Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China
Lei Zhu
College of Computer and Data Science, Fuzhou University, Fuzhou, 350116, Fujian, China
Qifeng Lin

Authors

Gang Fu
View author publications
You can also search for this author in PubMed Google Scholar
Qing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Lei Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Qifeng Lin
View author publications
You can also search for this author in PubMed Google Scholar
Yihao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Siyuan Fan
View author publications
You can also search for this author in PubMed Google Scholar
Chunxia Xiao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chunxia Xiao.

Additional information

Communicated by Shaodi You.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Fu, G., Zhang, Q., Zhu, L. et al. Towards High-Resolution Specular Highlight Detection. Int J Comput Vis 132, 95–117 (2024). https://doi.org/10.1007/s11263-023-01845-3

Download citation

Received: 02 February 2022
Accepted: 03 July 2023
Published: 23 August 2023
Issue Date: January 2024
DOI: https://doi.org/10.1007/s11263-023-01845-3

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Orhlr-net: one-stage residual learning network for joint single-image specular highlight detection and removal

Document Specular Highlight Removal with Coarse-to-Fine Strategy

ET-HDR: An Efficient Two-Stage Network for Specular Highlight Detection and Removal

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Towards High-Resolution Specular Highlight Detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Orhlr-net: one-stage residual learning network for joint single-image specular highlight detection and removal

Document Specular Highlight Removal with Coarse-to-Fine Strategy

ET-HDR: An Efficient Two-Stage Network for Specular Highlight Detection and Removal

Explore related subjects

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation