Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Relative CNN-RNN: Learning Relative Atmospheric Visibility From Images

Published: 01 January 2019 Publication History

Abstract

We propose a deep learning approach for directly estimating relative atmospheric visibility from outdoor photos without relying on weather images or data that require expensive sensing or custom capture. Our data-driven approach capitalizes on a large collection of Internet images to learn rich scene and visibility varieties. The relative CNN–RNN coarse-to-fine model, where CNN stands for convolutional neural network and RNN stands for recurrent neural network, exploits the joint power of relative support vector machine, which has a good ranking representation, and the data-driven deep learning features derived from our novel CNN–RNN model. The CNN–RNN model makes use of shortcut connections to bridge a CNN module and an RNN coarse-to-fine module. The CNN captures the global view while the RNN simulates human’s attention shift, namely, from the whole image (global) to the farthest discerned region (local). The learned relative model can be adapted to predict absolute visibility in limited scenarios. Extensive experiments and comparisons are performed to verify our method. We have built an annotated dataset consisting of about 40000 images with 0.2 million human annotations. The large-scale, annotated visibility data set will be made available to accompany this paper.

References

[1]
J. J. Koenderink, “ Pictorial relief,” Philos. Trans. Roy. Soc. London A, Math., Phys. Eng. Sci., vol. Volume 356, no. Issue 1740, pp. 1071–1086, 1998.
[2]
V. Mnih, N. Heess, and A. Graves, “ Recurrent models of visual attention,” in Proc. NIPS, 2014, pp. 2204–2212.
[3]
F. Liu, C. Shen, G. Lin, and I. Reid, “ Learning depth from single monocular images using deep convolutional neural fields,” IEEE Trans. Pattern Anal. Mach. Intell., vol. Volume 38, no. Issue 10, pp. 2024–2039, 2016.
[4]
N. Graves and S. Newsam, “ Using visibility cameras to estimate atmospheric light extinction,” in Proc. WACV, 2011, pp. 577–584.
[5]
D. Bäumer, S. Versick, and B. Vogel, “ Determination of the visibility using a digital panorama camera,” Atmos. Environ., vol. Volume 42, no. Issue 11, pp. 2593–2602, 2008.
[6]
N. Hautiére, R. Babari, É. Dumont, R. Brémond, and N. Paparoditis, “ Estimating meteorological visibility using cameras: A probabilistic model-driven approach,” in Proc. ACCV, 2011, pp. 243–254.
[7]
J. P. Oakley and B. L. Satherley, “ Improving image quality in poor visibility conditions using a physical model for contrast degradation,” IEEE Trans. Image Process., vol. Volume 7, no. Issue 2, pp. 167–179, 1998.
[8]
P. Carr and R. Hartley, “ Improved single image dehazing using geometry,” in Proc. Digit. Image Comput., Techn. Appl. (DICTA), 2009, pp. 103–110.
[9]
J. Kopf, “ Deep photo: Model-based photograph enhancement and viewing,” ACM Trans. Graph., vol. Volume 27, no. Issue 5, pp. 116-1–116-10, 2008.
[10]
Y. Y. Schechner, S. G. Narasimhan, and S. K. Nayar, “ Instant dehazing of images using polarization,” in Proc. CVPR, 2001, p. pp.325.
[11]
S. Shwartz, E. Namer, and Y. Y. Schechner, “ Blind haze separation,” in Proc. CVPR, 2006, pp. 1984–1991.
[12]
S. G. Narasimhan and S. K. Nayar, “ Vision and the atmosphere,” Int. J. Comput. Vis., vol. Volume 48, no. Issue 3, pp. 233–254, 2002.
[13]
K. He, J. Sun, and X. Tang, “ Single image haze removal using dark channel prior,” IEEE Trans. Pattern Anal. Mach. Intell., vol. Volume 33, no. Issue 12, pp. 2341–2353, 2011.
[14]
Q. Yan, L. Xu, and J. Jia, “ Dense scattering layer removal,” in Proc. SIGGRAPH Asia Tech. Briefs, 2013, Art. no. .
[15]
G. Meng, Y. Wang, J. Duan, S. Xiang, and C. Pan, “ Efficient image dehazing with boundary constraint and contextual regularization,” in Proc. ICCV, 2013, pp. 617–624.
[16]
R. T. Tan, “ Visibility in bad weather from a single image,” in Proc. CVPR, 2008, pp. 1–8.
[17]
R. Fattal, “ Single image dehazing,” ACM Trans. Graph., vol. Volume 27, no. Issue 3, p. pp.72, 2008.
[18]
S. G. Narasimhan and S. K. Nayar, “ Shedding light on the weather,” in Proc. CVPR, 2003, p. pp.665.
[19]
S. Metari and F. Deschenes, “ A new convolution kernel for atmospheric point spread function applied to computer vision,” in Proc. ICCV, 2007, pp. 1–8.
[20]
K. Tang, J. Yang, and J. Wang, “ Investigating haze-relevant features in a learning framework for image dehazing,” in Proc. CVPR, 2014, pp. 2995–3002.
[21]
T. Joachims, “ Optimizing search engines using clickthrough data,” in Proc. SIGKDD, 2002, pp. 133–142.
[22]
A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ ImageNet classification with deep convolutional neural networks,” in Proc. NIPS, 2012, pp. 1097–1105.
[23]
K. He, X. Zhang, S. Ren, and J. Sun, “ Deep residual learning for image recognition,” in Proc. CVPR, 2016, pp. 770–778.
[24]
D. Parikh and K. Grauman, “ Relative attributes,” in Proc. ICCV, 2011, pp. 503–510.
[25]
L. Page, S. Brin, R. Motwani, and T. Winograd, “ The PageRank citation ranking: Bringing order to the Web,” <institution content-type=institution>Stanford InfoLab</institution>, Stanford, CA, USA, Tech. Rep. 1999-66, 1999.
[26]
C. Burges, “ Learning to rank using gradient descent,” in Proc. ICML, 2005, pp. 89–96.
[27]
Y. Freund, R. Iyer, R. E. Schapire, and Y. Singer, “ An efficient boosting algorithm for combining preferences,” J. Mach. Learn. Res., vol. Volume 4, pp. 933–969, 2003.
[28]
C. Lu, D. Lin, J. Jia, and C.-K. Tang, “ Two-class weather classification,” in Proc. CVPR, 2014, pp. 3718–3725.
[29]
A. Oliva and A. Torralba, “ Modeling the shape of the scene: A holistic representation of the spatial envelope,” in Proc. IJCV, 2001, pp. 145–175.
[30]
V. Dang, “ Ranklib,” <institution content-type=institution>Univ. Massachusetts Amherst</institution>, Amherst, MA, USA, 2011. {Online}. Available: https://sourceforge.net/p/lemur/wiki/RankLib/

Cited By

View all
  • (2023)Image Defogging Based on Regional Gradient Constrained PriorACM Transactions on Multimedia Computing, Communications, and Applications10.1145/361783420:3(1-17)Online publication date: 23-Oct-2023
  • (2023)From Global to Local: An Adaptive Environmental Illumination Estimation for Non-uniform ScatteringProceedings of the 5th ACM International Conference on Multimedia in Asia10.1145/3595916.3626416(1-7)Online publication date: 6-Dec-2023
  • (2023)Multi visual feature fusion based fog visibility estimation for expressway surveillance using deep learning networkExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.121151234:COnline publication date: 30-Dec-2023
  • Show More Cited By

Index Terms

  1. Relative CNN-RNN: Learning Relative Atmospheric Visibility From Images
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Please enable JavaScript to view thecomments powered by Disqus.

          Information & Contributors

          Information

          Published In

          cover image IEEE Transactions on Image Processing
          IEEE Transactions on Image Processing  Volume 28, Issue 1
          Jan. 2019
          523 pages

          Publisher

          IEEE Press

          Publication History

          Published: 01 January 2019

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 16 Nov 2024

          Other Metrics

          Citations

          Cited By

          View all
          • (2023)Image Defogging Based on Regional Gradient Constrained PriorACM Transactions on Multimedia Computing, Communications, and Applications10.1145/361783420:3(1-17)Online publication date: 23-Oct-2023
          • (2023)From Global to Local: An Adaptive Environmental Illumination Estimation for Non-uniform ScatteringProceedings of the 5th ACM International Conference on Multimedia in Asia10.1145/3595916.3626416(1-7)Online publication date: 6-Dec-2023
          • (2023)Multi visual feature fusion based fog visibility estimation for expressway surveillance using deep learning networkExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.121151234:COnline publication date: 30-Dec-2023
          • (2023)DisGeReExT: a knowledge discovery system for exploration of disease–gene associations through large-scale literature-wide analysis studyKnowledge and Information Systems10.1007/s10115-023-01862-165:8(3463-3487)Online publication date: 1-Aug-2023
          • (2022)Dynamic Sign Language Recognition Based on CBAM with Autoencoder Time Series Neural NetworkMobile Information Systems10.1155/2022/32477812022Online publication date: 1-Jan-2022
          • (2022)Deep Bayesian Active-Learning-to-Rank for Endoscopic Image DataMedical Image Understanding and Analysis10.1007/978-3-031-12053-4_45(609-622)Online publication date: 27-Jul-2022
          • (2021)Visibility estimation via deep label distribution learning in cloud environmentJournal of Cloud Computing: Advances, Systems and Applications10.1186/s13677-021-00261-710:1Online publication date: 28-Aug-2021
          • (2020)Learning to compare visibility on webcam imagesProceedings of the 10th International Conference on Climate Informatics10.1145/3429309.3429323(91-97)Online publication date: 22-Sep-2020
          • (2019)Neural Compatibility Modeling With Probabilistic Knowledge DistillationIEEE Transactions on Image Processing10.1109/TIP.2019.293674229(871-882)Online publication date: 10-Oct-2019

          View Options

          View options

          Login options

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media