Multi-Oriented and Scale-Invariant License Plate Detection Based on Convolutional Neural Networks
<p>Two main challenges for existing methods. (<b>a</b>) Comparison between different bounding methods (horizontal rectangles, rotated rectangles, and our bounding parallelograms). Existing methods based on horizontal or rotated rectangles cannot tightly enclose the multi-oriented license plates. (<b>b</b>) Representative detection results of Faster R-CNN [<a href="#B27-sensors-19-01175" class="html-bibr">27</a>] trained for LPD. The detected regions and actual license plates are presented by red horizontal rectangles and green polygons, respectively. It is difficult to detect license plates with multiple scales, especially the tiny ones.</p> "> Figure 2
<p>The overall structure of our MOSI-LPD. The backbone framework is Faster R-CNN [<a href="#B27-sensors-19-01175" class="html-bibr">27</a>], the classic region-based deep learning network for object detection. To achieve multi-oriented and scale-invariant detection, several vital modifications are proposed. For the RPN sub-network, license plate proposals are generated on both the “Conv5” layer and the “Conv4” layer to combine and produce stronger proposals. The anchor boxes are set based on the priori knowledge regarding inherent shapes of license plates. For the detection sub-network, RoI pooling is conducted on the combined layers of the up-sampled “ Conv5-2x” layer and “Conv4” layer. We estimate three edge points of the license plates by regressing relative positions from horizontal proposals. The fourth edge point is inferred based on the symmetry property to form final bounding parallelograms that tightly enclose the multi-oriented license plates.</p> "> Figure 3
<p>Sample images of our license plate dataset. First two rows: positive samples containing license plates with different orientations and multiple scales. All the license plates were manually labeled by the exact four edge points. Third row: negative samples containing objects similar to license plates.</p> "> Figure 4
<p>Some representative detection results of our <b>MOSI-LPD</b>: (<b>a</b>) results on license plates with different orientations (skewing violently, modestly and slightly for each row); (<b>b</b>) results on license plates with multiple scales (tiny, medium, and large in scale for each row); (<b>c</b>) results on special or low-resolution license plates in the first row, and scarce cases of mistaking or missing of license plates (indicated by yellow ellipses) in the second row.</p> "> Figure 5
<p>Some representative detection results: (<b>a</b>) <b>MO-LPD</b>; (<b>b</b>) <b>Faster R-CNN</b> [<a href="#B27-sensors-19-01175" class="html-bibr">27</a>]; (<b>c</b>) <b>RRPN</b> [<a href="#B35-sensors-19-01175" class="html-bibr">35</a>]; (<b>d</b>) <b>TextBoxes++</b> [<a href="#B37-sensors-19-01175" class="html-bibr">37</a>]. In each subfigure, license plates in the first to the last row were severely, modestly, and slightly skewed, respectively. The parallelograms predicted by <b>MO-LPD</b> contain less redundant information than the horizontal rectangles predicted by <b>Faster R-CNN</b> and rotated rectangles predicted by <b>RRPN</b>.</p> "> Figure 6
<p>Some representative detection results: (<b>a</b>) our <b>MOSI-LPD</b>; (<b>b</b>) <b>MO-LPD</b>. In each subfigure, license plates in the first to the last row were tiny, medium, and large, respectively. Our <b>MOSI-LPD</b> is more invariant to the scale discrepancy of license plates.</p> "> Figure 7
<p>Some representative detection results of our <b>MOSI-LPD</b> on challenging data: (<b>a</b>) performance on blurred images; (<b>b</b>) performance on images with noise.</p> ">
Abstract
:1. Introduction
- We propose novel strategies to tightly enclose the multi-oriented license plates with bounding parallelograms. Both the network architecture and the loss function are elaborately designed to directly regress bounding parallelograms from horizontal proposals. Our method significantly improves the localization precision and guarantees a high detection accuracy simultaneously.
- We design effective strategies to detect license plates with multiple scales. Multiple convolutional layers are exploited both for proposal generation and feature extraction. The priori knowledge regarding inherent shapes of license plates is considered for anchor box design. Our method is highly invariant to the scale discrepancy of license plates, and effectively detects tiny license plates that are only several pixels.
- We construct a large license plate dataset. The dataset contains more than 7000 images, and all the license plates are labeled by the exact edge points. The dataset is publicly available for related research (http://cvrs.whu.edu.cn/projects/LASI-LPL/).
2. Materials and Methods
2.1. Overall Structure
2.2. Multi-Oriented Detection Based on Bounding Parallelograms
2.3. Scale-Invariant Detection
3. Results
3.1. Dataset
3.2. Implementation
3.3. Evaluation Criteria
3.4. Experimental Results
3.4.1. Overall Performance
3.4.2. Multi-Oriented Detection Based on Bounding Parallelograms
3.4.3. Scale-Invariant Detection
3.4.4. Robustness
3.4.5. Detection Speed
4. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Bai, H.; Liu, C. A hybrid license plate extraction method based on edge statistics and morphology. In Proceedings of the International Conference on Pattern Recognition, Cambridge, UK, 23–26 August 2004; pp. 831–834. [Google Scholar]
- Nguyen, C.D.; Ardabilian, M.; Chen, L. Robust Car License Plate Localization Using a Novel Texture Descriptor. In Proceedings of the IEEE International Conference on Advanced Video and Signal Based Surveillance, Genova, Italy, 2–4 September 2009; pp. 523–528. [Google Scholar]
- Tian, B.; Li, Y.; Li, B.; Wen, D. Rear-view vehicle detection and tracking by combining multiple parts for complex urban surveillance. IEEE Trans. Intell. Transp. Syst. 2014, 15, 597–606. [Google Scholar] [CrossRef]
- Patra, S.; Veelaert, P.; Calafate, C.T.; Cano, J.; Zamora, W.; Manzoni, P.; Gonzalez, F. A Forward Collision Warning System for Smartphones Using Image Processing and V2V Communication. Sensors 2018, 18, 2672. [Google Scholar] [CrossRef]
- Rajput, H.; Som, T.; Kar, S. An automated vehicle license plate recognition system. IEEE Trans. Comput. 2015, 48, 56–61. [Google Scholar] [CrossRef]
- Li, Q. A Geometric Framework for Rectangular Shape Detection. IEEE Trans. Image Process. 2014, 23, 4139–4149. [Google Scholar] [CrossRef]
- Alghaili, A.M.; Mashohor, S.; Ramli, A.R.; Ismail, A. Vertical-Edge-Based Car-License-Plate Detection Method. IEEE Trans. Veh. Technol. 2013, 62, 26–38. [Google Scholar] [CrossRef]
- Jin, L.; Xian, H.; Bie, J.; Sun, Y.; Hou, H.; Niu, Q. License Plate Recognition Algorithm for Passenger Cars in Chinese Residential Areas. Sensors 2012, 12, 8355–8370. [Google Scholar] [CrossRef]
- Wen, Y.; Lu, Y.; Yan, J.; Zhou, Z.; Deneen, K.M.V.; Shi, P. An algorithm for license plate recognition applied to intelligent transportation system. IEEE Trans. Intell. Transp. Syst. 2011, 12, 830–845. [Google Scholar] [CrossRef]
- Ashtari, A.H.; Nordin, M.J.; Fathy, M. An Iranian license plate recognition system based on color features. IEEE Trans. Intell. Transp. Syst. 2014, 15, 1690–1705. [Google Scholar] [CrossRef]
- Dun, J.; Zhang, S.; Ye, X.; Zhang, Y. Chinese license plate localization in multi-lane with complex background based on concomitant colors. IEEE Intell. Transp. Syst. Mag. 2015, 7, 51–61. [Google Scholar] [CrossRef]
- Zhou, W.; Li, H.; Lu, Y.; Tian, Q. Principal visual word discovery for automatic license plate detection. IEEE Trans. Image Process. 2012, 21, 4269–4279. [Google Scholar] [CrossRef]
- Wang, W.; Jiang, Q.; Zhou, X.; Wan, W. Car license plate detection based on MSER. In Proceedings of the International Conference on Consumer Electronics, Communications and Networks, Xianning, China, 16–18 April 2011; pp. 3973–3976. [Google Scholar]
- Li, B.; Tian, B.; Li, Y.; Wen, D. Component-based license plate detection using conditional random field model. IEEE Trans. Intell. Transp. Syst. 2013, 14, 1690–1699. [Google Scholar] [CrossRef]
- Gou, C.; Wang, K.; Yao, Y.; Li, Z. Vehicle license plate recognition based on extremal regions and restricted boltzmann machines. IEEE Trans. Intell. Transp. Syst. 2016, 17, 1096–1107. [Google Scholar] [CrossRef]
- Lowe, D.G. Distinctive Image Features from Scale-Invariant Keypoints. IJCV 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Bay, H.; Ess, A.; Tuytelaars, T.; Van Gool, L. Speeded-Up Robust Features (SURF). CVIU 2008, 110, 346–359. [Google Scholar] [CrossRef]
- Felzenszwalb, P.F.; Girshick, R.B.; Mcallester, D.A.; Ramanan, D. Object Detection with Discriminatively Trained Part-Based Models. IEEE Trans. Pattern Anal. Mach. Intell. 2010, 32, 1627–1645. [Google Scholar] [CrossRef]
- Girshick, R.B.; Iandola, F.N.; Darrell, T.; Malik, J. Deformable part models are convolutional neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 437–446. [Google Scholar]
- Wan, L.; Eigen, D.; Fergus, R. End-to-end integration of a Convolutional Network, Deformable Parts Model and non-maximum suppression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 851–859. [Google Scholar]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. In Proceedings of the International Conference on Neural Information Processing Systems, Lake Tahoe, Nevada, USA, 3–8 December 2012; pp. 1097–1105. [Google Scholar]
- Yuan, Y.; Zou, W.; Zhao, Y.; Wang, X.; Hu, X.; Komodakis, N. A robust and efficient approach to license plate detection. IEEE Trans. Image Process. 2017, 26, 1102–1114. [Google Scholar] [CrossRef]
- Li, H.; Shen, C. Reading car license plates using deep convolutional neural networks and LSTMs. arXiv, 2016; arXiv:1601.05610. [Google Scholar]
- Gao, Y.; Lee, H.J. Local Tiled Deep Networks for Recognition of Vehicle Make and Model. Sensors 2016, 16, 226. [Google Scholar] [CrossRef]
- Kurpiel, F.D.; Minetto, R.; Nassu, B.T. Convolutional Neural Networks for License Plate Detection in Images. In Proceedings of the International Conference on Image Processing, Beijing, China, 17–20 September 2017; pp. 3395–3399. [Google Scholar]
- Rafique, M.A.; Pedrycz, W.; Jeon, M. Vehicle license plate detection using region-based convolutional neural networks. Soft Comput. 2018, 22, 6429–6440. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.B.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef]
- Girshick, R.B.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 580–587. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2014, 37, 1904–1916. [Google Scholar] [CrossRef]
- Girshick, R.B. Fast R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 13–16 December 2015; pp. 1440–1448. [Google Scholar]
- Redmon, J.; Divvala, S.K.; Girshick, R.B.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.E.; Fu, C.; Berg, A.C. SSD: Single shot multibox detector. In Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands, 11–14 October 2016; pp. 21–37. [Google Scholar]
- Zhang, Z.; Zhang, C.; Shen, W.; Yao, C.; Liu, W.; Bai, X. Multi-oriented text detection with fully convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 4159–4167. [Google Scholar]
- He, T.; Huang, W.; Qiao, Y.; Yao, J. Accurate text localization in natural image with cascaded convolutional text network. arXiv, 2016; arXiv:1603.09423. [Google Scholar]
- Ma, J.; Shao, W.; Ye, H.; Wang, L.; Wang, H.; Zheng, Y.; Xue, X. Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimed. 2018, 20, 3111–3122. [Google Scholar] [CrossRef]
- Busta, M.; Neumann, L.; Matas, J. Deep textspotter: An end-to-end trainable scene text localization and recognition framework. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2204–2212. [Google Scholar]
- Liao, M.; Shi, B.; Bai, X. Textboxes++: A single-shot oriented scene text detector. IEEE Trans. Image Process. 2018, 27, 3676–3690. [Google Scholar] [CrossRef]
- Long, J.; Shelhamer, E.; Darrell, T. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 3431–3440. [Google Scholar]
- Sifre, L.; Mallat, S. Rotation, scaling and deformation invariant scattering for texture discrimination. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 23–28 June 2013; pp. 1233–1240. [Google Scholar]
- Minaee, S.; Abdolrashidi, A.; Wang, Y. Iris recognition using scattering transform and textural features. In Proceedings of the IEEE Signal Processing and Signal Processing Education Workshop, Salt Lake City, UT, USA, 9–12 August 2015; pp. 37–42. [Google Scholar]
- Minaee, S.; Wang, Y. Palmprint recognition using deep scattering convolutional network. arXiv, 2016; arXiv:1603.09027. [Google Scholar]
- Cai, Z.; Fan, Q.; Feris, R.S.; Vasconcelos, N. A unified multi-scale deep convolutional neural network for fast object detection. In Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands, 8–16 October 2016; pp. 354–370. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Bell, S.; Lawrence Zitnick, C.; Bala, K.; Girshick, R. Inside-outside net: Detecting objects in context with skip pooling and recurrent neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2874–2883. [Google Scholar]
- Kong, T.; Yao, A.; Chen, Y.; Sun, F. Hypernet: Towards accurate region proposal generation and joint object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 845–853. [Google Scholar]
- Li, J.; Niu, C.; Fan, M. Multi-scale convolutional neural networks for natural scene license plate detection. In Proceedings of the International Symposium on Neural Networks, Shenyang, China, 11–14 July 2012; pp. 110–119. [Google Scholar]
- Deng, J.; Dong, W.; Socher, R.; Li, L.; Li, K.; Li, F. ImageNet: A large-scale hierarchical image database. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–26 June 2009; pp. 248–255. [Google Scholar]
- Glorot, X.; Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. J. Mach. Learn. Res. 2010, 9, 249–256. [Google Scholar]
Method | IoU | Precision | Recall | F-Measure |
---|---|---|---|---|
BOCO-LPD [22] | 0.72 | 0.86 | 0.84 | 0.85 |
Faster R-CNN [27] | 0.76 | 0.92 | 0.88 | 0.90 |
MOSI-LPD (ours) | 0.89 | 0.98 | 0.98 | 0.98 |
Method | MO-LPD | Faster R-CNN [27] | RRPN [35] | TextBoxes++ [37] | |||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Dataset | IoU | Precision | Recall | F-Measure | IoU | Precision | Recall | F-Measure | IoU | Precision | Recall | F-Measure | IoU | Precision | Recall | F-Measure | |
Slight | 0.93 | 0.94 | 0.95 | 0.94 | 0.90 | 0.94 | 0.93 | 0.93 | 0.91 | 0.91 | 0.95 | 0.93 | 0.93 | 0.93 | 0.91 | 0.92 | |
Modest | 0.89 | 0.92 | 0.92 | 0.92 | 0.78 | 0.87 | 0.92 | 0.90 | 0.83 | 0.93 | 0.91 | 0.92 | 0.91 | 0.84 | 0.87 | 0.85 | |
Severe | 0.87 | 0.89 | 0.87 | 0.88 | 0.59 | 0.91 | 0.86 | 0.88 | 0.72 | 0.88 | 0.84 | 0.86 | 0.89 | 0.83 | 0.83 | 0.83 |
Method | MOSI-LPD (ours) | MO-LPD | |||||||
---|---|---|---|---|---|---|---|---|---|
Dataset | Precision | Recall | F-Measure | IoU | Precision | Recall | F-Measure | IoU | |
Tiny | 0.98 | 0.96 | 0.97 | 0.92 | 0.87 | 0.86 | 0.86 | 0.91 | |
Medium | 0.99 | 0.98 | 0.98 | 0.91 | 0.96 | 0.91 | 0.93 | 0.88 | |
Large | 0.95 | 0.99 | 0.97 | 0.88 | 0.83 | 0.98 | 0.90 | 0.85 |
Testdata | Precision | Recall | F-Measure | IoU |
---|---|---|---|---|
Original Images | 0.99 | 0.98 | 0.98 | 0.93 |
Blurred Images | 0.97 | 0.97 | 0.97 | 0.91 |
Images with Noises | 0.97 | 0.99 | 0.98 | 0.93 |
Test Subset | MOSI-LPD (ours) | Faster R-CNN [27] | ||||||
---|---|---|---|---|---|---|---|---|
Conv | Proposal | Detection | Total | Conv | Proposal | Detection | Total | |
Dataset10000 | 0.133 | 0.022 | 0.061 | 0.216 | 0.126 | 0.009 | 0.043 | 0.178 |
Slight | 0.143 | 0.018 | 0.059 | 0.220 | 0.132 | 0.014 | 0.040 | 0.186 |
Modest | 0.126 | 0.013 | 0.074 | 0.213 | 0.121 | 0.009 | 0.039 | 0.169 |
Severe | 0.136 | 0.016 | 0.068 | 0.220 | 0.123 | 0.012 | 0.036 | 0.171 |
Tiny | 0.137 | 0.014 | 0.057 | 0.208 | 0.127 | 0.007 | 0.041 | 0.175 |
Medium | 0.142 | 0.018 | 0.063 | 0.223 | 0.119 | 0.011 | 0.038 | 0.168 |
Large | 0.128 | 0.024 | 0.065 | 0.217 | 0.134 | 0.016 | 0.034 | 0.184 |
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Han, J.; Yao, J.; Zhao, J.; Tu, J.; Liu, Y. Multi-Oriented and Scale-Invariant License Plate Detection Based on Convolutional Neural Networks. Sensors 2019, 19, 1175. https://doi.org/10.3390/s19051175
Han J, Yao J, Zhao J, Tu J, Liu Y. Multi-Oriented and Scale-Invariant License Plate Detection Based on Convolutional Neural Networks. Sensors. 2019; 19(5):1175. https://doi.org/10.3390/s19051175
Chicago/Turabian StyleHan, Jing, Jian Yao, Jiao Zhao, Jingmin Tu, and Yahui Liu. 2019. "Multi-Oriented and Scale-Invariant License Plate Detection Based on Convolutional Neural Networks" Sensors 19, no. 5: 1175. https://doi.org/10.3390/s19051175
APA StyleHan, J., Yao, J., Zhao, J., Tu, J., & Liu, Y. (2019). Multi-Oriented and Scale-Invariant License Plate Detection Based on Convolutional Neural Networks. Sensors, 19(5), 1175. https://doi.org/10.3390/s19051175