Abstract
The multi-type license plate can be roughly classified into two categories, i.e., one-line and two-line. Many previous methods are proposed for horizontal one-line license plate recognition and consider license plate recognition as a one-dimensional sequence recognition problem. However, for multidirectional and two-line license plates, the features of adjacent characters may mix together when directly transforming a license plate image into a one-dimensional feature sequence. To solve this problem, we propose a two-dimensional spatial attention module to recognize license plates from a two-dimensional perspective. Specifically, we devise a lightweight and effective network for multidirectional and multi-type license plate recognition in the wild. The proposed network can work in parallel with a fast running speed because it does not contain any time-consuming recurrent structures. Extensive experiments on both public and private datasets verify that the proposed method outperforms state-of-the-art methods and achieves a real-time speed of 278 FPS. Our codes are available at https://github.com/qiLiu77/SALPR.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Baek, J., et al.: What is wrong with scene text recognition model comparisons? Dataset and model analysis. In: ICCV 2019, pp. 4714–4722. IEEE (2019). https://doi.org/10.1109/ICCV.2019.00481
Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: Bengio, Y., LeCun, Y. (eds.) ICLR 2015 (2015)
Breuel, T.M.: High performance text recognition using a hybrid convolutional-LSTM implementation. In: ICDAR 2017, pp. 11–16. IEEE (2017). https://doi.org/10.1109/ICDAR.2017.12
Chen, K., et al.: A compact CNN-DBLSTM based character model for online handwritten Chinese text recognition. In: ICDAR 2017, pp. 1068–1073. IEEE (2017). https://doi.org/10.1109/ICDAR.2017.177
Duan, S., Hu, W., Li, R., Li, W., Sun, S.: Attention enhanced ConvNet-RNN for Chinese vehicle license plate recognition. In: Lai, J.-H., et al. (eds.) PRCV 2018. LNCS, vol. 11257, pp. 417–428. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-03335-4_36
Gou, C., Wang, K., Yao, Y., Li, Z.: Vehicle license plate recognition based on extremal regions and restricted Boltzmann machines. IEEE Trans. Intell. Transp. Syst. 17(4), 1096–1107 (2016)
Graves, A., Fernández, S., Gomez, F.J., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: ICML 2006, vol. 148, pp. 369–376. ACM (2006)
Huang, Y., Luo, C., Jin, L., Lin, Q., Zhou, W.: Attention after attention: Reading text in the wild with cross attention. In: ICDAR 2019, pp. 274–280. IEEE (2019). https://doi.org/10.1109/ICDAR.2019.00052
Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K.: Spatial transformer networks. In: Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, pp. 2017–2025 (2015)
Kessentini, Y., Besbes, M.D., Ammar, S., Chabbouh, A.: A two-stage deep neural network for multi-norm license plate detection and recognition. Expert Syst. Appl. 136, 159–170 (2019)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: Bengio, Y., LeCun, Y. (eds.) ICLR 2015 (2015)
Li, H., Wang, P., Shen, C.: Toward end-to-end car license plate detection and recognition with deep neural networks. IEEE Trans. Intell. Transp. Syst. 20(3), 1126–1136 (2019)
Li, H., Wang, P., Shen, C., Zhang, G.: Show, attend and read: a simple and strong baseline for irregular text recognition. In: AAAI 2019, pp. 8610–8617. AAAI Press (2019). https://doi.org/10.1609/aaai.v33i01.33018610
Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2
Liu, X., Ma, H., Li, S.: PVSS: a progressive vehicle search system for video surveillance networks. J. Comput. Sci. Technol. 34(3), 634–644 (2019). https://doi.org/10.1007/s11390-019-1932-x
Lu, N., Yang, W., Meng, A., Xu, Z., Huang, H., Huang, L.: Automatic recognition for arbitrarily tilted license plate. In: ICVIP 2018, pp. 23–28. ACM (2018). https://doi.org/10.1145/3301506.3301547
Luo, C., Jin, L., Sun, Z.: MORAN: a multi-object rectified attention network for scene text recognition. Pattern Recognit. 90, 109–118 (2019)
Ly, N.T., Nguyen, C.T., Nakagawa, M.: An attention-based end-to-end model for multiple text lines recognition in Japanese historical documents. In: ICDAR 2019, pp. 629–634. IEEE (2019). https://doi.org/10.1109/ICDAR.2019.00106
Martínez-Carballido, J., Alfonso-López, R., Ramírez-Cortés, J.M.: License plate digit recognition using 7\(\times \) 5 binary templates at an outdoor parking lot entrance. In: CONIELECOMP 2011, pp. 18–21. IEEE (2011)
Qin, S., Liu, S.: Towards end-to-end car license plate location and recognition in unconstrained scenarios. CoRR abs/2008.10916 (2020)
Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: CVPR 2017, pp. 6517–6525. IEEE Computer Society (2017). https://doi.org/10.1109/CVPR.2017.690
Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. CoRR abs/1804.02767 (2018). http://arxiv.org/abs/1804.02767
Shao, W., Chen, L.: License plate recognition data-based traffic volume estimation using collaborative tensor decomposition. IEEE Trans. Intell. Transp. Syst. 19(11), 3439–3448 (2018)
Shi, B., Bai, X., Yao, C.: An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39(11), 2298–2304 (2017)
Spanhel, J., Sochor, J., Juránek, R., Herout, A., Marsik, L., Zemcík, P.: Holistic recognition of low quality license plates by CNN using track annotated data. In: AVSS 2017, pp. 1–6. IEEE Computer Society (2017). https://doi.org/10.1109/AVSS.2017.8078501
Wang, J., Huang, H., Qian, X., Cao, J., Dai, Y.: Sequence recognition of Chinese license plates. Neurocomputing 317, 149–158 (2018)
Wang, T., et al.: Decoupled attention network for text recognition. In: AAAI 2020, pp. 12216–12224. AAAI Press (2020)
Wang, Y., Bian, Z., Zhou, Y., Chau, L.: Rethinking and designing a high-performing automatic license plate recognition approach. CoRR abs/2011.14936 (2020). https://arxiv.org/abs/2011.14936
Xu, Z., et al.: Towards end-to-end license plate detection and recognition: a large dataset and baseline. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11217, pp. 261–277. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01261-8_16
Zhang, L., Wang, P., Li, H., Li, Z., Shen, C., Zhang, Y.: A robust attentional framework for license plate recognition in the wild. IEEE Trans. Intell. Transp. Syst. PP(99), 1–10 (2020)
Zherzdev, S., Gruzdev, A.: LPRNet: license plate recognition via deep neural networks. CoRR abs/1806.10447 (2018). http://arxiv.org/abs/1806.10447
Acknowledgement
The research is supported by National Natural Science Foundation of China (62006018).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Liu, Q., Chen, SL., Li, ZJ., Yang, C., Chen, F., Yin, XC. (2021). Fast Recognition for Multidirectional and Multi-type License Plates with 2D Spatial Attention. In: Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), vol 12824. Springer, Cham. https://doi.org/10.1007/978-3-030-86337-1_9
Download citation
DOI: https://doi.org/10.1007/978-3-030-86337-1_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86336-4
Online ISBN: 978-3-030-86337-1
eBook Packages: Computer ScienceComputer Science (R0)