Nothing Special   »   [go: up one dir, main page]

Skip to main content

Fast Recognition for Multidirectional and Multi-type License Plates with 2D Spatial Attention

  • Conference paper
  • First Online:
Document Analysis and Recognition – ICDAR 2021 (ICDAR 2021)

Abstract

The multi-type license plate can be roughly classified into two categories, i.e., one-line and two-line. Many previous methods are proposed for horizontal one-line license plate recognition and consider license plate recognition as a one-dimensional sequence recognition problem. However, for multidirectional and two-line license plates, the features of adjacent characters may mix together when directly transforming a license plate image into a one-dimensional feature sequence. To solve this problem, we propose a two-dimensional spatial attention module to recognize license plates from a two-dimensional perspective. Specifically, we devise a lightweight and effective network for multidirectional and multi-type license plate recognition in the wild. The proposed network can work in parallel with a fast running speed because it does not contain any time-consuming recurrent structures. Extensive experiments on both public and private datasets verify that the proposed method outperforms state-of-the-art methods and achieves a real-time speed of 278 FPS. Our codes are available at https://github.com/qiLiu77/SALPR.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Baek, J., et al.: What is wrong with scene text recognition model comparisons? Dataset and model analysis. In: ICCV 2019, pp. 4714–4722. IEEE (2019). https://doi.org/10.1109/ICCV.2019.00481

  2. Bahdanau, D., Cho, K., Bengio, Y.: Neural machine translation by jointly learning to align and translate. In: Bengio, Y., LeCun, Y. (eds.) ICLR 2015 (2015)

    Google Scholar 

  3. Breuel, T.M.: High performance text recognition using a hybrid convolutional-LSTM implementation. In: ICDAR 2017, pp. 11–16. IEEE (2017). https://doi.org/10.1109/ICDAR.2017.12

  4. Chen, K., et al.: A compact CNN-DBLSTM based character model for online handwritten Chinese text recognition. In: ICDAR 2017, pp. 1068–1073. IEEE (2017). https://doi.org/10.1109/ICDAR.2017.177

  5. Duan, S., Hu, W., Li, R., Li, W., Sun, S.: Attention enhanced ConvNet-RNN for Chinese vehicle license plate recognition. In: Lai, J.-H., et al. (eds.) PRCV 2018. LNCS, vol. 11257, pp. 417–428. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-03335-4_36

    Chapter  Google Scholar 

  6. Gou, C., Wang, K., Yao, Y., Li, Z.: Vehicle license plate recognition based on extremal regions and restricted Boltzmann machines. IEEE Trans. Intell. Transp. Syst. 17(4), 1096–1107 (2016)

    Article  Google Scholar 

  7. Graves, A., Fernández, S., Gomez, F.J., Schmidhuber, J.: Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In: ICML 2006, vol. 148, pp. 369–376. ACM (2006)

    Google Scholar 

  8. Huang, Y., Luo, C., Jin, L., Lin, Q., Zhou, W.: Attention after attention: Reading text in the wild with cross attention. In: ICDAR 2019, pp. 274–280. IEEE (2019). https://doi.org/10.1109/ICDAR.2019.00052

  9. Jaderberg, M., Simonyan, K., Zisserman, A., Kavukcuoglu, K.: Spatial transformer networks. In: Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, pp. 2017–2025 (2015)

    Google Scholar 

  10. Kessentini, Y., Besbes, M.D., Ammar, S., Chabbouh, A.: A two-stage deep neural network for multi-norm license plate detection and recognition. Expert Syst. Appl. 136, 159–170 (2019)

    Article  Google Scholar 

  11. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: Bengio, Y., LeCun, Y. (eds.) ICLR 2015 (2015)

    Google Scholar 

  12. Li, H., Wang, P., Shen, C.: Toward end-to-end car license plate detection and recognition with deep neural networks. IEEE Trans. Intell. Transp. Syst. 20(3), 1126–1136 (2019)

    Article  Google Scholar 

  13. Li, H., Wang, P., Shen, C., Zhang, G.: Show, attend and read: a simple and strong baseline for irregular text recognition. In: AAAI 2019, pp. 8610–8617. AAAI Press (2019). https://doi.org/10.1609/aaai.v33i01.33018610

  14. Liu, W., et al.: SSD: single shot multibox detector. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9905, pp. 21–37. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46448-0_2

    Chapter  Google Scholar 

  15. Liu, X., Ma, H., Li, S.: PVSS: a progressive vehicle search system for video surveillance networks. J. Comput. Sci. Technol. 34(3), 634–644 (2019). https://doi.org/10.1007/s11390-019-1932-x

    Article  Google Scholar 

  16. Lu, N., Yang, W., Meng, A., Xu, Z., Huang, H., Huang, L.: Automatic recognition for arbitrarily tilted license plate. In: ICVIP 2018, pp. 23–28. ACM (2018). https://doi.org/10.1145/3301506.3301547

  17. Luo, C., Jin, L., Sun, Z.: MORAN: a multi-object rectified attention network for scene text recognition. Pattern Recognit. 90, 109–118 (2019)

    Article  Google Scholar 

  18. Ly, N.T., Nguyen, C.T., Nakagawa, M.: An attention-based end-to-end model for multiple text lines recognition in Japanese historical documents. In: ICDAR 2019, pp. 629–634. IEEE (2019). https://doi.org/10.1109/ICDAR.2019.00106

  19. Martínez-Carballido, J., Alfonso-López, R., Ramírez-Cortés, J.M.: License plate digit recognition using 7\(\times \) 5 binary templates at an outdoor parking lot entrance. In: CONIELECOMP 2011, pp. 18–21. IEEE (2011)

    Google Scholar 

  20. Qin, S., Liu, S.: Towards end-to-end car license plate location and recognition in unconstrained scenarios. CoRR abs/2008.10916 (2020)

    Google Scholar 

  21. Redmon, J., Farhadi, A.: YOLO9000: better, faster, stronger. In: CVPR 2017, pp. 6517–6525. IEEE Computer Society (2017). https://doi.org/10.1109/CVPR.2017.690

  22. Redmon, J., Farhadi, A.: Yolov3: an incremental improvement. CoRR abs/1804.02767 (2018). http://arxiv.org/abs/1804.02767

  23. Shao, W., Chen, L.: License plate recognition data-based traffic volume estimation using collaborative tensor decomposition. IEEE Trans. Intell. Transp. Syst. 19(11), 3439–3448 (2018)

    Article  Google Scholar 

  24. Shi, B., Bai, X., Yao, C.: An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39(11), 2298–2304 (2017)

    Article  Google Scholar 

  25. Spanhel, J., Sochor, J., Juránek, R., Herout, A., Marsik, L., Zemcík, P.: Holistic recognition of low quality license plates by CNN using track annotated data. In: AVSS 2017, pp. 1–6. IEEE Computer Society (2017). https://doi.org/10.1109/AVSS.2017.8078501

  26. Wang, J., Huang, H., Qian, X., Cao, J., Dai, Y.: Sequence recognition of Chinese license plates. Neurocomputing 317, 149–158 (2018)

    Article  Google Scholar 

  27. Wang, T., et al.: Decoupled attention network for text recognition. In: AAAI 2020, pp. 12216–12224. AAAI Press (2020)

    Google Scholar 

  28. Wang, Y., Bian, Z., Zhou, Y., Chau, L.: Rethinking and designing a high-performing automatic license plate recognition approach. CoRR abs/2011.14936 (2020). https://arxiv.org/abs/2011.14936

  29. Xu, Z., et al.: Towards end-to-end license plate detection and recognition: a large dataset and baseline. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11217, pp. 261–277. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01261-8_16

    Chapter  Google Scholar 

  30. Zhang, L., Wang, P., Li, H., Li, Z., Shen, C., Zhang, Y.: A robust attentional framework for license plate recognition in the wild. IEEE Trans. Intell. Transp. Syst. PP(99), 1–10 (2020)

    Google Scholar 

  31. Zherzdev, S., Gruzdev, A.: LPRNet: license plate recognition via deep neural networks. CoRR abs/1806.10447 (2018). http://arxiv.org/abs/1806.10447

Download references

Acknowledgement

The research is supported by National Natural Science Foundation of China (62006018).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xu-Cheng Yin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Liu, Q., Chen, SL., Li, ZJ., Yang, C., Chen, F., Yin, XC. (2021). Fast Recognition for Multidirectional and Multi-type License Plates with 2D Spatial Attention. In: Lladós, J., Lopresti, D., Uchida, S. (eds) Document Analysis and Recognition – ICDAR 2021. ICDAR 2021. Lecture Notes in Computer Science(), vol 12824. Springer, Cham. https://doi.org/10.1007/978-3-030-86337-1_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-86337-1_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-86336-4

  • Online ISBN: 978-3-030-86337-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics