Nothing Special   »   [go: up one dir, main page]

Skip to main content

Multi-label Image Classification Optimization Model Based on Deep Learning

  • Conference paper
  • First Online:
Wireless Sensor Networks (CWSN 2020)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1321))

Included in the following conference series:

  • 425 Accesses

Abstract

In order to meet the needs of diversified image information retrieval in the real world, to solve the “semantic gap” problem of image and natural language conversion, and to optimize the accuracy and efficiency of multi-label classification method, this paper proposes a deep learning and multi-label BR algorithm. An image classification method uses a residual neural network with better overall performance to extract image depth learning features, and takes the extraction result as an input, and generates a result vector through spatial regularization of the image space and the label, and the result is obtained. The elements are added as the final prediction result by directly using the residual network prediction result. The whole network is trained by the softmax loss function. Compared with other traditional models, the spatial relationship results of the labels provide a good regularization effect for multi-label image classification, which improves the accuracy on the NUS-WIDE dataset and its recall rate, etc.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Li, S., Li, N., Li, Z.: Multi-tag data mining technology: a review. Comput. Sci. 40(04), 14–21 (2013)

    Google Scholar 

  2. Chen, Z.: Research on Graph Structure Description and Several Learning Algorithms for Multi-label Classification Problem. South China University of Technology (2015)

    Google Scholar 

  3. Feng, X.: Summarization of multi-label classification problems. Inf. Syst. Eng. (03), 137 (2016)

    Google Scholar 

  4. Krizhevsky, A., Sutskever, I., Hinton, G.: ImageNet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25(2), 84–90 (2012)

    Google Scholar 

  5. Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_53

    Chapter  Google Scholar 

  6. Simonyan, K., Zisserman, A.: Very Deep Convolutional Networks for Large-Scale Image Recognition. Computer Science. arXiv:1049.1556 (2014)

  7. Szegedy, C., Liu, W., Jia, Y., et al.: Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, pp. 1–12 (2015)

    Google Scholar 

  8. Xu, K., et al.: Show, attend and tell: neural image caption generation with visual attention. In: ICML, vol. 2, no. 3 (2015)

    Google Scholar 

  9. Li, X., Zhao, F., Guo, Y.: Multi-label image classification with a probabilistic label enhancement model. In: Proceedings of the Uncertainty in Artificial Intelligence, vol. 1, no. 2 (2014)

    Google Scholar 

  10. Liu, G., Liu, S., Wu, J., Luo, W.: Machine vision target detection algorithm based on deep learning and its application in bill detection. China Test 45(05), 1–9 (2019)

    Google Scholar 

  11. Wang, Y., Zhang, H., Huang, H.: A survey of image semantic segmentation algorithms based on deep learning. Appl. Electron. Tech. 45(06), 23–27+36 (2019)

    Google Scholar 

  12. Li, Y.: Multispectral image and multi-label scene classification based on convolutional neural network. Electron. Des. Eng. 26(23), 25–29 (2018)

    Google Scholar 

  13. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385 (2015)

  14. He, K., Zhang, X., Ren, S., Sun, J.: Identity mappings in deep residual networks. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 630–645. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_38

    Chapter  Google Scholar 

  15. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: ICML (2015)

    Google Scholar 

  16. Montañes, E., Senge, R., Barranquero, J., et al.: Dependent binary relevance models for multi-label classification. Pattern Recogn. 47(3), 1494–1508 (2014)

    Google Scholar 

  17. Jia, Y., et al.: Convolutional architecture for fast feature embedding. arXiv pre-print arXiv:1408.5093 (2014)

  18. Wang, L., Xiong, Y., Wang, Z., Qiao, Y.: Towards good practices for very deep two-stream ConvNets. CoRR, abs/1507.02159 (2015)

    Google Scholar 

  19. Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: NUS-WIDE: a real-world web image database from National University of Singapore. In: Proceedings of the ACM International Conference on Image and Video Retrieval (2009)

    Google Scholar 

  20. Wang, J., Yang, Y., Mao, J., Huang, Z., Huang, C., Xu, W.: CNN-RNN: a unified framework for multi-label image classification. In: CVPR (2016)

    Google Scholar 

  21. Gong, Y., Jia, Y., Leung, T., Toshev, A., Ioffe, S.: Deep convolutional ranking for multilabel image annotation. In: ICLR (2014)

    Google Scholar 

Download references

Funding

This research was funded by the National Natural Science Foundation of China under Grant No. 61762079, and No. 61662070, Innovation ability improvement project of colleges and universities in Gansu Province in 2019, Grant No: 2019B-024, the Fundamental Research Funds for the Central University of Northwest Minzu University, Grant No: 31920180050.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaojuan Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, X., Xu, J., Hua, J., Hao, Z. (2020). Multi-label Image Classification Optimization Model Based on Deep Learning. In: Hao, Z., Dang, X., Chen, H., Li, F. (eds) Wireless Sensor Networks. CWSN 2020. Communications in Computer and Information Science, vol 1321. Springer, Singapore. https://doi.org/10.1007/978-981-33-4214-9_20

Download citation

  • DOI: https://doi.org/10.1007/978-981-33-4214-9_20

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-33-4213-2

  • Online ISBN: 978-981-33-4214-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics