Multi-label Image Classification Optimization Model Based on Deep Learning

Xiaojuan Wang⁹,
Jing Xu¹⁰,
Jie Hua¹¹ &
…
Zhanjun Hao¹⁰

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1321))

Included in the following conference series:

China Conference on Wireless Sensor Networks

438 Accesses

Abstract

In order to meet the needs of diversified image information retrieval in the real world, to solve the “semantic gap” problem of image and natural language conversion, and to optimize the accuracy and efficiency of multi-label classification method, this paper proposes a deep learning and multi-label BR algorithm. An image classification method uses a residual neural network with better overall performance to extract image depth learning features, and takes the extraction result as an input, and generates a result vector through spatial regularization of the image space and the label, and the result is obtained. The elements are added as the final prediction result by directly using the residual network prediction result. The whole network is trained by the softmax loss function. Compared with other traditional models, the spatial relationship results of the labels provide a good regularization effect for multi-label image classification, which improves the accuracy on the NUS-WIDE dataset and its recall rate, etc.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Li, S., Li, N., Li, Z.: Multi-tag data mining technology: a review. Comput. Sci. 40(04), 14–21 (2013)
Google Scholar
Chen, Z.: Research on Graph Structure Description and Several Learning Algorithms for Multi-label Classification Problem. South China University of Technology (2015)
Google Scholar
Feng, X.: Summarization of multi-label classification problems. Inf. Syst. Eng. (03), 137 (2016)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.: ImageNet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 25(2), 84–90 (2012)
Google Scholar
Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 818–833. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_53
Chapter Google Scholar
Simonyan, K., Zisserman, A.: Very Deep Convolutional Networks for Large-Scale Image Recognition. Computer Science. arXiv:1049.1556 (2014)
Szegedy, C., Liu, W., Jia, Y., et al.: Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, pp. 1–12 (2015)
Google Scholar
Xu, K., et al.: Show, attend and tell: neural image caption generation with visual attention. In: ICML, vol. 2, no. 3 (2015)
Google Scholar
Li, X., Zhao, F., Guo, Y.: Multi-label image classification with a probabilistic label enhancement model. In: Proceedings of the Uncertainty in Artificial Intelligence, vol. 1, no. 2 (2014)
Google Scholar
Liu, G., Liu, S., Wu, J., Luo, W.: Machine vision target detection algorithm based on deep learning and its application in bill detection. China Test 45(05), 1–9 (2019)
Google Scholar
Wang, Y., Zhang, H., Huang, H.: A survey of image semantic segmentation algorithms based on deep learning. Appl. Electron. Tech. 45(06), 23–27+36 (2019)
Google Scholar
Li, Y.: Multispectral image and multi-label scene classification based on convolutional neural network. Electron. Des. Eng. 26(23), 25–29 (2018)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. arXiv preprint arXiv:1512.03385 (2015)
He, K., Zhang, X., Ren, S., Sun, J.: Identity mappings in deep residual networks. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9908, pp. 630–645. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46493-0_38
Chapter Google Scholar
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: ICML (2015)
Google Scholar
Montañes, E., Senge, R., Barranquero, J., et al.: Dependent binary relevance models for multi-label classification. Pattern Recogn. 47(3), 1494–1508 (2014)
Google Scholar
Jia, Y., et al.: Convolutional architecture for fast feature embedding. arXiv pre-print arXiv:1408.5093 (2014)
Wang, L., Xiong, Y., Wang, Z., Qiao, Y.: Towards good practices for very deep two-stream ConvNets. CoRR, abs/1507.02159 (2015)
Google Scholar
Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., Zheng, Y.: NUS-WIDE: a real-world web image database from National University of Singapore. In: Proceedings of the ACM International Conference on Image and Video Retrieval (2009)
Google Scholar
Wang, J., Yang, Y., Mao, J., Huang, Z., Huang, C., Xu, W.: CNN-RNN: a unified framework for multi-label image classification. In: CVPR (2016)
Google Scholar
Gong, Y., Jia, Y., Leung, T., Toshev, A., Ioffe, S.: Deep convolutional ranking for multilabel image annotation. In: ICLR (2014)
Google Scholar

Download references

Funding

This research was funded by the National Natural Science Foundation of China under Grant No. 61762079, and No. 61662070, Innovation ability improvement project of colleges and universities in Gansu Province in 2019, Grant No: 2019B-024, the Fundamental Research Funds for the Central University of Northwest Minzu University, Grant No: 31920180050.

Author information

Authors and Affiliations

College of Mathematics and Computer Science, Northwest Minzu University, Lanzhou, 730030, China
Xiaojuan Wang
College of Computer Science and Engineering, Northwest Normal University, Lanzhou, 730070, China
Jing Xu & Zhanjun Hao
Hebei Institute of Mechanical and Electrical Technology, Xingtai, 054000, China
Jie Hua

Authors

Xiaojuan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jing Xu
View author publications
You can also search for this author in PubMed Google Scholar
Jie Hua
View author publications
You can also search for this author in PubMed Google Scholar
Zhanjun Hao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaojuan Wang .

Editor information

Editors and Affiliations

Computer Science and Engineering Department, Northwest Normal University, Lanzhou, China
Zhanjun Hao
Computer Science and Engineering Department, Northwest Normal University, Lanzhou, China
Xiaochao Dang
Computer Science and Engineering Department, Northwest Normal University, Lanzhou, China
Honghong Chen
Computer Science and Engineering Department, Northwest Normal University, Lanzhou, China
Fenfang Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, X., Xu, J., Hua, J., Hao, Z. (2020). Multi-label Image Classification Optimization Model Based on Deep Learning. In: Hao, Z., Dang, X., Chen, H., Li, F. (eds) Wireless Sensor Networks. CWSN 2020. Communications in Computer and Information Science, vol 1321. Springer, Singapore. https://doi.org/10.1007/978-981-33-4214-9_20

Download citation

DOI: https://doi.org/10.1007/978-981-33-4214-9_20
Published: 20 November 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-4213-2
Online ISBN: 978-981-33-4214-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the China Computer Federation (CCF) (opens in a new tab)