research-article

MetaCNN: A New Hybrid Deep Learning Image-based Approach for Vehicle Classification Using Transformer-like Framework

Authors:

Ruikang LuoAuthors Info & Claims

CSSE '22: Proceedings of the 5th International Conference on Computer Science and Software Engineering

Pages 517 - 521

https://doi.org/10.1145/3569966.3570099

Published: 20 December 2022 Publication History

Abstract

Abstract—With the development of vehicles and traffic system in the early 21st century, the need for a monitored traffic system and vehicle classification is enlarging. Together with the development of deep learning, computer vision realm has emerged versatile models that is able to fulfill the need of classification. Those popular models include CNN, Vision Trans- former, Metaformer and so on. However, these models handle the problem based on different data processing techniques, they either lacks efficiency or effectiveness. In particular, CNN is shortcoming in global data while ViT is lack of extraction of local information. Therefore, based on this research gap, we proposed a model called MetaCNN, which combines CNN and Poolformer – a specific metaformer structure, which takes the strength of the two models and compensate for both models’ deficiencies. Finally, in order to verify the feasibility of our model, we tested our model on a real-world remote sensing datasets of vehicle images in six different regions with different weather conditions. Our model MetaCNN has demonstrated better recognition performance compared to other baseline models. The results further prove that our model MetaCNN is adept at vehicle classification of remote sensing images though under complex scenarios

References

[1]

G. Dimitrakopoulos and P. Demestichas, “Intelligent transportation systems,”IEEE Vehicular Technology Magazine, vol. 5, no. 1, pp. 77–84, 2010.

[2]

Y. Pei, Y. Huang, Q. Zou, Y. Lu, and S. Wang, “Does haze removal help cnn-based image classification?” in Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 682–697.

Digital Library

[3]

A. Shustanov and P. Yakimov, “Cnn design for real-time traffic sign recognition,” Procedia engineering, vol. 201, pp. 718–725, 2017.

[4]

N. Parmar, A. Vaswani, J. Uszkoreit, L. Kaiser, N. Shazeer, A. Ku, and D. Tran, “Image transformer,” in International conference on machine learning. PMLR, 2018, pp. 4055–4064.

[5]

L. Zhao, Y. Song, C. Zhang, Y. Liu, P. Wang, T. Lin, M. Deng, and H. Li, “T-gcn: A temporal graph convolutional network for traffic prediction,” IEEE Transactions on Intelligent Transportation Systems, vol. 21, no. 9, pp. 3848–3858, 2019.

[6]

F. Wang, K. Liu, F. Long, N. Sang, X. Xia, and J. Sang, “Joint cnn and transformer network via weakly supervised learning for efficient crowd counting,” arXiv preprint arXiv:2203.06388, 2022.

[7]

J. Yang, H. Chen, Y. Xu, Z. Shi, R. Luo, L. Xie, and R. Su, “Domain adaptation for degraded remote scene classification,” in 2020 16th International Conference on Control, Automation, Robotics and Vision (ICARCV). IEEE, 2020, pp. 111– 117.

[8]

K. He, J. Sun, and X. Tang, “Single image haze removal using dark channel prior,” IEEE transactions on pattern analysis and machine intelligence, vol. 33, no. 12, pp. 2341–2353, 2010.

[9]

Z. Dong, Y. Wu, M. Pei, and Y. Jia, “Vehicle type classification using a semisupervised convolutional neural network,” IEEE transactions on intelligent transportation systems, vol. 16, no. 4, pp. 2247–2256, 2015.

[10]

A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, “An image is worth 16x16 words: Transformers for image recognition at scale,” arXiv preprint arXiv:2010.11929, 2020.

[11]

W. Yu, M. Luo, P. Zhou, C. Si, Y. Zhou, X. Wang, J. Feng, and S. Yan, “Metaformer is actually what you need for vision,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 10 819–10 829.

[12]

S. Indolia, A. K. Goswami, S. P. Mishra, and P. Asopa, “Concep- tual understanding of convolutional neural network-a deep learning approach,” Procedia computer science, vol. 132, pp. 679–688, 2018.

Digital Library

[13]

A. F. Agarap, “Deep learning using rectified linear units (relu),” arXiv preprint arXiv:1803.08375, 2018.

[14]

Y. Bazi, L. Bashmal, M. M. A. Rahhal, R. A. Dayil, and N. A. Ajlan, “Vision transformers for remote sensing image classification,” Remote Sensing, vol. 13, no. 3, p.516, 2021.

[15]

Q. Diao, Y. Jiang, B. Wen, J. Sun, and Z. Yuan, “Metaformer: A unified meta framework for fine-grained recognition,” arXiv preprint arXiv:2203.02751, 2022.

[16]

T. N. Mundhenk, G. Konjevod, W. A. Sakla, and K. Boakye, “A large contextual dataset for classification, detection and counting of cars with deep learning,” in European conference on computer vision. Springer, 2016, pp. 785–800.

Index Terms

MetaCNN: A New Hybrid Deep Learning Image-based Approach for Vehicle Classification Using Transformer-like Framework
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
  2. Machine learning
    1. Learning paradigms
      1. Supervised learning
        Supervised learning by classification
    2. Machine learning approaches
      1. Neural networks

Index terms have been assigned to the content through auto-classification.

Recommendations

Pothole Detection Using Deep Learning Classification Method
Abstract
Potholes on roads have been a primary reason for road disasters and damage to the vehicles. Currently due to heavy rains and poor structure accoutrements roads surface have flaws. Detecting the potholes manually is time consuming and will not be ...
Combining CNN Deep Learning and SVM for Vehicle Classification
IC3INA '21: Proceedings of the 2021 International Conference on Computer, Control, Informatics and Its Applications

Vehicle classification plays an important role in an intelligent transportation system. The development of research in image processing, pattern recognition, and deep learning has encouraged more research interest in categorizing vehicles using deep ...
GCN-TRN: Efficient Transformer based Electric Vehicle Charging Demand Forecasting System
CSSE '22: Proceedings of the 5th International Conference on Computer Science and Software Engineering

Accurate traffic prediction has a wide spectrum of benefits for the society. As Electric Vehicles (EV) are being used more frequently, an accurate EV Charging Station availability forecast becomes necessary. These forecasting models can help alleviate ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

CSSE '22: Proceedings of the 5th International Conference on Computer Science and Software Engineering

October 2022

753 pages

ISBN:9781450397780

DOI:10.1145/3569966

Copyright © 2022 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 December 2022

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

CSSE 2022

CSSE 2022: 2022 5th International Conference on Computer Science and Software Engineering

October 21 - 23, 2022

Guilin, China

Acceptance Rates

Overall Acceptance Rate 33 of 74 submissions, 45%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
59
Total Downloads

Downloads (Last 12 months)26
Downloads (Last 6 weeks)2

Reflects downloads up to 26 Sep 2024

Other Metrics

View Author Metrics

Citations

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents