
DOI: 10.1145/3635059.3635070
Research article · Open access

Evaluation and Prediction of Resource Usage for multi-parametric Deep Learning training and inference

Published: 14 February 2024

Abstract

Deep learning is increasingly used in diverse application fields, with results typically surpassing those of traditional machine learning techniques. The portfolio of available neural networks is wide, covering the full range of complexity from compact networks to large ones with multiple layers and parameters. This heterogeneity in model topology is reflected, not necessarily linearly, in the computational resources required for training and inference. Similarly, the environments where neural networks are trained and executed are shifting from fully-fledged centralized nodes to distributed architectures with constrained resources. From this perspective, computational resource requirements can serve as a criterion for resource usage management and neural network selection. In this work we measure the training times of five convolutional neural networks of varying complexity and age (including GoogleNet, ShuffleNet, VGGish, and YAMNet) under different training configurations, varying the batch size, the number of epochs and the learning rate. These measurements are used to create a CPU training-time dataset of more than 500 values. This dataset is used to train and evaluate neural-network-based models for estimating and predicting training times as a function of the model employed and the training parameters. Five regression models have been trained and evaluated in terms of correlation coefficient and root mean square error. In addition, we measure the CPU times needed for inference for a subset of the trained models; these prove to be uncorrelated with the corresponding training times.
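The prediction pipeline the abstract describes — training-configuration features (network, batch size, epochs, learning rate) mapped to measured CPU training time, scored by correlation coefficient and root mean square error — can be sketched as follows. This is a minimal illustration under loud assumptions: the data below is synthetic (the paper's >500-row measurement dataset is not reproduced here), and a random forest stands in for the paper's neural-network-based regressors.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = np.random.default_rng(0)

# Synthetic stand-in for a CPU training-time dataset (the paper's has >500
# rows). Features: network id, batch size, epochs, learning rate. The target
# is a made-up CPU time that grows with epochs and model size and shrinks
# with batch size — purely illustrative, not the paper's measurements.
n = 600
model_id = rng.integers(0, 5, n)              # five CNNs, encoded 0..4
batch = rng.choice([16, 32, 64, 128], n)
epochs = rng.integers(5, 50, n)
lr = rng.choice([1e-4, 1e-3, 1e-2], n)
cpu_time = (1 + model_id) * epochs * (64 / batch) * 10 + rng.normal(0, 5, n)

X = np.column_stack([model_id, batch, epochs, lr])
y = cpu_time
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

# Regression model: a random forest here; the paper evaluates five
# regression models based on neural networks.
reg = RandomForestRegressor(n_estimators=100, random_state=0).fit(X_tr, y_tr)
pred = reg.predict(X_te)

# The two evaluation metrics named in the abstract.
rmse = mean_squared_error(y_te, pred) ** 0.5   # root mean square error
r = np.corrcoef(y_te, pred)[0, 1]              # correlation coefficient
print(f"RMSE: {rmse:.1f} s, correlation: {r:.3f}")
```

The same harness would apply unchanged to the real measurement dataset: only the feature matrix and the measured-time target need to be swapped in.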



Published In

PCI '23: Proceedings of the 27th Pan-Hellenic Conference on Progress in Computing and Informatics
November 2023
304 pages
This work is licensed under a Creative Commons Attribution 4.0 International License.

Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. Classification and regression trees
  2. Computing methodologies
  3. Machine learning
  4. Machine learning approaches

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

PCI 2023

Acceptance Rates

Overall Acceptance Rate 190 of 390 submissions, 49%
