Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3229556.3229562acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article
Free access

Edge Intelligence: On-Demand Deep Learning Model Co-Inference with Device-Edge Synergy

Published: 07 August 2018 Publication History

Abstract

As the backbone technology of machine learning, deep neural networks (DNNs) have have quickly ascended to the spotlight. Running DNNs on resource-constrained mobile devices is, however, by no means trivial, since it incurs high performance and energy overhead. While offloading DNNs to the cloud for execution suffers unpredictable performance, due to the uncontrolled long wide-area network latency. To address these challenges, in this paper, we propose Edgent, a collaborative and on-demand DNN co-inference framework with device-edge synergy. Edgent pursues two design knobs: (1) DNN partitioning that adaptively partitions DNN computation between device and edge, in order to leverage hybrid computation resources in proximity for real-time DNN inference. (2) DNN right-sizing that accelerates DNN inference through early-exit at a proper intermediate DNN layer to further reduce the computation latency. The prototype implementation and extensive evaluations based on Raspberry Pi demonstrate Edgent's effectiveness in enabling on-demand low-latency edge intelligence.

References

[1]
K. Alex, I. Sutskever, and G. Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In NIPS.
[2]
China Telecom et al. 2017. 5G Mobile/Multi-Access Edge Computing. (2017).
[3]
Forrest N. Iandola et al. 2016. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size. arXiv: 1602.07360 (2016).
[4]
J. Wu et al. 2016. Quantized Convolutional Neural Networks for Mobile Devices. (2016).
[5]
S. Han, H. Mao, and W. Dally. 2015. Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. Fiber (2015).
[6]
Y. Kang, J. Hauswald, C. Gao, A. Rovinski, T. Mudge, J. Mars, and L. Tang. 2017. Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge. In 2017 ASPLOS.
[7]
Y. Kim, E. Park, S. Yoo, T. Choi, L. Yang, and D. Shin. 2016. Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications. In ICLR.
[8]
Alex Krizhevsky. 2009. Learning Multiple Layers of Features from Tiny Images. (2009).
[9]
N. Lane, S. Bhattacharya, A. Mathur, C. Forlivesi, and F. Kawsar. 2016. DXTK: Enabling Resource-efficient Deep Learning on Mobile and Embedded Devices with the DeepX Toolkit.
[10]
V. Mulhollon. 2004. WonderShaper. http://manpages.ubuntu.com/manpages/trusty/man8/wondershaper-8.html. (2004).
[11]
Prefered Networks. 2017. Chainer. https://github.com/chainer/chainer/tree/v1. (2017).
[12]
Aaron Van Den et al. Oord. 2016. WaveNet: A Generative Model for Raw Audio. arXiv: 1609.03499 (2016).
[13]
Qualcomm. 2017. Augmented and Virtual Reality: the First Wave of 5G Killer Apps. (2017).
[14]
Christian et al. Szegedy. 2014. Going deeper with convolutions. (2014).
[15]
S. Teerapittayanon, B. McDanel, and H. T. Kung. 2016. BranchyNet: Fast inference via early exiting from deep neural networks. In 2016 23rd ICPR.
[16]
Di Wang and Eric Nyberg. 2015. A Long Short-Term Memory Model for Answer Sentence Selection in Question Answering. In ACL and IJCNLP.

Cited By

View all
  • (2024)Exploring Computing Paradigms for Electric Vehicles: From Cloud to Edge Intelligence, Challenges and Future DirectionsWorld Electric Vehicle Journal10.3390/wevj1502003915:2(39)Online publication date: 26-Jan-2024
  • (2024)A Survey of Seafloor Characterization and Mapping TechniquesRemote Sensing10.3390/rs1607116316:7(1163)Online publication date: 27-Mar-2024
  • (2024)Opportunities for the Development of Military Cognitive Skills II (Practical Approach)Scientific Bulletin10.2478/bsaft-2024-000929:1(80-90)Online publication date: 7-Jun-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MECOMM'18: Proceedings of the 2018 Workshop on Mobile Edge Communications
August 2018
56 pages
ISBN:9781450359061
DOI:10.1145/3229556
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 August 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Computation Offloading
  2. Deep Learning
  3. Edge Computing
  4. Edge Intelligence

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • National Natural Science Foundation of China
  • Program for Guangdong Introducing Innovative and Enterpreneurial Teams
  • National Key Research and Development Program of China
  • Fundamental Research Funds for the Central Universities

Conference

SIGCOMM '18
Sponsor:
SIGCOMM '18: ACM SIGCOMM 2018 Conference
August 20, 2018
Budapest, Hungary

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)977
  • Downloads (Last 6 weeks)127
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Exploring Computing Paradigms for Electric Vehicles: From Cloud to Edge Intelligence, Challenges and Future DirectionsWorld Electric Vehicle Journal10.3390/wevj1502003915:2(39)Online publication date: 26-Jan-2024
  • (2024)A Survey of Seafloor Characterization and Mapping TechniquesRemote Sensing10.3390/rs1607116316:7(1163)Online publication date: 27-Mar-2024
  • (2024)Opportunities for the Development of Military Cognitive Skills II (Practical Approach)Scientific Bulletin10.2478/bsaft-2024-000929:1(80-90)Online publication date: 7-Jun-2024
  • (2024)Opportunities for the Development of Military Cognitive Skills I (Theoretical Approach)Scientific Bulletin10.2478/bsaft-2024-000829:1(68-79)Online publication date: 7-Jun-2024
  • (2024)Decoupled Access-Execute Enabled DVFS for TinyML Deployments on STM32 Microcontrollers2024 Design, Automation & Test in Europe Conference & Exhibition (DATE)10.23919/DATE58400.2024.10546540(1-6)Online publication date: 25-Mar-2024
  • (2024)DroneBandit: Multi-armed contextual bandits for collaborative edge-to-cloud inference in resource-constrained nanodronesProceedings of the Great Lakes Symposium on VLSI 202410.1145/3649476.3658720(98-104)Online publication date: 12-Jun-2024
  • (2024)PArtNNer: Platform-Agnostic Adaptive Edge-Cloud DNN Partitioning for Minimizing End-to-End LatencyACM Transactions on Embedded Computing Systems10.1145/363026623:1(1-38)Online publication date: 10-Jan-2024
  • (2024)An Empirical Analysis and Resource Footprint Study of Deploying Large Language Models on Edge DevicesProceedings of the 2024 ACM Southeast Conference10.1145/3603287.3651205(69-76)Online publication date: 18-Apr-2024
  • (2024)Collaborative DNNs Inference with Joint Model Partition and Compression in Mobile Edge-Cloud Computing Networks2024 IEEE Wireless Communications and Networking Conference (WCNC)10.1109/WCNC57260.2024.10571207(01-06)Online publication date: 21-Apr-2024
  • (2024)Native Support of AI Applications in 6G Mobile Networks Via an Intelligent User Plane2024 IEEE Wireless Communications and Networking Conference (WCNC)10.1109/WCNC57260.2024.10570691(1-6)Online publication date: 21-Apr-2024
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media