Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3641512.3690629acmconferencesArticle/Chapter ViewAbstractPublication PagesmobihocConference Proceedingsconference-collections
short-paper
Open access

Demo: An Experimental Platform for AI Model Partitioning on Resource-constrained Devices

Published: 01 October 2024 Publication History

Abstract

Partitioning Artificial Intelligence (AI) models such as Deep Neural Networks (DNNs) or Transformer-based Architectures is essential for minimizing latency in resource-constrained edge computing environments, which is critical for applications such as real-time video analytics, autonomous vehicles, smart IoT systems, and most importantly, on-device Large Language Models (LLMs)1. This paper presents a demonstration of an experimental platform showcased for DNN partitioning for the inference stage, which comprises a WiFi network of Raspberry Pi 4 devices. The platform supports research on optimizing inference in distributed setups, focusing on performance and resource usage and it can support several architectures, ranging from simple and deep NNs, to more advanced transformer-based architectures.

References

[1]
T. Feltin, L. Marchó, J. Cordero-Fuertes, F. Brockners, and T. Clausen. 2023. DNN Partitioning for Inference Throughput Acceleration at the Edge. IEEE Access 11 (2023), 52236--52249.
[2]
D. Kafetzis and I. Koutsopoulos. 2024. DNN Partitioning and Inference Task Offloading in 6G Resource-Constrained Networks. In 2024 EuCNC/6G Summit.
[3]
Z. Wang, L. Boccardo, and Y. Deng. 2024. An Edge-Enabled Wireless Split Learning Testbed: Design and Implementation. IEEE Communications Letters 28, 6 (2024), 1337--1341.

Index Terms

  1. Demo: An Experimental Platform for AI Model Partitioning on Resource-constrained Devices

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MobiHoc '24: Proceedings of the Twenty-fifth International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing
    October 2024
    511 pages
    ISBN:9798400705212
    DOI:10.1145/3641512
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 October 2024

    Check for updates

    Author Tags

    1. NN architecture partitioning
    2. inference
    3. edge computing
    4. resource-constrained devices
    5. experimental platform

    Qualifiers

    • Short-paper

    Conference

    MobiHoc '24
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 296 of 1,843 submissions, 16%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 39
      Total Downloads
    • Downloads (Last 12 months)39
    • Downloads (Last 6 weeks)30
    Reflects downloads up to 21 Nov 2024

    Other Metrics

    Citations

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media