Yang et al., 2023 - Google Patents

Pico: Pipeline inference framework for versatile cnns on diverse mobile devices

Yang et al., 2023

Document ID: 16060047660103038520
Author: Yang X; Xu Z; Qi Q; Wang J; Sun H; Liao J; Guo S
Publication year: 2023
Publication venue: IEEE Transactions on Mobile Computing

External Links

Cited by

Snippet

Distributing the inference of convolutional neural network (CNN) to multiple mobile devices has been studied in recent years to achieve real-time inference without losing accuracy. However, how to map CNN to devices remains a challenge. On the one hand, scheduling …

Continue reading at arxiv.org (PDF) (other versions)

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G06F15/163—Interprocessor communication
- G06F15/173—Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/80—Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
- G06F15/8007—Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors single instruction multiple data [SIMD] multiprocessors
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation, e.g. linear programming, "travelling salesman problem" or "cutting stock problem"

Similar Documents

Publication	Publication Date	Title
Wahab et al.	2021	Federated machine learning: Survey, multi-level classification, desirable criteria and future directions in communication and networking systems
Wang et al.	2020	Convergence of edge computing and deep learning: A comprehensive survey
Rashidi et al.	2020	Astra-sim: Enabling sw/hw co-design exploration for distributed dl training platforms
Mao et al.	2017	Mednn: A distributed mobile system with enhanced partition and deployment for large-scale dnns
Tan et al.	2006	A distributed cooperative coevolutionary algorithm for multiobjective optimization
Yang et al.	2023	Pico: Pipeline inference framework for versatile cnns on diverse mobile devices
Singh et al.	2019	Deep-learning-based SDN model for Internet of Things: An incremental tensor train approach
Han et al.	2017	Signal processing and networking for big data applications
Wang et al.	2024	End-edge-cloud collaborative computing for deep learning: A comprehensive survey
Sun et al.	2017	Ensemble-compression: A new method for parallel training of deep neural networks
Yang et al.	2021	Towards efficient inference: Adaptively cooperate in heterogeneous iot edge cluster
Shi et al.	2022	Multiuser co-inference with batch processing capable edge server
Yadav et al.	2023	An opposition-based hybrid evolutionary approach for task scheduling in fog computing network
Luo et al.	2022	Efficient pipeline planning for expedited distributed dnn training
Zhang et al.	2021	Dynamic DNN decomposition for lossless synergistic inference
Tang et al.	2021	Low-memory and high-performance CNN inference on distributed systems at the edge
Song et al.	2021	Adaptive and collaborative edge inference in task stream with latency constraint
Wang et al.	2021	An asynchronous distributed-memory optimization solver for two-stage stochastic programming problems
Zhang et al.	2023	Effective 3C Resource Utilization and Fair Allocation Strategy for Multi-Task Federated Learning
Janbi et al.	2023	Distributed artificial intelligence: review, taxonomy, framework, and reference architecture
Cheraghchi et al.	2020	Distributed multi-objective cooperative coevolution algorithm for big-data-enabled vessel schedule recovery problem
Fang et al.	2023	Joint architecture design and workload partitioning for dnn inference on industrial iot clusters
CN116400963A (en)	2023-07-07	Model automatic parallel method, device and storage medium based on load balancing
Guo et al.	2023	Tree learning: Towards promoting coordination in scalable multi-client training acceleration
Ahn et al.	2021	Scissionlite: Accelerating distributed deep neural networks using transfer layer