Yang et al., 2023 - Google Patents
Pico: Pipeline inference framework for versatile cnns on diverse mobile devicesYang et al., 2023
View PDF- Document ID
- 16060047660103038520
- Author
- Yang X
- Xu Z
- Qi Q
- Wang J
- Sun H
- Liao J
- Guo S
- Publication year
- Publication venue
- IEEE Transactions on Mobile Computing
External Links
Snippet
Distributing the inference of convolutional neural network (CNN) to multiple mobile devices has been studied in recent years to achieve real-time inference without losing accuracy. However, how to map CNN to devices remains a challenge. On the one hand, scheduling …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
- G06F15/163—Interprocessor communication
- G06F15/173—Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
- G06F15/80—Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
- G06F15/8007—Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors single instruction multiple data [SIMD] multiprocessors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation, e.g. linear programming, "travelling salesman problem" or "cutting stock problem"
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wahab et al. | Federated machine learning: Survey, multi-level classification, desirable criteria and future directions in communication and networking systems | |
Wang et al. | Convergence of edge computing and deep learning: A comprehensive survey | |
Rashidi et al. | Astra-sim: Enabling sw/hw co-design exploration for distributed dl training platforms | |
Mao et al. | Mednn: A distributed mobile system with enhanced partition and deployment for large-scale dnns | |
Tan et al. | A distributed cooperative coevolutionary algorithm for multiobjective optimization | |
Yang et al. | Pico: Pipeline inference framework for versatile cnns on diverse mobile devices | |
Singh et al. | Deep-learning-based SDN model for Internet of Things: An incremental tensor train approach | |
Han et al. | Signal processing and networking for big data applications | |
Wang et al. | End-edge-cloud collaborative computing for deep learning: A comprehensive survey | |
Sun et al. | Ensemble-compression: A new method for parallel training of deep neural networks | |
Yang et al. | Towards efficient inference: Adaptively cooperate in heterogeneous iot edge cluster | |
Shi et al. | Multiuser co-inference with batch processing capable edge server | |
Yadav et al. | An opposition-based hybrid evolutionary approach for task scheduling in fog computing network | |
Luo et al. | Efficient pipeline planning for expedited distributed dnn training | |
Zhang et al. | Dynamic DNN decomposition for lossless synergistic inference | |
Tang et al. | Low-memory and high-performance CNN inference on distributed systems at the edge | |
Song et al. | Adaptive and collaborative edge inference in task stream with latency constraint | |
Wang et al. | An asynchronous distributed-memory optimization solver for two-stage stochastic programming problems | |
Zhang et al. | Effective 3C Resource Utilization and Fair Allocation Strategy for Multi-Task Federated Learning | |
Janbi et al. | Distributed artificial intelligence: review, taxonomy, framework, and reference architecture | |
Cheraghchi et al. | Distributed multi-objective cooperative coevolution algorithm for big-data-enabled vessel schedule recovery problem | |
Fang et al. | Joint architecture design and workload partitioning for dnn inference on industrial iot clusters | |
CN116400963A (en) | Model automatic parallel method, device and storage medium based on load balancing | |
Guo et al. | Tree learning: Towards promoting coordination in scalable multi-client training acceleration | |
Ahn et al. | Scissionlite: Accelerating distributed deep neural networks using transfer layer |