Nothing Special   »   [go: up one dir, main page]

Yang et al., 2023 - Google Patents

Pico: Pipeline inference framework for versatile cnns on diverse mobile devices

Yang et al., 2023

View PDF
Document ID
16060047660103038520
Author
Yang X
Xu Z
Qi Q
Wang J
Sun H
Liao J
Guo S
Publication year
Publication venue
IEEE Transactions on Mobile Computing

External Links

Snippet

Distributing the inference of convolutional neural network (CNN) to multiple mobile devices has been studied in recent years to achieve real-time inference without losing accuracy. However, how to map CNN to devices remains a challenge. On the one hand, scheduling …
Continue reading at arxiv.org (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for programme control, e.g. control unit
    • G06F9/06Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
    • G06F15/163Interprocessor communication
    • G06F15/173Interprocessor communication using an interconnection network, e.g. matrix, shuffle, pyramid, star, snowflake
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/12Computer systems based on biological models using genetic models
    • G06N3/126Genetic algorithms, i.e. information processing using digital simulations of the genetic system
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/76Architectures of general purpose stored programme computers
    • G06F15/80Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors
    • G06F15/8007Architectures of general purpose stored programme computers comprising an array of processing units with common control, e.g. single instruction multiple data processors single instruction multiple data [SIMD] multiprocessors
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • G06N99/005Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50Computer-aided design
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computer systems based on biological models
    • G06N3/02Computer systems based on biological models using neural network models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06QDATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation, e.g. linear programming, "travelling salesman problem" or "cutting stock problem"

Similar Documents

Publication Publication Date Title
Wahab et al. Federated machine learning: Survey, multi-level classification, desirable criteria and future directions in communication and networking systems
Wang et al. Convergence of edge computing and deep learning: A comprehensive survey
Rashidi et al. Astra-sim: Enabling sw/hw co-design exploration for distributed dl training platforms
Mao et al. Mednn: A distributed mobile system with enhanced partition and deployment for large-scale dnns
Tan et al. A distributed cooperative coevolutionary algorithm for multiobjective optimization
Yang et al. Pico: Pipeline inference framework for versatile cnns on diverse mobile devices
Singh et al. Deep-learning-based SDN model for Internet of Things: An incremental tensor train approach
Han et al. Signal processing and networking for big data applications
Wang et al. End-edge-cloud collaborative computing for deep learning: A comprehensive survey
Sun et al. Ensemble-compression: A new method for parallel training of deep neural networks
Yang et al. Towards efficient inference: Adaptively cooperate in heterogeneous iot edge cluster
Shi et al. Multiuser co-inference with batch processing capable edge server
Yadav et al. An opposition-based hybrid evolutionary approach for task scheduling in fog computing network
Luo et al. Efficient pipeline planning for expedited distributed dnn training
Zhang et al. Dynamic DNN decomposition for lossless synergistic inference
Tang et al. Low-memory and high-performance CNN inference on distributed systems at the edge
Song et al. Adaptive and collaborative edge inference in task stream with latency constraint
Wang et al. An asynchronous distributed-memory optimization solver for two-stage stochastic programming problems
Zhang et al. Effective 3C Resource Utilization and Fair Allocation Strategy for Multi-Task Federated Learning
Janbi et al. Distributed artificial intelligence: review, taxonomy, framework, and reference architecture
Cheraghchi et al. Distributed multi-objective cooperative coevolution algorithm for big-data-enabled vessel schedule recovery problem
Fang et al. Joint architecture design and workload partitioning for dnn inference on industrial iot clusters
CN116400963A (en) Model automatic parallel method, device and storage medium based on load balancing
Guo et al. Tree learning: Towards promoting coordination in scalable multi-client training acceleration
Ahn et al. Scissionlite: Accelerating distributed deep neural networks using transfer layer