Kulkarni et al., 2022 - Google Patents

Hybrid optimization for DNN model compression and inference acceleration

Kulkarni et al., 2022

Document ID: 15954460003257234614
Author: Kulkarni N; Singh N; Joshi Y; Hasabi N; Meena S; Kulkarni U; Gurlahosur S
Publication year: 2022
Publication venue: 2022 2nd International Conference on Intelligent Technologies (CONIT)

External Links

Cited by

Snippet

Deep Neural Networks are known for their applications in the domains like computer vision, natural language processing, speech recognition, pattern recognition etc. Though these models are incredibly powerful they consume a considerable amount of memory bandwidth …

Continue reading at ieeexplore.ieee.org (other versions)

238000005457 optimization 0 title abstract description 37

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6232—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods
- G06K9/6247—Extracting features by transforming the feature space, e.g. multidimensional scaling; Mappings, e.g. subspace methods based on an approximation criterion, e.g. principal component analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G06K9/6256—Obtaining sets of training patterns; Bootstrap methods, e.g. bagging, boosting
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis

Similar Documents

Publication	Publication Date	Title
Kulkarni et al.	2021	Quantization friendly mobilenet (qf-mobilenet) architecture for vision based applications on embedded platforms
US11568258B2 (en)	2023-01-31	Operation method
Singh et al.	2020	Leveraging filter correlations for deep model compression
US20180260710A1 (en)	2018-09-13	Calculating device and method for a sparsely connected artificial neural network
US11055063B2 (en)	2021-07-06	Systems and methods for deep learning processor
Dong et al.	2021	Hao: Hardware-aware neural architecture optimization for efficient inference
Daghero et al.	2021	Energy-efficient deep learning inference on edge devices
EP3816873A1 (en)	2021-05-05	Neural network circuit device, neural network processing method, and neural network execution program
Singh et al.	2019	Shunt connection: An intelligent skipping of contiguous blocks for optimizing MobileNet-V2
Wang et al.	2019	Blur image identification with ensemble convolution neural networks
CA2957695A1 (en)	2018-01-15	System and method for building artificial neural network architectures
CN112446888B (en)	2024-09-13	Image segmentation model processing method and processing device
Zhu et al.	2020	Nasb: Neural architecture search for binary convolutional neural networks
Kulkarni et al.	2022	Hybrid optimization for DNN model compression and inference acceleration
Huai et al.	2021	Zerobn: Learning compact neural networks for latency-critical edge systems
Alhamali et al.	2015	FPGA-accelerated hadoop cluster for deep learning computations
Singh et al.	2020	Hetconv: Beyond homogeneous convolution kernels for deep cnns
Cai et al.	2022	Efficient methods for deep learning
Liu et al.	2021	Rectified binary convolutional networks with generative adversarial learning
Peng et al.	2023	Mbfquant: a multiplier-bitwidth-fixed, mixed-precision quantization method for mobile cnn-based applications
Li et al.	2023	An accelerating convolutional neural networks via a 2D entropy based-adaptive filter search method for image recognition
CN111882028B (en)	2022-04-19	Convolution operation device for convolution neural network
US11481604B2 (en)	2022-10-25	Apparatus and method for neural network processing
Chen et al.	2019	Muffnet: Multi-layer feature federation for mobile deep learning
CN111582444A (en)	2020-08-25	Matrix data processing device, electronic equipment and storage medium