Network pruning using linear dependency analysis on feature maps
H Pan, Z Chao, J Qian, B Zhuang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
H Pan, Z Chao, J Qian, B Zhuang, S Wang, J Xiao
ICASSP 2021-2021 IEEE International Conference on Acoustics …, 2021•ieeexplore.ieee.orgNetwork pruning can be achieved by removing redundant channels. In this paper, we regard
a channel 'redundant'if its output is linearly dependent with respect to those of other
channels. Inspired by this, we propose an efficient pruning method, named as LDFM, by
linear dependency analysis on all the feature maps of each individual layer. Specifically, for
each layer, by applying the QR decomposition with column pivoting (PQR) on the matrix
consisting of all feature maps, those channels corresponding to small absolute diagonal …
a channel 'redundant'if its output is linearly dependent with respect to those of other
channels. Inspired by this, we propose an efficient pruning method, named as LDFM, by
linear dependency analysis on all the feature maps of each individual layer. Specifically, for
each layer, by applying the QR decomposition with column pivoting (PQR) on the matrix
consisting of all feature maps, those channels corresponding to small absolute diagonal …
Network pruning can be achieved by removing redundant channels. In this paper, we regard a channel ‘redundant’ if its output is linearly dependent with respect to those of other channels. Inspired by this, we propose an efficient pruning method, named as LDFM, by linear dependency analysis on all the feature maps of each individual layer. Specifically, for each layer, by applying the QR decomposition with column pivoting (PQR) on the matrix consisting of all feature maps, those channels corresponding to small absolute diagonal elements of the R matrix from the PQR decomposition are identified as redundant, and are pruned naturally. Although pruning these channels causes loss of information and hence degrades accuracy, the accuracy of the pruned network can be easily recovered by fine-tuning, as the lost information in the pruned channels can be recovered from that in the retained channels. Extensive experiments demonstrate that LDFM makes great improvement on accuracy with similar parameters and FLOPs as other methods, and achieves the state-of-the-art results on several different benchmarks and networks.
ieeexplore.ieee.org