Manipulating identical filter redundancy for efficient pruning on deep and complicated CNN

Abstract

The existence of redundancy in convolutional neural networks (CNNs) enables us to remove some filters/channels with acceptable performance drops. However, the training objective of CNNs usually tends to minimize an accuracy-related loss function without any attention paid to the redundancy, making the redundancy distribute randomly on all the filters, such that removing any of them may trigger information loss and accuracy drop, necessitating a fine-tuning step for recovery. In this article, we propose to manipulate the redundancy during training to facilitate network pruning. To this end, we propose a novel centripetal SGD (C-SGD) to make some filters identical, resulting in ideal redundancy patterns , as such filters become purely redundant due to their duplicates, hence removing them does not harm the network. As shown on CIFAR and ImageNet, C-SGD delivers better performance because the redundancy is better organized, compared to the existing methods. The efficiency also characterizes C-SGD because it is as fast as regular SGD, requires no fine-tuning, and can be conducted simultaneously on all the layers even in very deep CNNs. Besides, C-SGD can improve the accuracy of CNNs by first training a model with the same architecture but wider layers and then squeezing it into the original width.

Metadata

Item Type:	Article
Authors/Creators:	Hao, T. Ding, X. Han, J. Guo, Y. Ding, G.
Copyright, Publisher and Additional Information:	© 2023 The Authors. Except as otherwise noted, this author-accepted version of a journal article published in IEEE Transactions on Neural Networks and Learning Systems is made available via the University of Sheffield Research Publications and Copyright Policy under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/
Keywords:	Deep Learning; Convolutional Neural Network; Model Compression; Filter Pruning; Channel Pruning
Dates:	Accepted: 11 July 2023 Published (online): 12 October 2023 Published: November 2024
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User:	Symplectic Sheffield
Date Deposited:	20 Jul 2023 11:29
Last Modified:	01 Nov 2024 10:48
Status:	Published
Publisher:	Institute of Electrical and Electronics Engineers
Refereed:	Yes
Identification Number:	10.1109/TNNLS.2023.3298263
Related URLs:	arXiv URL
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:201659

Download

Accepted Version

Filename: CSGD_tnnls___final_.pdf

Licence: CC-BY 4.0

CLICK TO DOWNLOAD

CORE (COnnecting REpositories)

Manipulating identical filter redundancy for efficient pruning on deep and complicated CNN

Abstract

Metadata

Download

Accepted Version

Export

Statistics