TY - GEN
T1 - Local Normalization Based BN Layer Pruning
AU - Liu, Yuan
AU - Jia, Xi
AU - Shen, Linlin
AU - Ming, Zhong
AU - Duan, Jinming
N1 - Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - Compression and acceleration of convolutional neural networks (CNNs) have attracted extensive research interest in the past few years. In this paper, we propose a novel channel-level pruning method based on the gamma (scaling) parameters of Batch Normalization (BN) layers to compress and accelerate CNN models. Local gamma normalization and selection is proposed to address the over-pruning issue and to introduce local information into channel selection. An ablation-based beta (shifting parameter) transfer and knowledge-distillation-based fine-tuning are then applied to further improve the performance of the pruned model. Experimental results on the CIFAR-10, CIFAR-100 and LFW datasets suggest that our approach achieves much more efficient pruning in terms of the reduction of parameters and FLOPs, e.g., 8.64× compression and 3.79× acceleration of VGG on CIFAR, with only slight accuracy loss.
KW - Convolutional neural network (CNN)
KW - Knowledge distillation
KW - Model compression and acceleration
KW - Pruning
UR - http://www.scopus.com/inward/record.url?scp=85072858272&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-30484-3_28
DO - 10.1007/978-3-030-30484-3_28
M3 - Conference contribution
AN - SCOPUS:85072858272
SN - 9783030304836
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 334
EP - 346
BT - Artificial Neural Networks and Machine Learning – ICANN 2019
A2 - Tetko, Igor V.
A2 - Karpov, Pavel
A2 - Theis, Fabian
A2 - Kůrková, Věra
PB - Springer Verlag
T2 - 28th International Conference on Artificial Neural Networks, ICANN 2019
Y2 - 17 September 2019 through 19 September 2019
ER -