UP-CNN: Un-pooling augmented convolutional neural network

Chunyan Xu; Jian Yang; Hanjiang Lai; Junbin Gao; Linlin Shen; Shuicheng Yan

doi:10.1016/j.patrec.2017.08.007

Research output: Journal Publication › Article › peer-review

24 Citations (Scopus)

Abstract

Convolutional neural networks (CNNs) have shown remarkable performance in various visual recognition tasks. Most existing CNNs are purely bottom-up, feed-forward architectures, and we argue that they fail to consider the interaction between low-level fine details and high-level semantic information. In this paper, a novel “Un-Pooling augmented Convolutional Neural Network” (UP-CNN) is proposed to boost the discriminative capability of the CNN, with the following three distinctive properties: (1) UP-CNN is a deeper network composed of a bottom-up, a top-down, and then a second bottom-up sub-network, which draw on information at different levels and jointly improve its discriminative capability. (2) By mixing pooling and un-pooling layers, UP-CNN readily allows interaction across convolutional layers of the same size from different sub-networks. This architecture effectively reduces the attenuation of important information, including both activations in the forward pass and gradients during back-propagation. (3) UP-CNN employs a ratio un-pooling operation to reconstruct activations at the original size in the top-down sub-network, so that the spatial information lost during pooling can be preserved within a receptive field. Experiments on four benchmark datasets (CIFAR-10, CIFAR-100, MNIST, and SVHN) demonstrate that the proposed UP-CNN architecture considerably outperforms other state-of-the-art methods.
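
The abstract does not define ratio un-pooling precisely, so the following is only a minimal sketch of one plausible reading: during pooling, store the ratio of each activation to its window's maximum, and during un-pooling, scale the (possibly modified) pooled value by those ratios to recover a map of the original size. The function names `max_pool_with_ratios` and `ratio_unpool`, the 2x2 window, and the use of max pooling are all assumptions made for illustration, not details taken from the paper.

```python
# Hypothetical sketch of pooling followed by a "ratio" un-pooling step.
# This is NOT the authors' implementation; names and details are assumed.

def max_pool_with_ratios(x, k=2):
    """Max-pool a 2-D list with non-overlapping k x k windows.

    Returns (pooled, ratios). `ratios` has the same shape as `x` and holds
    each activation divided by its window's maximum (0.0 if the maximum is 0).
    """
    h, w = len(x), len(x[0])
    if h % k or w % k:
        raise ValueError("height and width must be divisible by k")
    pooled = [[0.0] * (w // k) for _ in range(h // k)]
    ratios = [[0.0] * w for _ in range(h)]
    for i in range(0, h, k):
        for j in range(0, w, k):
            # Maximum activation inside the current window.
            m = max(x[i + di][j + dj] for di in range(k) for dj in range(k))
            pooled[i // k][j // k] = m
            for di in range(k):
                for dj in range(k):
                    ratios[i + di][j + dj] = x[i + di][j + dj] / m if m else 0.0
    return pooled, ratios


def ratio_unpool(pooled, ratios, k=2):
    """Expand `pooled` to the shape of `ratios`, scaling by the stored ratios."""
    h, w = len(ratios), len(ratios[0])
    return [[pooled[i // k][j // k] * ratios[i][j] for j in range(w)]
            for i in range(h)]


if __name__ == "__main__":
    x = [[1.0, 2.0, 0.0, 4.0],
         [3.0, 4.0, 2.0, 8.0],
         [0.0, 0.0, 5.0, 5.0],
         [0.0, 0.0, 10.0, 5.0]]
    pooled, ratios = max_pool_with_ratios(x)
    print(pooled)                        # 2x2 map of window maxima
    print(ratio_unpool(pooled, ratios))  # 4x4 map restored from the ratios
```

In this reading, the ratios carry the within-window spatial layout that plain max pooling discards, which is the kind of information the abstract says is preserved within a receptive field. The sketch assumes non-negative activations, as after a ReLU.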

Original language	English
Pages (from-to)	34-40
Number of pages	7
Journal	Pattern Recognition Letters
Volume	119
DOIs	https://doi.org/10.1016/j.patrec.2017.08.007
Publication status	Published - 1 Mar 2019
Externally published	Yes

Keywords

Convolutional neural network
Cross-layer interaction
Image classification
Ratio un-pooling

ASJC Scopus subject areas

Software
Signal Processing
Computer Vision and Pattern Recognition
Artificial Intelligence

Access to Document

10.1016/j.patrec.2017.08.007

Cite this

@article{db12ad5efb7e4177b3572b2dd3f16127,

title = "{UP-CNN}: Un-pooling augmented convolutional neural network",

abstract = "Convolutional neural network (CNN) has shown remarkable performance in various visual recognition tasks. Most of existing CNN is a purely bottom-up and feed-forward architecture, we argue that it fails to consider the interaction between low-level fine details and high-level semantic information. In this paper, a novel “Un-Pooling augmented Convolutional Neural Network” (UP-CNN) is proposed to boost the discriminative capability of the CNN with the following three distinctive properties: (1) UP-CNN is a deeper network, which is comprised of a bottom-up, a top-down and then a bottom-up sub-networks, associating to different level information that jointly improves its discriminative capability. (2) With the mixture of pooling and un-pooling layers, UP-CNN easily allows the interaction across convolutional layers with the same size from different sub-networks. This architecture effectively depresses the attenuation of important information including both activations in the forward process and gradient information in the back-propagation process. (3) UP-CNN employs the ratio un-pooling operation to reconstruct activations of the original size in the top-down sub-network, where the spatial information that is lost during pooling can be preserved within a receptive field. The experiments on four benchmark datasets (including the CIFAR-10, CIFAR-100, MNIST and SVHN datasets) well demonstrate that the proposed UP-CNN architecture considerably outperforms other state-of-the-art methods.",

keywords = "Convolutional neural network, Cross-layer interaction, Image classification, Ratio un-pooling",

author = "Chunyan Xu and Jian Yang and Hanjiang Lai and Junbin Gao and Linlin Shen and Shuicheng Yan",

note = "Publisher Copyright: {\textcopyright} 2017 Elsevier B.V.",

year = "2019",

month = mar,

day = "1",

doi = "10.1016/j.patrec.2017.08.007",

language = "English",

volume = "119",

pages = "34--40",

journal = "Pattern Recognition Letters",

issn = "0167-8655",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - UP-CNN: Un-pooling augmented convolutional neural network

AU - Xu, Chunyan

AU - Yang, Jian

AU - Lai, Hanjiang

AU - Gao, Junbin

AU - Shen, Linlin

AU - Yan, Shuicheng

PY - 2019/3/1

Y1 - 2019/3/1

N2 - Convolutional neural network (CNN) has shown remarkable performance in various visual recognition tasks. Most of existing CNN is a purely bottom-up and feed-forward architecture, we argue that it fails to consider the interaction between low-level fine details and high-level semantic information. In this paper, a novel “Un-Pooling augmented Convolutional Neural Network” (UP-CNN) is proposed to boost the discriminative capability of the CNN with the following three distinctive properties: (1) UP-CNN is a deeper network, which is comprised of a bottom-up, a top-down and then a bottom-up sub-networks, associating to different level information that jointly improves its discriminative capability. (2) With the mixture of pooling and un-pooling layers, UP-CNN easily allows the interaction across convolutional layers with the same size from different sub-networks. This architecture effectively depresses the attenuation of important information including both activations in the forward process and gradient information in the back-propagation process. (3) UP-CNN employs the ratio un-pooling operation to reconstruct activations of the original size in the top-down sub-network, where the spatial information that is lost during pooling can be preserved within a receptive field. The experiments on four benchmark datasets (including the CIFAR-10, CIFAR-100, MNIST and SVHN datasets) well demonstrate that the proposed UP-CNN architecture considerably outperforms other state-of-the-art methods.

AB - Convolutional neural network (CNN) has shown remarkable performance in various visual recognition tasks. Most of existing CNN is a purely bottom-up and feed-forward architecture, we argue that it fails to consider the interaction between low-level fine details and high-level semantic information. In this paper, a novel “Un-Pooling augmented Convolutional Neural Network” (UP-CNN) is proposed to boost the discriminative capability of the CNN with the following three distinctive properties: (1) UP-CNN is a deeper network, which is comprised of a bottom-up, a top-down and then a bottom-up sub-networks, associating to different level information that jointly improves its discriminative capability. (2) With the mixture of pooling and un-pooling layers, UP-CNN easily allows the interaction across convolutional layers with the same size from different sub-networks. This architecture effectively depresses the attenuation of important information including both activations in the forward process and gradient information in the back-propagation process. (3) UP-CNN employs the ratio un-pooling operation to reconstruct activations of the original size in the top-down sub-network, where the spatial information that is lost during pooling can be preserved within a receptive field. The experiments on four benchmark datasets (including the CIFAR-10, CIFAR-100, MNIST and SVHN datasets) well demonstrate that the proposed UP-CNN architecture considerably outperforms other state-of-the-art methods.

KW - Convolutional neural network

KW - Cross-layer interaction

KW - Image classification

KW - Ratio un-pooling

UR - http://www.scopus.com/inward/record.url?scp=85028317988&partnerID=8YFLogxK

U2 - 10.1016/j.patrec.2017.08.007

DO - 10.1016/j.patrec.2017.08.007

M3 - Article

AN - SCOPUS:85028317988

SN - 0167-8655

VL - 119

SP - 34

EP - 40

JO - Pattern Recognition Letters

JF - Pattern Recognition Letters

ER -
