Robust object representation by boosting-like deep learning architecture

Lei Wang; Baochang Zhang; Jungong Han; Linlin Shen; Cheng shan Qian

doi:10.1016/j.image.2016.06.002

Robust object representation by boosting-like deep learning architecture

Lei Wang, Baochang Zhang, Jungong Han, Linlin Shen, Cheng shan Qian

Research output: Journal Publication › Article › peer-review

13 Citations (Scopus)

Abstract

This paper presents a new deep learning architecture for robust object representation, aiming at efficiently combining the proposed synchronized multi-stage feature (SMF) and a boosting-like algorithm. The SMF structure can capture a variety of characteristics from the inputting object based on the fusion of the handcraft features and deep learned features. With the proposed boosting-like algorithm, we can obtain more convergence stability on training multi-layer network by using the boosted samples. We show the generalization of our object representation architecture by applying it to undertake various tasks, i.e. pedestrian detection and action recognition. Our approach achieves 15.89% and 3.85% reduction in the average miss rate compared with ACF and JointDeep on the largest Caltech dataset, and acquires competitive results on the MSRAction3D dataset.

Original language	English
Pages (from-to)	490-499
Number of pages	10
Journal	Signal Processing: Image Communication
Volume	47
DOIs	https://doi.org/10.1016/j.image.2016.06.002
Publication status	Published - 1 Sept 2016
Externally published	Yes

Keywords

Boosting
Deep learning
Object representation
Synchronized feature

ASJC Scopus subject areas

Software
Signal Processing
Computer Vision and Pattern Recognition
Electrical and Electronic Engineering

Access to Document

10.1016/j.image.2016.06.002

Cite this

@article{36fa0e95ef3849639e09a85d5c094f77,

title = "Robust object representation by boosting-like deep learning architecture",

abstract = "This paper presents a new deep learning architecture for robust object representation, aiming at efficiently combining the proposed synchronized multi-stage feature (SMF) and a boosting-like algorithm. The SMF structure can capture a variety of characteristics from the inputting object based on the fusion of the handcraft features and deep learned features. With the proposed boosting-like algorithm, we can obtain more convergence stability on training multi-layer network by using the boosted samples. We show the generalization of our object representation architecture by applying it to undertake various tasks, i.e. pedestrian detection and action recognition. Our approach achieves 15.89% and 3.85% reduction in the average miss rate compared with ACF and JointDeep on the largest Caltech dataset, and acquires competitive results on the MSRAction3D dataset.",

keywords = "Boosting, Deep learning, Object representation, Synchronized feature",

author = "Lei Wang and Baochang Zhang and Jungong Han and Linlin Shen and Qian, {Cheng shan}",

note = "Publisher Copyright: {\textcopyright} 2016 Elsevier B.V.",

year = "2016",

month = sep,

day = "1",

doi = "10.1016/j.image.2016.06.002",

language = "English",

volume = "47",

pages = "490--499",

journal = "Signal Processing: Image Communication",

issn = "0923-5965",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Robust object representation by boosting-like deep learning architecture

AU - Wang, Lei

AU - Zhang, Baochang

AU - Han, Jungong

AU - Shen, Linlin

AU - Qian, Cheng shan

PY - 2016/9/1

Y1 - 2016/9/1

N2 - This paper presents a new deep learning architecture for robust object representation, aiming at efficiently combining the proposed synchronized multi-stage feature (SMF) and a boosting-like algorithm. The SMF structure can capture a variety of characteristics from the inputting object based on the fusion of the handcraft features and deep learned features. With the proposed boosting-like algorithm, we can obtain more convergence stability on training multi-layer network by using the boosted samples. We show the generalization of our object representation architecture by applying it to undertake various tasks, i.e. pedestrian detection and action recognition. Our approach achieves 15.89% and 3.85% reduction in the average miss rate compared with ACF and JointDeep on the largest Caltech dataset, and acquires competitive results on the MSRAction3D dataset.

AB - This paper presents a new deep learning architecture for robust object representation, aiming at efficiently combining the proposed synchronized multi-stage feature (SMF) and a boosting-like algorithm. The SMF structure can capture a variety of characteristics from the inputting object based on the fusion of the handcraft features and deep learned features. With the proposed boosting-like algorithm, we can obtain more convergence stability on training multi-layer network by using the boosted samples. We show the generalization of our object representation architecture by applying it to undertake various tasks, i.e. pedestrian detection and action recognition. Our approach achieves 15.89% and 3.85% reduction in the average miss rate compared with ACF and JointDeep on the largest Caltech dataset, and acquires competitive results on the MSRAction3D dataset.

KW - Boosting

KW - Deep learning

KW - Object representation

KW - Synchronized feature

UR - http://www.scopus.com/inward/record.url?scp=84991454187&partnerID=8YFLogxK

U2 - 10.1016/j.image.2016.06.002

DO - 10.1016/j.image.2016.06.002

M3 - Article

AN - SCOPUS:84991454187

SN - 0923-5965

VL - 47

SP - 490

EP - 499

JO - Signal Processing: Image Communication

JF - Signal Processing: Image Communication

ER -

Robust object representation by boosting-like deep learning architecture

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this