TY - GEN
T1 - Feature Fusing of Feature Pyramid Network for Multi-Scale Pedestrian Detection
AU - Tesema, Fiseha B.
AU - Lin, Junpeng
AU - Ou, Jie
AU - Wu, Hong
AU - Zhu, William
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - Pedestrian detection is a fundamental component in many real-world applications such as automatic driving, intelligent surveillance, person re-identification and robotics. Therefore, it has attracted massive attention in the last decades. However, pedestrians in an images always exhibit different scales, which constitutes a significant mode of intra-class variability and affect the performance of pedestrian detection algorithm. To address this problem, we apply FPN (Feature Pyramid Network)for pedestrian detection. FPN exploits the inherent multi-scale structure of a deep convolutional network to construct a feature pyramid that has rich semantics at all levels and facilitates the detection of objects at different scales. To leverage the information from different levels of the feature pyramid, we extend the FPN-based pedestrian detection by fusing the feature of each level with adaptive feature pooling. Furthermore, we also integrate a Squeeze and Excitation module to the ROI pooled features from each level before the feature fusion. The experiment result on Caltech dataset shows that our approach outperforms the basic FPN-based pedestrian detection and robust towards to various scale of pedestrian.
AB - Pedestrian detection is a fundamental component in many real-world applications such as automatic driving, intelligent surveillance, person re-identification and robotics. Therefore, it has attracted massive attention in the last decades. However, pedestrians in an images always exhibit different scales, which constitutes a significant mode of intra-class variability and affect the performance of pedestrian detection algorithm. To address this problem, we apply FPN (Feature Pyramid Network)for pedestrian detection. FPN exploits the inherent multi-scale structure of a deep convolutional network to construct a feature pyramid that has rich semantics at all levels and facilitates the detection of objects at different scales. To leverage the information from different levels of the feature pyramid, we extend the FPN-based pedestrian detection by fusing the feature of each level with adaptive feature pooling. Furthermore, we also integrate a Squeeze and Excitation module to the ROI pooled features from each level before the feature fusion. The experiment result on Caltech dataset shows that our approach outperforms the basic FPN-based pedestrian detection and robust towards to various scale of pedestrian.
KW - Adaptive feature pooling
KW - FPN
KW - Pedestrian detection
KW - Squeeze and excitation network
UR - http://www.scopus.com/inward/record.url?scp=85062835993&partnerID=8YFLogxK
U2 - 10.1109/ICCWAMTIP.2018.8632614
DO - 10.1109/ICCWAMTIP.2018.8632614
M3 - Conference contribution
AN - SCOPUS:85062835993
T3 - 2018 15th International Computer Conference on Wavelet Active Media Technology and Information Processing, ICCWAMTIP 2018
SP - 10
EP - 13
BT - 2018 15th International Computer Conference on Wavelet Active Media Technology and Information Processing, ICCWAMTIP 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 15th International Computer Conference on Wavelet Active Media Technology and Information Processing, ICCWAMTIP 2018
Y2 - 14 December 2018 through 16 December 2018
ER -