TY - GEN
T1 - Feature map masking based single-stage face detection
AU - Zhang, Xi
AU - Chen, Junliang
AU - Xie, Weicheng
AU - Shen, Linlin
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/9/28
Y1 - 2020/9/28
N2 - Although great progress has been made in face detection, a trade-off between speed and accuracy is still a great challenge. We propose in this paper a feature map masking based approach for single-stage face detection. As feature maps extracted from feature pyramid network might contain face unrelated features, we propose a mask generation branch to predict those significant units for face detection. The masked feature maps, where only important features are left, are then passed through the following detection process. Ground truth masks, directly generated from the training images, based on the face bounding boxes, are used to train the feature mask generation module. A mask constrained dropout module has also been proposed to drop out significant units of the shared feature maps, such that the detection performance can be further improved. The proposed approach is extensively tested using the WIDER FACE dataset. The results suggest that our detector with ResNet-152 backbone, achieves the best precision-recall performance among competing methods. As high as 95.4%, 94.0% and 86.9% accuracies have been achieved on the easy, medium and hard subsets, respectively.
AB - Although great progress has been made in face detection, a trade-off between speed and accuracy is still a great challenge. We propose in this paper a feature map masking based approach for single-stage face detection. As feature maps extracted from feature pyramid network might contain face unrelated features, we propose a mask generation branch to predict those significant units for face detection. The masked feature maps, where only important features are left, are then passed through the following detection process. Ground truth masks, directly generated from the training images, based on the face bounding boxes, are used to train the feature mask generation module. A mask constrained dropout module has also been proposed to drop out significant units of the shared feature maps, such that the detection performance can be further improved. The proposed approach is extensively tested using the WIDER FACE dataset. The results suggest that our detector with ResNet-152 backbone, achieves the best precision-recall performance among competing methods. As high as 95.4%, 94.0% and 86.9% accuracies have been achieved on the easy, medium and hard subsets, respectively.
UR - http://www.scopus.com/inward/record.url?scp=85099692967&partnerID=8YFLogxK
U2 - 10.1109/IJCB48548.2020.9304927
DO - 10.1109/IJCB48548.2020.9304927
M3 - Conference contribution
AN - SCOPUS:85099692967
T3 - IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics
BT - IJCB 2020 - IEEE/IAPR International Joint Conference on Biometrics
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE/IAPR International Joint Conference on Biometrics, IJCB 2020
Y2 - 28 September 2020 through 1 October 2020
ER -