Abstract
In recent years, deep convolutional neural networks (CNNs) have been widely used in computer vision and have significantly improved the performance of image recognition tasks. Most works use the softmax loss to supervise the training of a CNN and then adopt the output of the last layer as features. However, the discriminative capability of the softmax loss is limited. Here, the authors analyse and improve the softmax loss by manipulating the cosine value and the input feature length. As the approach does not change the principle of the softmax loss, the network can easily be optimised by standard stochastic gradient descent. The MNIST handwritten digits dataset is employed to visualise the features learned with the improved softmax loss. The CASIA-WebFace and FER2013 training sets are used to train deep CNNs for face and expression recognition, respectively. Results on both the LFW dataset and the FER2013 test set show that the proposed softmax loss learns more discriminative features and achieves better performance.
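The abstract does not give the exact formulation, but the general idea of supervising a CNN with cosine-based logits — L2-normalising features and class weights so the softmax operates on scaled cosine similarities rather than raw inner products — can be sketched as follows. The function name, the `scale` value, and the plain cosine (margin-free) form are illustrative assumptions, not the authors' exact loss.

```python
import numpy as np

def cosine_softmax_loss(features, weights, labels, scale=16.0):
    """Cross-entropy over scaled cosine similarities (illustrative sketch).

    features: (N, D) raw feature vectors from the network
    weights:  (C, D) classifier weight vectors, one per class
    labels:   (N,)   integer class labels
    scale:    multiplier on the cosine logits (hypothetical choice)
    """
    # L2-normalise both features and class weights, so each logit is a
    # pure cosine value in [-1, 1] before scaling
    f = features / np.linalg.norm(features, axis=1, keepdims=True)
    w = weights / np.linalg.norm(weights, axis=1, keepdims=True)
    logits = scale * (f @ w.T)  # shape (N, C)

    # numerically stable log-softmax, then mean negative log-likelihood
    logits = logits - logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(len(labels)), labels].mean()
```

Because normalisation removes feature length from the logits, training pressure falls entirely on the angles between features and class weights, which is what encourages the more discriminative (angularly separated) features the abstract describes.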
Original language | English |
---|---|
Pages (from-to) | 97-102 |
Number of pages | 6 |
Journal | Cognitive Computation and Systems |
Volume | 1 |
Issue number | 4 |
DOIs | |
Publication status | Published - Dec 2019 |
Externally published | Yes |
ASJC Scopus subject areas
- Experimental and Cognitive Psychology
- Computer Vision and Pattern Recognition
- Computer Science Applications
- Cognitive Neuroscience
- Artificial Intelligence