TY - GEN
T1 - Multi Task-Based Facial Expression Synthesis with Supervision Learning and Feature Disentanglement of Image Style
AU - Lu, Wenya
AU - Peng, Zhibin
AU - Luo, Cheng
AU - Xie, Weicheng
AU - Wen, Jiajun
AU - Lai, Zhihui
AU - Shen, Linlin
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Image-to-image synthesis paradigms have been widely used for facial expression synthesis. However, current generators are apt either to produce artifacts for largely posed and non-aligned faces or, like AdaIN-based generators, to unduly change the identity information. In this work, we propose to use image style features as a surrogate for the expression cues in the generator, and introduce a multi-task learning paradigm that explores this style information via supervised learning and feature disentanglement. While the supervised learning makes the encoded style specifically represent the expression cues and enables the generator to produce the correct expression, the disentanglement of content and style cues enables the generator to better preserve identity information during expression synthesis. Experimental results show that the proposed algorithm substantially reduces artifacts in the synthesis of posed and non-aligned expressions, and achieves competitive performance in terms of FID, PSNR, and classification accuracy compared with four publicly available GANs. The code and pre-trained models are available at https://github.com/lumanxi236/MTSS.
AB - Image-to-image synthesis paradigms have been widely used for facial expression synthesis. However, current generators are apt either to produce artifacts for largely posed and non-aligned faces or, like AdaIN-based generators, to unduly change the identity information. In this work, we propose to use image style features as a surrogate for the expression cues in the generator, and introduce a multi-task learning paradigm that explores this style information via supervised learning and feature disentanglement. While the supervised learning makes the encoded style specifically represent the expression cues and enables the generator to produce the correct expression, the disentanglement of content and style cues enables the generator to better preserve identity information during expression synthesis. Experimental results show that the proposed algorithm substantially reduces artifacts in the synthesis of posed and non-aligned expressions, and achieves competitive performance in terms of FID, PSNR, and classification accuracy compared with four publicly available GANs. The code and pre-trained models are available at https://github.com/lumanxi236/MTSS.
KW - expression style learning
KW - facial expression synthesis
KW - multi-task learning
KW - style and content disentanglement
UR - http://www.scopus.com/inward/record.url?scp=85180743053&partnerID=8YFLogxK
U2 - 10.1109/ICIP49359.2023.10223136
DO - 10.1109/ICIP49359.2023.10223136
M3 - Conference contribution
AN - SCOPUS:85180743053
T3 - Proceedings - International Conference on Image Processing, ICIP
SP - 1005
EP - 1009
BT - 2023 IEEE International Conference on Image Processing, ICIP 2023 - Proceedings
PB - IEEE Computer Society
T2 - 30th IEEE International Conference on Image Processing, ICIP 2023
Y2 - 8 October 2023 through 11 October 2023
ER -