Abstract
Recent studies on multi-domain facial image translation have achieved impressive results. Existing methods generally equip the discriminator with an auxiliary classifier to enforce domain translation, but they neglect important information about domain distribution matching. To address this problem, we propose a switch generative adversarial network (SwitchGAN), with a more adaptive discriminator structure and a matched generator, to perform delicate image translation among multiple domains. A feature-switching operation is proposed to achieve feature selection and fusion in our conditional modules. Furthermore, we introduce a new capability of our generator: attribute intensity control and content-information extraction without tailored training. Experiments on the Morph, RaFD, and CelebA databases visually and quantitatively show that our extended SwitchGAN (i.e., Gated SwitchGAN) achieves better translation results than StarGAN, AttGAN, and STGAN. The attribute classification accuracy of a trained ResNet-18 model and the FID score computed with an ImageNet-pretrained Inception-v3 model further quantitatively confirm the superior performance of our models.
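The abstract does not specify the exact form of the feature-switching operation; below is a minimal, hypothetical sketch of one plausible interpretation, in which per-domain feature branches are selected or fused according to a (one-hot or soft) domain code. The function name `feature_switch` and the tensor layout are assumptions for illustration, not the paper's actual implementation.

```python
import numpy as np

def feature_switch(branch_features, domain_code):
    """Hypothetical feature switching: fuse per-domain branch features,
    weighting each branch by the corresponding entry of the domain code.

    branch_features: array of shape (N, B, C, H, W) — N domain branches
    domain_code:     array of shape (B, N) — one-hot or soft domain labels
    returns:         array of shape (B, C, H, W)
    """
    # For each sample b, sum_n domain_code[b, n] * branch_features[n, b]
    return np.einsum('nbchw,bn->bchw', branch_features, domain_code)

# Toy example: 3 domains, batch of 2 samples
rng = np.random.default_rng(0)
feats = rng.standard_normal((3, 2, 4, 8, 8))
code = np.eye(3)[[0, 2]]          # sample 0 -> domain 0, sample 1 -> domain 2
out = feature_switch(feats, code)
assert out.shape == (2, 4, 8, 8)
assert np.allclose(out[0], feats[0, 0])  # one-hot code selects a single branch
```

With a one-hot code this reduces to hard feature selection; a soft (e.g. interpolated) code would instead blend branches, which is one natural way to realize the attribute intensity control mentioned above.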
| Original language | English |
| --- | --- |
| Pages (from-to) | 1990-2003 |
| Number of pages | 14 |
| Journal | IEEE Transactions on Multimedia |
| Volume | 24 |
| DOIs | |
| Publication status | Published - 2022 |
| Externally published | Yes |
Keywords
- Attribute intensity control
- Feature switching
- GANs
- Image translation
ASJC Scopus subject areas
- Signal Processing
- Media Technology
- Computer Science Applications
- Electrical and Electronic Engineering