TY - GEN
T1 - Fine-Grained Expression Manipulation Via Structured Latent Space
AU - Tang, Junshu
AU - Shao, Zhiwen
AU - Ma, Lizhuang
N1 - Publisher Copyright: © 2020 IEEE.
PY - 2020/7
Y1 - 2020/7
N2 - Fine-grained facial expression manipulation is a challenging problem, as fine-grained expression details are difficult to capture. Most existing expression manipulation methods resort to discrete expression labels, which mainly edit global expressions and ignore the manipulation of fine details. To tackle this limitation, we propose an end-to-end expression-guided generative adversarial network (EGGAN), which utilizes structured latent codes and continuous expression labels as input to generate images with expected expressions. Specifically, we adopt an adversarial autoencoder to map a source image into a structured latent space. Then, given the source latent code and the target expression label, we employ a conditional GAN to generate a new image with the target expression. Moreover, we introduce a perceptual loss and a multi-scale structural similarity loss to preserve identity and global shape during generation. Extensive experiments show that our method can manipulate fine-grained expressions and generate continuous intermediate expressions between source and target expressions.
AB - Fine-grained facial expression manipulation is a challenging problem, as fine-grained expression details are difficult to capture. Most existing expression manipulation methods resort to discrete expression labels, which mainly edit global expressions and ignore the manipulation of fine details. To tackle this limitation, we propose an end-to-end expression-guided generative adversarial network (EGGAN), which utilizes structured latent codes and continuous expression labels as input to generate images with expected expressions. Specifically, we adopt an adversarial autoencoder to map a source image into a structured latent space. Then, given the source latent code and the target expression label, we employ a conditional GAN to generate a new image with the target expression. Moreover, we introduce a perceptual loss and a multi-scale structural similarity loss to preserve identity and global shape during generation. Extensive experiments show that our method can manipulate fine-grained expressions and generate continuous intermediate expressions between source and target expressions.
KW - Continuous expression
KW - Fine-grained expression manipulation
KW - Structured latent space
UR - https://www.scopus.com/pages/publications/85090399448
U2 - 10.1109/ICME46284.2020.9102852
DO - 10.1109/ICME46284.2020.9102852
M3 - Conference contribution
AN - SCOPUS:85090399448
T3 - Proceedings - IEEE International Conference on Multimedia and Expo
BT - 2020 IEEE International Conference on Multimedia and Expo, ICME 2020
PB - IEEE Computer Society
T2 - 2020 IEEE International Conference on Multimedia and Expo, ICME 2020
Y2 - 6 July 2020 through 10 July 2020
ER -