Disentangling the Spatial Structure and Style in Conditional VAE

Ziye Zhang, Li Sun, Zhilin Zheng, Qingli Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

This paper proposes a structure in conditional variation autoencoder (cVAE) to disentangle the latent vector into a spatial structure and a style code, complementary to each other, with the one ( zs) being label relevant and the other ( zu) irrelevant. Different from traditional cVAE, our network maps the condition label into its relevant code zs through a separated module. Depending on whether the label directly relates to the image spatial structure or not, zs output from the condition mapping module is used either as the style code with the two spatial dimension of 1 \times 1, or as the spatial structure code with a single channel. Based on the input image and its corresponding zs, the encoder provides the posterior distribution close to a common prior regardless of its label, thus zu sampled from it becomes label irrelevant. The decoder employs zs and zu by two typical adaptive normalization modules to reconstruct the input image. Results on two datasets with different types of labels show the effectiveness of our method.

Original languageEnglish
Title of host publication2020 IEEE International Conference on Image Processing, ICIP 2020 - Proceedings
PublisherIEEE Computer Society
Pages1626-1630
Number of pages5
ISBN (Electronic)9781728163956
DOIs
StatePublished - Oct 2020
Externally publishedYes
Event2020 IEEE International Conference on Image Processing, ICIP 2020 - Virtual, Abu Dhabi, United Arab Emirates
Duration: 25 Sep 202028 Sep 2020

Publication series

NameProceedings - International Conference on Image Processing, ICIP
Volume2020-October
ISSN (Print)1522-4880

Conference

Conference2020 IEEE International Conference on Image Processing, ICIP 2020
Country/TerritoryUnited Arab Emirates
CityVirtual, Abu Dhabi
Period25/09/2028/09/20

Keywords

  • GAN
  • cVAE
  • disentanglement

Fingerprint

Dive into the research topics of 'Disentangling the Spatial Structure and Style in Conditional VAE'. Together they form a unique fingerprint.

Cite this