跳到主要导航 跳到搜索 跳到主要内容

Disentangling latent space for vae by label relevant/irrelevant dimensions

  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

VAE requires the standard Gaussian distribution as a prior in the latent space. Since all codes tend to follow the same prior, it often suffers the so-called 'posterior collapse'. To avoid this, this paper introduces the class specific distribution for the latent code. But different from CVAE, we present a method for disentangling the latent space into the label relevant and irrelevant dimensions, zs and zu, for a single input. We apply two separated encoders to map the input into zs and zu respectively, and then give the concatenated code to the decoder to reconstruct the input. The label irrelevant code zu represent the common characteristics of all inputs, hence they are constrained by the standard Gaussian, and their encoder is trained in amortized variational inference way, like VAE. While zs is assumed to follow the Gaussian mixture distribution in which each component corresponds to a particular class. The parameters for the Gaussian components in zs encoder are optimized by the label supervision in a global stochastic way. In theory, we show that our method is actually equivalent to adding a KL divergence term on the joint distribution of zs and the class label c, and it can directly increase the mutual information between zs and the label c. Our model can also be extended to GAN by adding a discriminator in the pixel domain so that it produces high quality and diverse images.

源语言英语
主期刊名Proceedings - 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019
出版商IEEE Computer Society
12184-12193
页数10
ISBN(电子版)9781728132938
DOI
出版状态已出版 - 6月 2019
活动32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019 - Long Beach, 美国
期限: 16 6月 201920 6月 2019

出版系列

姓名Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
2019-June
ISSN(印刷版)1063-6919

会议

会议32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2019
国家/地区美国
Long Beach
时期16/06/1920/06/19

指纹

探究 'Disentangling latent space for vae by label relevant/irrelevant dimensions' 的科研主题。它们共同构成独一无二的指纹。

引用此