跳到主要导航 跳到搜索 跳到主要内容

FiTGAN: Content Fusion with Style Transformation for Few-shot Image Generation

  • Yingbo Zhou
  • , Pengyu Zhang
  • , Yutong Ye
  • , Zhihao Yue
  • , Xian Wei
  • , Mingsong Chen*
  • *此作品的通讯作者
  • East China Normal University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Due to the semantic entanglement in fusion strategies or unstable training in complicated image transformations, existing few-shot image generation methods still suffer from low generation quality and diversity. To tackle the above problems, we propose a novel fusion- and transformation-based framework named content Fusion with style Transformation Generative Adversarial Network (FiTGAN) for few-shot image generation. The basic assumption is that any image consists of a collection of content-related and style-related features. FiTGAN disentangles internal representations with two independent encoders and combines the fused contents and transformed styles to generate new images. Specifically, we design a multi-scale content fusion strategy and a reparameterized style transformation mechanism to learn more fine-grained semantics without changing category-relevant attributes. Furthermore, we formulate a content reconstruction loss and a style divergence loss to provide better training stability and generation performance. Comprehensive experiments on three well-known datasets demonstrate that FiTGAN can not only produce more realistic and diverse images for few-shot image generation but also achieve better classification accuracy for downstream visual applications with limited data.

源语言英语
主期刊名2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Proceedings
编辑Bhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9798350368741
DOI
出版状态已出版 - 2025
活动2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Hyderabad, 印度
期限: 6 4月 202511 4月 2025

出版系列

姓名ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN(印刷版)1520-6149

会议

会议2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
国家/地区印度
Hyderabad
时期6/04/2511/04/25

指纹

探究 'FiTGAN: Content Fusion with Style Transformation for Few-shot Image Generation' 的科研主题。它们共同构成独一无二的指纹。

引用此