Few-Shot Diffusion Models Escape the Curse of Dimensionality

  • Ruofeng Yang
  • , Bo Jiang
  • , Cheng Chen
  • , Ruinan Jin
  • , Baoxiang Wang
  • , Shuai Li*
  • *Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

2 Scopus citations

Abstract

While diffusion models have demonstrated impressive performance, there is a growing need for generating samples tailored to specific user-defined concepts. The customized requirements promote the development of few-shot diffusion models, which use limited nta target samples to fine-tune a pre-trained diffusion model trained on ns source samples. Despite the empirical success, no theoretical work specifically analyzes few-shot diffusion models. Moreover, the existing results for diffusion models without a fine-tuning phase can not explain why few-shot models generate great samples due to the curse of dimensionality. In this work, we analyze few-shot diffusion models under a linear structure distribution with a latent dimension d. From the approximation perspective, we prove that few-shot models have a Oe(ns2/d + nta−1/2) bound to approximate the target score function, which is better than nta2/d results. From the optimization perspective, we consider a latent Gaussian special case and prove that the optimization problem has a closed-form minimizer. This means few-shot models can directly obtain an approximated minimizer without a complex optimization process. Furthermore, we also provide the accuracy bound Oe(1/nta + 1/√ns) for the empirical solution, which still has better dependence on nta compared to ns. The results of the real-world experiments also show that the models obtained by only fine-tuning the encoder and decoder specific to the target distribution can produce novel images with the target feature, which supports our theoretical results.

Original languageEnglish
JournalAdvances in Neural Information Processing Systems
Volume37
StatePublished - 2024
Event38th Conference on Neural Information Processing Systems, NeurIPS 2024 - Vancouver, Canada
Duration: 9 Dec 202415 Dec 2024

Fingerprint

Dive into the research topics of 'Few-Shot Diffusion Models Escape the Curse of Dimensionality'. Together they form a unique fingerprint.

Cite this