Artistry in Pixels: FVS - A Framework for Evaluating Visual Elegance and Sentiment Resonance in Generated Images

  • Weijie Li
  • , Luwei Xiao
  • , Xingjiao Wu
  • , Tianlong Ma
  • , Jiabao Zhao*
  • , Liang He*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

The field of image generation models has seen substantial progress, characterized by a proliferation of diverse generative models and their associated outputs. However, there currently exists a deficiency in methodologies that can concurrently and effectively evaluate both the intrinsic quality of generated images and the alignment between image features and textual prompts. To address these challenges, we propose a novel Framework for evaluating Visual elegance and Sentiment resonance (FVS). The FVS incorporates a novel image aesthetic assessment model, specifically trained to assess the visual attractiveness of the generated images. Additionally, it evaluates the sentiment and aesthetic consistency between textual prompt and the generated image. Experimental results verify that the evaluations from our framework align more closely with human preferences. Moreover, we apply our framework to filter and construct a higher-quality training set of generated images. This curated dataset is then exploited to adapt the generative model, resulting in enhanced generation quality.

Original languageEnglish
Title of host publication2024 IEEE International Conference on Multimedia and Expo, ICME 2024
PublisherIEEE Computer Society
ISBN (Electronic)9798350390155
DOIs
StatePublished - 2024
Event2024 IEEE International Conference on Multimedia and Expo, ICME 2024 - Niagra Falls, Canada
Duration: 15 Jul 202419 Jul 2024

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

Conference2024 IEEE International Conference on Multimedia and Expo, ICME 2024
Country/TerritoryCanada
CityNiagra Falls
Period15/07/2419/07/24

Keywords

  • Generative model
  • Image quality
  • Text-to-image alignment

Fingerprint

Dive into the research topics of 'Artistry in Pixels: FVS - A Framework for Evaluating Visual Elegance and Sentiment Resonance in Generated Images'. Together they form a unique fingerprint.

Cite this