Smarttext: Learning to generate harmonious textual layout over natural image

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

12 Scopus citations

Abstract

Automatic typography is important because it helps designers avoid highly repetitive tasks and helps amateur users achieve high-quality textual layout designs. However, automatic typography often involves many parameters that need to be adjusted. In this paper, we propose an efficient, content-aware, learning-based framework to generate harmonious textual layouts over natural images. Our method incorporates both semantic features and visual perception principles. First, we combine a semantic visual saliency detection network with diffusion equations and a text-region proposal algorithm to generate candidate text anchors with various positions and sizes. Second, we develop a deep scoring network to assess the aesthetic quality of the candidate results. We design multiple evaluations to compare our method with several baselines and a commercial poster design tool. The results demonstrate that our method generates harmonious textual layouts in a variety of real-world scenarios and performs better than the compared methods.
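The two-stage idea in the abstract (propose candidate text regions from a saliency map, then score them) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not the paper's implementation: the saliency map is a plain NumPy array rather than the output of a saliency network, candidates come from a simple sliding window rather than the diffusion-based proposal step, and the hypothetical `overlap_score` function stands in for the deep scoring network by simply preferring boxes that cover little salient content.

```python
import numpy as np

def propose_text_boxes(saliency, sizes, stride):
    """Enumerate candidate boxes (top, left, height, width) on a sliding grid.

    Hypothetical stand-in for a text-region proposal step.
    """
    img_h, img_w = saliency.shape
    boxes = []
    for h, w in sizes:
        for top in range(0, img_h - h + 1, stride):
            for left in range(0, img_w - w + 1, stride):
                boxes.append((top, left, h, w))
    return boxes

def overlap_score(saliency, box):
    """Toy score: 1 minus the mean saliency under the box (higher is better).

    Hypothetical stand-in for a learned aesthetic scoring network.
    """
    top, left, h, w = box
    return 1.0 - float(saliency[top:top + h, left:left + w].mean())

def best_text_box(saliency, sizes=((2, 4), (3, 6)), stride=1):
    """Return the candidate box with the highest toy score."""
    boxes = propose_text_boxes(saliency, sizes, stride)
    return max(boxes, key=lambda b: overlap_score(saliency, b))

if __name__ == "__main__":
    # Synthetic saliency map: the top half of the image is fully salient.
    saliency = np.zeros((8, 8))
    saliency[0:4, :] = 1.0
    print(best_text_box(saliency))
```

On the synthetic map above, the selected box falls entirely in the non-salient bottom half, which is the behavior a content-aware layout method aims for: text is placed where it does not occlude the main subject.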

Original language: English
Title of host publication: 2020 IEEE International Conference on Multimedia and Expo, ICME 2020
Publisher: IEEE Computer Society
ISBN (Electronic): 9781728113319
State: Published - Jul 2020
Event: 2020 IEEE International Conference on Multimedia and Expo, ICME 2020 - London, United Kingdom
Duration: 6 Jul 2020 to 10 Jul 2020

Publication series

Name: Proceedings - IEEE International Conference on Multimedia and Expo
Volume: 2020-July
ISSN (Print): 1945-7871
ISSN (Electronic): 1945-788X

Conference

Conference: 2020 IEEE International Conference on Multimedia and Expo, ICME 2020
Country/Territory: United Kingdom
City: London
Period: 6/07/20 to 10/07/20

Keywords

  • Deep learning
  • Image aesthetics
  • Saliency detection
  • Textual layout
  • Visual design
