跳到主要导航 跳到搜索 跳到主要内容

Image automatic annotation via multi-view deep representation

  • Yang Yang
  • , Wensheng Zhang*
  • , Yuan Xie
  • *此作品的通讯作者
  • University of Chinese Academy of Sciences

科研成果: 期刊稿件文章同行评审

摘要

The performance of text-based image retrieval is highly dependent on the tedious and inefficient manual work. For the purpose of realizing image keywords generated automatically, extensive work has been done in the area of image annotation. However, how to treat image diverse keywords and choose appropriate features are still two difficult problems. To address this challenge, we propose the multi-view stacked auto-encoder (MVSAE) framework to establish the correlations between the low-level visual features and high-level semantic information. In this paper, a new method, which incorporates the keyword frequencies and log-entropy, is presented to address the imbalanced distribution of keywords. In order to utilize the complementarities among diverse visual descriptors, we tactfully apply multi-view learning to search for the label-specific features. Thereafter, the image keywords are finally produced by appropriate features. Conducting extensive experiments on three popular data sets, we demonstrate that our proposed framework can achieve effective and favorable performance for image annotation.

源语言英语
页(从-至)368-377
页数10
期刊Journal of Visual Communication and Image Representation
33
DOI
出版状态已出版 - 1 11月 2015
已对外发布

指纹

探究 'Image automatic annotation via multi-view deep representation' 的科研主题。它们共同构成独一无二的指纹。

引用此