Joint Deep Multi-View Learning for Image Clustering

  • Yuan Xie
  • , Bingqian Lin
  • , Yanyun Qu*
  • , Cuihua Li
  • , Wensheng Zhang
  • , Lizhuang Ma
  • , Yonggang Wen
  • , Dacheng Tao
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

114 Scopus citations

Abstract

In this paper, a novel Deep Multi-view Joint Clustering (DMJC) framework is proposed, where multiple deep embedded features, multi-view fusion mechanism, and clustering assignments can be learned simultaneously. Through the joint learning strategy, the clustering-friendly multi-view features and useful multi-view complementary information can be exploited effectively to improve the clustering performance. Under the proposed joint learning framework, we design two ingenious variants of deep multi-view joint clustering models, whose multi-view fusion is implemented by two kinds of simple yet effective schemes. The first model, called DMJC-S, performs multi-view fusion in an implicit way via a novel multi-view soft assignment distribution. The second model, termed DMJC-T, defines a novel multi-view auxiliary target distribution to conduct the multi-view fusion explicitly. Both DMJC-S and DMJC-T are optimized under a KL divergence objective. Experiments on eight challenging image datasets demonstrate the superiority of both DMJC-S and DMJC-T over single/multi-view baselines and the state-of-the-art multi-view clustering methods, which proves the effectiveness of the proposed DMJC framework. To the best of our knowledge, this is the first work to model the multi-view clustering in a deep joint framework, which will provide a meaningful thinking in unsupervised multi-view learning.

Original languageEnglish
Pages (from-to)3594-3606
Number of pages13
JournalIEEE Transactions on Knowledge and Data Engineering
Volume33
Issue number11
DOIs
StatePublished - 1 Nov 2021

Keywords

  • Multi-view clustering
  • deep learning
  • multi-view fusion

Fingerprint

Dive into the research topics of 'Joint Deep Multi-View Learning for Image Clustering'. Together they form a unique fingerprint.

Cite this