Topic model for graph mining based on hierarchical Dirichlet process

Haibin Zhang, Shang Huating, Xianyi Wu

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

In this paper, a nonparametric Bayesian graph topic model (GTM) based on hierarchical Dirichlet process (HDP) is proposed. The HDP makes the number of topics selected flexibly, which breaks the limitation that the number of topics need to be given in advance. Moreover, the GTM releases the assumption of ‘bag of words’ and considers the graph structure of the text. The combination of HDP and GTM takes advantage of both which is named as HDP–GTM. The variational inference algorithm is used for the posterior inference and the convergence of the algorithm is analysed. We apply the proposed model in text categorisation, comparing to three related topic models, latent Dirichlet allocation (LDA), GTM and HDP.

Original languageEnglish
Pages (from-to)66-77
Number of pages12
JournalStatistical Theory and Related Fields
Volume4
Issue number1
DOIs
StatePublished - 2 Jan 2020

Keywords

  • Graph topic model
  • hierarchical Dirichlet process
  • text classification
  • variational inference

Fingerprint

Dive into the research topics of 'Topic model for graph mining based on hierarchical Dirichlet process'. Together they form a unique fingerprint.

Cite this