TY - JOUR
T1 - Topic model for graph mining based on hierarchical Dirichlet process
AU - Zhang, Haibin
AU - Huating, Shang
AU - Wu, Xianyi
N1 - Publisher Copyright:
© 2019, © East China Normal University 2019.
PY - 2020/1/2
Y1 - 2020/1/2
N2 - In this paper, a nonparametric Bayesian graph topic model (GTM) based on hierarchical Dirichlet process (HDP) is proposed. The HDP makes the number of topics selected flexibly, which breaks the limitation that the number of topics need to be given in advance. Moreover, the GTM releases the assumption of ‘bag of words’ and considers the graph structure of the text. The combination of HDP and GTM takes advantage of both which is named as HDP–GTM. The variational inference algorithm is used for the posterior inference and the convergence of the algorithm is analysed. We apply the proposed model in text categorisation, comparing to three related topic models, latent Dirichlet allocation (LDA), GTM and HDP.
AB - In this paper, a nonparametric Bayesian graph topic model (GTM) based on hierarchical Dirichlet process (HDP) is proposed. The HDP makes the number of topics selected flexibly, which breaks the limitation that the number of topics need to be given in advance. Moreover, the GTM releases the assumption of ‘bag of words’ and considers the graph structure of the text. The combination of HDP and GTM takes advantage of both which is named as HDP–GTM. The variational inference algorithm is used for the posterior inference and the convergence of the algorithm is analysed. We apply the proposed model in text categorisation, comparing to three related topic models, latent Dirichlet allocation (LDA), GTM and HDP.
KW - Graph topic model
KW - hierarchical Dirichlet process
KW - text classification
KW - variational inference
UR - https://www.scopus.com/pages/publications/85069960824
U2 - 10.1080/24754269.2019.1593098
DO - 10.1080/24754269.2019.1593098
M3 - 文章
AN - SCOPUS:85069960824
SN - 2475-4269
VL - 4
SP - 66
EP - 77
JO - Statistical Theory and Related Fields
JF - Statistical Theory and Related Fields
IS - 1
ER -