Automatic topic identification using webpage clustering

  • Xiaofeng He*
  • , Chris H.Q. Ding
  • , Hongyuan Zha
  • , Horst D. Simon
  • *此作品的通讯作者

科研成果: 书/报告/会议事项章节会议稿件同行评审

65 引用 (Scopus)

摘要

Grouping webpages into distinct topics is one way to organize the large amount of retrieved information on the web. In this paper, we report that based on similarity metric which incorporates textual information, hyperlink structure and co-citation relations, an unsupervised clustering method can automatically and effectively identify relevant topics, as shown in experiments on several retrieved sets of webpages. The clustering method is a state-of-art spectral graph partitioning method based on normalized cut criterion first developed for image segmentation.

源语言英语
主期刊名Proceedings - 2001 IEEE International Conference on Data Mining, ICDM'01
195-202
页数8
出版状态已出版 - 2001
已对外发布
活动1st IEEE International Conference on Data Mining, ICDM'01 - San Jose, CA, 美国
期限: 29 11月 20012 12月 2001

出版系列

姓名Proceedings - IEEE International Conference on Data Mining, ICDM
ISSN(印刷版)1550-4786

会议

会议1st IEEE International Conference on Data Mining, ICDM'01
国家/地区美国
San Jose, CA
时期29/11/012/12/01

指纹

探究 'Automatic topic identification using webpage clustering' 的科研主题。它们共同构成独一无二的指纹。

引用此