An improved algorithm for weighting keywords in web documents

  • Shuang Sun*
  • Liang He
  • Jing Yang
  • Jun Zhong Gu
  • *Corresponding author for this work
  • East China Normal University

Research output: Contribution to journal › Article › peer-review

Abstract

This paper presents an improved algorithm, the web-based keyword weight algorithm (WKWA), for weighting keywords in web documents. WKWA combines the representation features of web documents with the advantages of the TF*IDF, TFC and ITC algorithms to make it better suited to web documents. The algorithm is applied to an improved vector space model (IVSM), and a real system has been implemented to calculate the semantic similarity of web documents. Four experiments were carried out: keyword weight calculation, feature item selection, semantic similarity calculation, and WKWA time performance. The results demonstrate that the accuracy of both keyword weighting and semantic similarity is improved.
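For orientation, the three baseline schemes named in the abstract can be sketched using their common textbook definitions. This is a minimal sketch, not WKWA itself, whose formula is not given on this page. It assumes TF*IDF is the raw term frequency times log(N/n), TFC is TF*IDF with L2 length normalization, and ITC replaces the raw frequency with log(f + 1) before normalizing. All function names are hypothetical, and cosine similarity stands in for the similarity measure of a plain vector space model rather than the paper's IVSM.

```python
import math
from collections import Counter


def document_frequencies(docs):
    """Count, for each term, the number of documents that contain it."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))
    return df


def l2_normalize(weights):
    """Scale a sparse weight vector to unit Euclidean length."""
    norm = math.sqrt(sum(w * w for w in weights.values()))
    if norm == 0.0:
        return dict(weights)
    return {t: w / norm for t, w in weights.items()}


def tfidf(tokens, df, n_docs):
    """TF*IDF (assumed form): raw term frequency times log(N / n_t)."""
    tf = Counter(tokens)
    return {t: f * math.log(n_docs / df[t]) for t, f in tf.items()}


def tfc(tokens, df, n_docs):
    """TFC (assumed form): TF*IDF followed by length normalization."""
    return l2_normalize(tfidf(tokens, df, n_docs))


def itc(tokens, df, n_docs):
    """ITC (assumed form): log(f + 1) replaces the raw frequency in TFC."""
    tf = Counter(tokens)
    raw = {t: math.log(f + 1.0) * math.log(n_docs / df[t]) for t, f in tf.items()}
    return l2_normalize(raw)


def cosine(a, b):
    """Cosine similarity between two sparse weight vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)
```

Each document is passed as a list of tokens drawn from the same collection used to build `df`; calling `cosine(itc(d1, df, n), itc(d2, df, n))` then gives a similarity score between two documents. A term that occurs in every document receives weight zero under all three schemes, which is one reason plain TF*IDF-style weights may need adjusting for web pages.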

Original language: English
Pages (from-to): 235-239
Number of pages: 5
Journal: Journal of Shanghai University
Volume: 12
Issue number: 3
Publication status: Published - Jun 2008
