
Using evidence based content trust model for spam detection

  • Wei Wang
  • Guosun Zeng
  • Daizhong Tang*
  • *Corresponding author for this work
  • Tongji University
  • Technology Center of High Performance
  • Ministry of Education of the People's Republic of China

Research output: Contribution to journal, article, peer-reviewed

Abstract

Content trust is one of the main components of information retrieval research. As it becomes easier to add information to the Web via HTML pages, wikis, blogs, and other documents, it becomes harder to distinguish accurate or trustworthy information from inaccurate or untrustworthy information. Current spam detection technology is based on a binary metric; that is, spam detection is treated as a binary classification problem. To meet users' needs and preferences, a more accurate metric is needed both for content trust and for detecting spam. In this paper, we apply the notion of content trust to spam detection and treat it as a ranking problem. In addition to traditional text features, evidence based on information quality is introduced to define the trust features of spam information, and a novel content trust learning algorithm based on this evidence is proposed. Finally, a Web spam detection system is developed and experiments on real Web data are carried out, which show that the proposed method performs very well in practice.

Original language: English
Pages (from-to): 5599-5606
Number of pages: 8
Journal: Expert Systems with Applications
Volume: 37
Issue number: 8
DOI
Publication status: Published - Aug 2010
Externally published: Yes
