EviRank: An evidence based content trust model for web spam detection

  • Wei Wang
  • , Guosun Zeng
  • , Mingjun Sun
  • , Huanan Gu
  • , Quan Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Creating an effective spam detection method is a challenging task. Traditional works usually regard this kind of work as a problem of binary classification. In this paper, however, we argue that it is more property to use the notion of content trust for it, and regard it as a ranking or ordinal regression problem. Evidence is utilized to define the feature of spam web pages, and machine learning techniques are employed to combine the evidence to create a highly efficient and reasonably-accurate detection algorithm. Experiments on real web data are carried out, which improve the proposed method performs very well in practice.

Original languageEnglish
Title of host publicationAdvances in Web and Network Technologies, and Information Management - APWeb/WAIM 2007 International Workshops DBMAN 2007, WebETrends 2007, PAIS 2007 and ASWAN 2007, Proceedings
PublisherSpringer Verlag
Pages299-307
Number of pages9
ISBN (Print)9783540729082
DOIs
StatePublished - 2007
Externally publishedYes
EventApWeb/WAIM 2007 International Workshops: 1st International workshop on Database Management and Applications over Networks, DBMAN 2007 - 1st Workshop on Emerging Trends of Web Technologies and Applications, WebETrends 2007 - International Workshop on - Huang Shan, China
Duration: 16 Jun 200718 Jun 2007

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4537 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceApWeb/WAIM 2007 International Workshops: 1st International workshop on Database Management and Applications over Networks, DBMAN 2007 - 1st Workshop on Emerging Trends of Web Technologies and Applications, WebETrends 2007 - International Workshop on
Country/TerritoryChina
CityHuang Shan
Period16/06/0718/06/07

Keywords

  • Content trust
  • Evidence
  • Ranking
  • SVM, learning
  • Web spam

Fingerprint

Dive into the research topics of 'EviRank: An evidence based content trust model for web spam detection'. Together they form a unique fingerprint.

Cite this