TY - GEN
T1 - Content trust model for detecting web spam
AU - Wang, Wei
AU - Zeng, Guosun
PY - 2007
Y1 - 2007
N2 - As it gets easier to add information to the web via html pages, wikis, blogs, and other documents, it gets tougher to distinguish accurate or trustworthy information from inaccurate or untrustworthy information. Moreover, apart from inaccurate or untrustworthy information, we also need to anticipate web spam - where spammers publish false facts and scams to deliberately mislead users. Creating an effective spam detection method is a challenge. In this paper, we use the notion of content trust for spam detection, and regard it as a ranking problem. Evidence is utilized to define the feature of spam web pages, and machine learning techniques are employed to combine the evidence to create a highly efficient and reasonably-accurate spam detection algorithm. Experiments on real web data are carried out, which show the proposed method performs very well in practice.
AB - As it gets easier to add information to the web via html pages, wikis, blogs, and other documents, it gets tougher to distinguish accurate or trustworthy information from inaccurate or untrustworthy information. Moreover, apart from inaccurate or untrustworthy information, we also need to anticipate web spam - where spammers publish false facts and scams to deliberately mislead users. Creating an effective spam detection method is a challenge. In this paper, we use the notion of content trust for spam detection, and regard it as a ranking problem. Evidence is utilized to define the feature of spam web pages, and machine learning techniques are employed to combine the evidence to create a highly efficient and reasonably-accurate spam detection algorithm. Experiments on real web data are carried out, which show the proposed method performs very well in practice.
UR - https://www.scopus.com/pages/publications/36649031638
U2 - 10.1007/978-0-387-73655-6_10
DO - 10.1007/978-0-387-73655-6_10
M3 - 会议稿件
AN - SCOPUS:36649031638
SN - 9780387736549
T3 - IFIP International Federation for Information Processing
SP - 139
EP - 152
BT - Trust Management
A2 - Etalle, Sandro
A2 - Marsh, Stephen
ER -