An information theoretic approach to sentiment polarity classification

Yuming Lin, Jingwei Zhang, Xiaoling Wang, Aoying Zhou

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

44 Scopus citations

Abstract

Sentiment classification is a task of classifying documents according to their overall sentiment inclination. It is very important and popular in many web applications, such as credibility analysis of news sites on the Web, recommendation system and mining online discussion. Vector space model is widely applied on modeling documents in supervised sentiment classification, in which the feature presentation (including features type and weight function) is crucial for classification accuracy. The traditional feature presentation methods of text categorization do not perform well in sentiment classification, because the expressing manners of sentiment are more subtle. We analyze the relationships of terms with sentiment labels based on information theory, and propose a method by applying information theoretic approach on sentiment classification of documents. In this paper, we adopt mutual information on quantifying the sentiment polarities of terms in a document firstly. Then the terms are weighted in vector space based on both sentiment scores and contribution to the document. We perform extensive experiments with SVM on the sets of multiple product reviews, and the experimental results show our approach is more effective than the traditional ones.

Original languageEnglish
Title of host publicationWebQuality 2012 - Proceedings of the 2nd Joint WICOW/AIRWeb Workshop on Web Quality
Pages35-40
Number of pages6
DOIs
StatePublished - 2012
Event2nd Joint WICOW/AIRWeb Workshop on Web Quality, WebQuality 2012 - Lyon, France
Duration: 16 Apr 201216 Apr 2012

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2nd Joint WICOW/AIRWeb Workshop on Web Quality, WebQuality 2012
Country/TerritoryFrance
CityLyon
Period16/04/1216/04/12

Keywords

  • Feature presentation
  • Information theory
  • Mutual information
  • Sentiment classification

Fingerprint

Dive into the research topics of 'An information theoretic approach to sentiment polarity classification'. Together they form a unique fingerprint.

Cite this