Predicting poll trends using twitter and multivariate time-series classification

  • Tom Mirowski
  • , Shoumik Roychoudhury
  • , Fang Zhou
  • , Zoran Obradovic*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Social media outlets, such as Twitter, provide invaluable information for understanding the social and political climate surrounding particular issues. Millions of people who vary in age, social class, and political beliefs come together in conversation. However, this information poses challenges to making inferences from these tweets. Using the tweets from the 2016 U.S. Presidential campaign, one main research question is addressed in this work. That is, can accurate predictions be made detecting changes in a political candidate’s poll score trends utilizing tweets created during their campaign? The novelty of this work is that we formulate the problem as a multivariate time-series classification problem, which fits the temporal nature of tweets, rather than as a traditional attribute-based classification. Features that represent various aspects of support for (or against) a candidate are tracked on an hour-by-hour basis. Together these form multivariate time-series. One commonly used approach to this problem is based on the majority voting scheme. This method assumes the univariate time-series from different features have equal importance. To alleviate this issue a weighted shapelet transformation model is proposed. Extensive experiments on over 12 million tweets between November 2015 and January 2016 related to the four primary candidates (Bernie Sanders, Hillary Clinton, Donald Trump and Ted Cruz) indicate that the multivariate time-series approach outperforms traditional attribute-based approaches.

Original languageEnglish
Title of host publicationSocial Informatics - 8th International Conference, SocInfo 2016, Proceedings
EditorsEmma Spiro, Yong-Yeol Ahn
PublisherSpringer Verlag
Pages273-289
Number of pages17
ISBN (Print)9783319478791
DOIs
StatePublished - 2016
Externally publishedYes
Event8th International Conference on Social Informatics, SocInfo 2016 - Bellevue, United States
Duration: 11 Nov 201614 Nov 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10046 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th International Conference on Social Informatics, SocInfo 2016
Country/TerritoryUnited States
CityBellevue
Period11/11/1614/11/16

Fingerprint

Dive into the research topics of 'Predicting poll trends using twitter and multivariate time-series classification'. Together they form a unique fingerprint.

Cite this