WISE 2014 challenge: Multi-label classification of print media articles to topics

  • Grigorios Tsoumakas
  • , Apostolos Papadopoulos
  • , Weining Qian
  • , Stavros Vologiannidis
  • , Alexander D’yakonov
  • , Antti Puurula
  • , Jesse Read
  • , Jan Švec
  • , Stanislav Semenov

Research output: Contribution to journalArticlepeer-review

14 Scopus citations

Abstract

The WISE 2014 challenge was concerned with the task of multi-label classification of articles coming from Greek print media. Raw data comes from the scanning of print media, article segmentation, and optical character segmentation, and therefore is quite noisy. Each article is examined by a human annotator and categorized to one or more of the topics being monitored. Topics range from specific persons, products, and companies that can be easily categorized based on keywords, to more general semantic concepts, such as environment or economy. Building multi-label classifiers for the automated annotation of articles into topics can support the work of human annotators by suggesting a list of all topics by order of relevance, or even automate the annotation process for media and/or categories that are easier to predict. This saves valuable time and allows a media monitoring company to expand the portfolio of media being monitored. This paper summarizes the approaches of the top 4 among the 121 teams that participated in the competition.

Original languageEnglish
Pages (from-to)541-548
Number of pages8
JournalLecture Notes in Computer Science
Volume8787
DOIs
StatePublished - 2014

Fingerprint

Dive into the research topics of 'WISE 2014 challenge: Multi-label classification of print media articles to topics'. Together they form a unique fingerprint.

Cite this