Skip to main navigation Skip to search Skip to main content

WISE 2014 challenge: Multi-label classification of print media articles to topics

  • Grigorios Tsoumakas
  • , Apostolos Papadopoulos
  • , Weining Qian
  • , Stavros Vologiannidis
  • , Alexander D’yakonov
  • , Antti Puurula
  • , Jesse Read
  • , Jan Švec
  • , Stanislav Semenov
  • Aristotle University of Thessaloniki
  • DataScouting
  • Lomonosov Moscow State University
  • University of Waikato
  • Aalto University
  • University of West Bohemia
  • Yandex School of Data Analysis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The WISE 2014 challenge was concerned with the task of multi-label classification of articles coming from Greek print media. Raw data comes from the scanning of print media, article segmentation, and optical character segmentation, and therefore is quite noisy. Each article is examined by a human annotator and categorized to one or more of the topics being monitored. Topics range from specific persons, products, and companies that can be easily categorized based on keywords, to more general semantic concepts, such as environment or economy. Building multi-label classifiers for the automated annotation of articles into topics can support the work of human annotators by suggesting a list of all topics by order of relevance, or even automate the annotation process for media and/or categories that are easier to predict. This saves valuable time and allows a media monitoring company to expand the portfolio of media being monitored. This paper summarizes the approaches of the top 4 among the 121 teams that participated in the competition.

Original languageEnglish
Title of host publicationWeb Information Systems Engineering – WISE 2014 - 15th International Conference, Proceedings
PublisherSpringer Verlag
Pages2 of 2
ISBN (Print)9783319117454
DOIs
StatePublished - 2014

Publication series

NameLecture Notes in Computer Science
Volume8787
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fingerprint

Dive into the research topics of 'WISE 2014 challenge: Multi-label classification of print media articles to topics'. Together they form a unique fingerprint.

Cite this