跳到主要导航 跳到搜索 跳到主要内容

WISE 2014 challenge: Multi-label classification of print media articles to topics

  • Grigorios Tsoumakas
  • , Apostolos Papadopoulos
  • , Weining Qian
  • , Stavros Vologiannidis
  • , Alexander D’yakonov
  • , Antti Puurula
  • , Jesse Read
  • , Jan Švec
  • , Stanislav Semenov
  • Aristotle University of Thessaloniki
  • DataScouting
  • Lomonosov Moscow State University
  • University of Waikato
  • Aalto University
  • University of West Bohemia
  • Yandex School of Data Analysis

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

The WISE 2014 challenge was concerned with the task of multi-label classification of articles coming from Greek print media. Raw data comes from the scanning of print media, article segmentation, and optical character segmentation, and therefore is quite noisy. Each article is examined by a human annotator and categorized to one or more of the topics being monitored. Topics range from specific persons, products, and companies that can be easily categorized based on keywords, to more general semantic concepts, such as environment or economy. Building multi-label classifiers for the automated annotation of articles into topics can support the work of human annotators by suggesting a list of all topics by order of relevance, or even automate the annotation process for media and/or categories that are easier to predict. This saves valuable time and allows a media monitoring company to expand the portfolio of media being monitored. This paper summarizes the approaches of the top 4 among the 121 teams that participated in the competition.

源语言英语
主期刊名Web Information Systems Engineering – WISE 2014 - 15th International Conference, Proceedings
出版商Springer Verlag
2 of 2
ISBN(印刷版)9783319117454
DOI
出版状态已出版 - 2014

出版系列

姓名Lecture Notes in Computer Science
8787
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

指纹

探究 'WISE 2014 challenge: Multi-label classification of print media articles to topics' 的科研主题。它们共同构成独一无二的指纹。

引用此