Improving the quality of crowdsourcing labels by combination of golden data and incentive

Peijun Yang, Haibin Cai, Zhiming Zheng

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

4 Scopus citations

Abstract

The rapid rise of deep learning and AI is inseparable from the support of massive labeled data. Crowdsourcing has become a cheap and efficient paradigm for labeling large-scale unlabeled data. However, because of the uncertainty inherent in crowdsourcing workers (also called labelers), the process yields many low-quality and incorrect labels. To address this fundamental challenge, many redundancy-based ground truth inference algorithms have been proposed in the past few years; they assign each labeling task to multiple workers and infer the true label of each instance in the task from its set of multiple labels. In this paper, we devise a novel scheme to improve the quality of labeled data and infer the true label. The scheme uses a small proportion of golden data, which has already been labeled correctly, to estimate workers' ability and reliability, and uses an incentive mechanism to motivate workers to do their best. Through experiments, we demonstrate that our method is effective and robust to low-quality workers: it outperforms Majority Voting (MV) and some commonly used algorithms.
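The abstract does not spell out the scheme itself, so the following is only a generic illustration of the underlying idea, not the authors' algorithm: estimate each worker's accuracy on golden items, then weight that worker's votes accordingly, and compare the result with plain majority voting. All names here (`estimate_accuracy`, `weighted_vote`, the toy workers and items) are hypothetical, the log-odds weighting assumes binary labels, and the incentive mechanism is not modeled.

```python
import math
from collections import Counter, defaultdict


def estimate_accuracy(worker_labels, golden, smoothing=1.0):
    """Estimate each worker's accuracy on golden items (Laplace-smoothed)."""
    accuracy = {}
    for worker, labels in worker_labels.items():
        scored = [(item, lab) for item, lab in labels.items() if item in golden]
        correct = sum(1 for item, lab in scored if golden[item] == lab)
        # Smoothing keeps the estimate strictly between 0 and 1.
        accuracy[worker] = (correct + smoothing) / (len(scored) + 2 * smoothing)
    return accuracy


def majority_vote(worker_labels, item):
    """Baseline: every worker's vote counts equally."""
    votes = Counter(
        labels[item] for labels in worker_labels.values() if item in labels
    )
    return votes.most_common(1)[0][0]


def weighted_vote(worker_labels, accuracy, item):
    """Weight each vote by the log-odds of the worker's estimated accuracy."""
    scores = defaultdict(float)
    for worker, labels in worker_labels.items():
        if item in labels:
            p = accuracy[worker]
            scores[labels[item]] += math.log(p / (1 - p))
    return max(scores, key=scores.get)


# Toy data (assumed): g1..g3 are golden items, x is an item without a known label.
golden = {"g1": 1, "g2": 0, "g3": 1}
worker_labels = {
    "A": {"g1": 1, "g2": 0, "g3": 1, "x": 1},  # agrees with all golden items
    "B": {"g1": 0, "g2": 1, "g3": 1, "x": 0},  # agrees with one golden item
    "C": {"g1": 0, "g2": 0, "g3": 0, "x": 0},  # agrees with one golden item
}

accuracy = estimate_accuracy(worker_labels, golden)
mv_label = majority_vote(worker_labels, "x")
weighted_label = weighted_vote(worker_labels, accuracy, "x")
```

In this toy setup the two unreliable workers outvote the reliable one under majority voting, while the golden-data weights let the reliable worker's label prevail. A worker whose estimated accuracy falls below 0.5 receives a negative weight, which is one possible design choice; clipping such weights to zero would be another.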

Original language: English
Title of host publication: Proceedings of 2018 12th IEEE International Conference on Anti-Counterfeiting, Security, and Identification, ASID 2018
Publisher: IEEE Computer Society
Pages: 10-15
Number of pages: 6
ISBN (Electronic): 9781538660638
DOIs
State: Published - 2 Jul 2018
Event: 12th IEEE International Conference on Anti-Counterfeiting, Security, and Identification, ASID 2018 - Xiamen, China
Duration: 9 Nov 2018 to 11 Nov 2018

Publication series

Name: Proceedings of the International Conference on Anti-Counterfeiting, Security and Identification, ASID
Volume: 2018-November
ISSN (Print): 2163-5048
ISSN (Electronic): 2163-5056

Conference

Conference: 12th IEEE International Conference on Anti-Counterfeiting, Security, and Identification, ASID 2018
Country/Territory: China
City: Xiamen
Period: 9/11/18 to 11/11/18

Keywords

  • crowdsourcing
  • golden data
  • ground truth inference
  • incentive mechanism
  • label quality
  • majority voting
