Leveraging attributes and crowdsourcing for join

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Join operation is usually hard to achieve high quality with machine alone. We adopt crowdsourcing to improve the quality of join. Depending on the number of generated pairs, the overall cost can be expensive for hiring workers to do the verification. We propose a hybrid approach to generate pairs by leveraging attributes, which combines category, sorting and clustering techniques, called CSCER. We also propose an adaptive attribute-selection strategy to efficiently generate pairs based on attributes. Experiments on a real crowdsourcing platform using real datasets indicate that our approaches save the overall cost compared to existing methods and achieve high quality of join results.

Original languageEnglish
Title of host publicationWeb-Age Information Management - 15th International Conference, WAIM 2014, Proceedings
PublisherSpringer Verlag
Pages448-452
Number of pages5
ISBN (Print)9783319080093
DOIs
StatePublished - 2014
Externally publishedYes
Event15th International Conference on Web-Age Information Management, WAIM 2014 - Macau, China
Duration: 16 Jun 201418 Jun 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8485 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th International Conference on Web-Age Information Management, WAIM 2014
Country/TerritoryChina
CityMacau
Period16/06/1418/06/14

Fingerprint

Dive into the research topics of 'Leveraging attributes and crowdsourcing for join'. Together they form a unique fingerprint.

Cite this