跳到主要导航 跳到搜索 跳到主要内容

Crowdsourced selection on multi-attribute data

  • Xueping Weng
  • , Guoliang Li
  • , Huiqi Hu
  • , Jianhua Feng

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Crowdsourced selection asks the crowd to select entities that satisfy a query condition, e.g., selecting the photos of people wearing sunglasses from a given set of photos. Existing studies focus on a single query predicate and in this paper we study the crowdsourced selection problem on multi-attribute data, e.g., selecting the female photos with dark eyes and wearing sunglasses. A straightforward method asks the crowd to answer every entity by checking every predicate in the query. Obviously, this method involves huge monetary cost. Instead, we can select an optimized predicate order and ask the crowd to answer the entities following the order. Since if an entity does not satisfy a predicate, we can prune this entity without needing to ask other predicates and thus this method can reduce the cost. There are two challenges in finding the optimized predicate order. The first is how to detect the predicate order and the second is to capture correlation among different predicates. To address this problem, we propose predicate order based framework to reduce monetary cost. Firstly, we define an expectation tree to store selectivities on predicates and estimate the best predicate order. In each iteration, we estimate the best predicate order from the expectation tree, and then choose a predicate as a question to ask the crowd. After getting the result of the current predicate, we choose next predicate to ask until we get the result. We will update the expectation tree using the answer obtained from the crowd and continue to the next iteration. We also study the problem of answering multiple queries simultaneously, and reduce its cost using the correlation between queries. Finally, we propose a confidence based method to improve the quality. The experiment result shows that our predicate order based algorithm is effective and can reduce cost significantly compared with baseline approaches.

源语言英语
主期刊名CIKM 2017 - Proceedings of the 2017 ACM Conference on Information and Knowledge Management
出版商Association for Computing Machinery
307-316
页数10
ISBN(电子版)9781450349185
DOI
出版状态已出版 - 6 11月 2017
已对外发布
活动26th ACM International Conference on Information and Knowledge Management, CIKM 2017 - Singapore, 新加坡
期限: 6 11月 201710 11月 2017

出版系列

姓名International Conference on Information and Knowledge Management, Proceedings
Part F131841

会议

会议26th ACM International Conference on Information and Knowledge Management, CIKM 2017
国家/地区新加坡
Singapore
时期6/11/1710/11/17

指纹

探究 'Crowdsourced selection on multi-attribute data' 的科研主题。它们共同构成独一无二的指纹。

引用此