Cleaning uncertain graphs via noisy crowdsourcing

  • Yongcheng Wu
  • , Xin Lin
  • , Yan Yang*
  • , Liang He
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Uncertain graph is an important data model for many real-world applications. To answer the query on the uncertain graphs, the edges in these graphs are associated with existential probabilities that represent the likelihood of the existence of the edge. Almost all works on this area focus on how to promote the efficiency of the query processing. However, another issue should be notable, that is, the query results from the uncertain graphs are sometimes uninformative due to the edge uncertainty. We adopt a crowdsourcing-based approach to make the query results more informative. To save the monetary and time cost of crowdsourcing, we should select the optimal edges to clean to maximize the quality improvement. However, the noise of the crowdsourcing results will make the problem more complex. We prove that the problem is #P-hard and propose an efficient algorithm to derive the optimal edge. Our experimental results show that our proposed algorithm outperforms random-selection up to 22 times in quality improvement and each-edge-comparison way up to 5 times fast in elapsed time, which proves this algorithm is both effective and efficient.

Original languageEnglish
Pages (from-to)1523-1553
Number of pages31
JournalWorld Wide Web
Volume22
Issue number4
DOIs
StatePublished - 15 Jul 2019

Keywords

  • Graph cleaning
  • Noisy crowdsourcing
  • Reachability computation
  • Social network
  • Uncertain graph

Fingerprint

Dive into the research topics of 'Cleaning uncertain graphs via noisy crowdsourcing'. Together they form a unique fingerprint.

Cite this