Abstract
Uncertain graph is an important data model for many real-world applications. To answer the query on the uncertain graphs, the edges in these graphs are associated with existential probabilities that represent the likelihood of the existence of the edge. Almost all works on this area focus on how to promote the efficiency of the query processing. However, another issue should be notable, that is, the query results from the uncertain graphs are sometimes uninformative due to the edge uncertainty. We adopt a crowdsourcing-based approach to make the query results more informative. To save the monetary and time cost of crowdsourcing, we should select the optimal edges to clean to maximize the quality improvement. However, the noise of the crowdsourcing results will make the problem more complex. We prove that the problem is #P-hard and propose an efficient algorithm to derive the optimal edge. Our experimental results show that our proposed algorithm outperforms random-selection up to 22 times in quality improvement and each-edge-comparison way up to 5 times fast in elapsed time, which proves this algorithm is both effective and efficient.
| Original language | English |
|---|---|
| Pages (from-to) | 1523-1553 |
| Number of pages | 31 |
| Journal | World Wide Web |
| Volume | 22 |
| Issue number | 4 |
| DOIs | |
| State | Published - 15 Jul 2019 |
Keywords
- Graph cleaning
- Noisy crowdsourcing
- Reachability computation
- Social network
- Uncertain graph