TY - JOUR
T1 - Continuous k-Regret Minimization Queries
T2 - A Dynamic Coreset Approach
AU - Zheng, Jiping
AU - Ma, Wei
AU - Wang, Yanhao
AU - Wang, Xiaoyang
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2023/6/1
Y1 - 2023/6/1
N2 - Finding a small set of representative tuples from a large database is an important functionality for supporting multi-criteria decision making. Top-k queries and skyline queries are two widely studied queries to fulfill this task. However, both of them have some limitations: a top-k query requires the user to provide her utility functions for finding the k tuples with the highest scores as the result; a skyline query does not need any user-specified utility function but cannot control the result size. To overcome their drawbacks, the k-regret minimization query was proposed and received much attention recently, since it does not require any user-specified utility function and returns a fixed-size result set. Specifically, it selects a set R of tuples with a pre-defined size r from a database D such that the maximum k-regret ratio, which captures how well the top-ranked tuple in R represents the top-k tuples in D for any possible utility function, is minimized. Although there have been many methods for k-regret minimization query processing, most of them are designed for static databases without tuple insertions and deletions. The only known algorithm to process continuous k-regret minimization queries (CkRMQ) in dynamic databases suffers from suboptimal approximation and high time complexity. In this paper, we propose a novel dynamic coreset-based approach, called DynCore, for CkRMQ processing. It achieves the same (asymptotically optimal) upper bound on the maximum k-regret ratio as the best-known static algorithm. Meanwhile, its time complexity is sublinear to the database size, which is significantly lower than that of the existing dynamic algorithm. The efficiency and effectiveness of DynCore is confirmed by experimental results on real-world and synthetic datasets.
AB - Finding a small set of representative tuples from a large database is an important functionality for supporting multi-criteria decision making. Top-k queries and skyline queries are two widely studied queries to fulfill this task. However, both of them have some limitations: a top-k query requires the user to provide her utility functions for finding the k tuples with the highest scores as the result; a skyline query does not need any user-specified utility function but cannot control the result size. To overcome their drawbacks, the k-regret minimization query was proposed and received much attention recently, since it does not require any user-specified utility function and returns a fixed-size result set. Specifically, it selects a set R of tuples with a pre-defined size r from a database D such that the maximum k-regret ratio, which captures how well the top-ranked tuple in R represents the top-k tuples in D for any possible utility function, is minimized. Although there have been many methods for k-regret minimization query processing, most of them are designed for static databases without tuple insertions and deletions. The only known algorithm to process continuous k-regret minimization queries (CkRMQ) in dynamic databases suffers from suboptimal approximation and high time complexity. In this paper, we propose a novel dynamic coreset-based approach, called DynCore, for CkRMQ processing. It achieves the same (asymptotically optimal) upper bound on the maximum k-regret ratio as the best-known static algorithm. Meanwhile, its time complexity is sublinear to the database size, which is significantly lower than that of the existing dynamic algorithm. The efficiency and effectiveness of DynCore is confirmed by experimental results on real-world and synthetic datasets.
KW - Continuous k-regret minimization query
KW - dynamic coreset
KW - nearest neighbor search
KW - δ-net
KW - ϵ-kernel
UR - https://www.scopus.com/pages/publications/85128607719
U2 - 10.1109/TKDE.2022.3166835
DO - 10.1109/TKDE.2022.3166835
M3 - 文章
AN - SCOPUS:85128607719
SN - 1041-4347
VL - 35
SP - 5680
EP - 5694
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 6
ER -