TY - JOUR
T1 - EPPQ
T2 - Efficient and Privacy-Preserving kNN Query Processing for Outsourced High-Dimensional Data
AU - Wang, Jing
AU - Bao, Haiyong
AU - Lu, Rongxing
AU - Huang, Cheng
AU - Guan, Menghong
AU - Xing, Lu
N1 - Publisher Copyright:
© 2008-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Extensive schemes have been conducted on the development of efficient and privacy-preserving kNN query algorithms in data outsourcing scenarios. However, existing researches primarily address low-dimensional data, posing scalability challenges in higher dimensions. To tackle this issue, we propose an efficient and privacy-preserving kNN query scheme for outsourced high-dimensional data (EPPQ), emphasizing the complete lifecycle from secure dimensionality reduction of high-dimensional data to secure kNN query on the reduced-dimensional data. Specifically, in the secure dimensionality reduction phase: on the one hand, EPPQ integrates principal component analysis (PCA) for dimensionality reduction to minimize computational overhead. On the other hand, to address privacy concerns during the process of PCA, by incorporating differential privacy (DP), we propose the PrivacyPreserving Data Dimensionality Reduction Algorithm based on PCA (PDDRP). In the secure kNN query phase:for one thing, EPPQ facilitates the index of the reduced-dimensional data by k-d tree. To enhance index efficiency, we innovatively propose plaintexts-based distance calculation definitions (PDC definitions) and construct an efficient variant of k-d tree (Ek-d tree), for the first time. For another, the Paillier homomorphic encryption (PHE) technique is leveraged to safeguard privacy when outsourcing Ek-d tree to untrusted cloud servers. Additionally, for ciphertexts-based distance calculations and comparisons, we design the Secure Precomputed Distance protocol (SPCD) and Secure Comparison protocol (SCOM). Finally, we creatively present the Privacy-Preserving kNN Query Algorithm based on Ek-d tree (PKQKT) for efficient and secure kNN query. Comprehensive security analysis demonstrates that the EPPQ scheme meets the required security properties under the honest-but-curious model. Extensive experiments confirms that EPPQ achieves high computational efficiency and query accuracy.
AB - Extensive schemes have been conducted on the development of efficient and privacy-preserving kNN query algorithms in data outsourcing scenarios. However, existing researches primarily address low-dimensional data, posing scalability challenges in higher dimensions. To tackle this issue, we propose an efficient and privacy-preserving kNN query scheme for outsourced high-dimensional data (EPPQ), emphasizing the complete lifecycle from secure dimensionality reduction of high-dimensional data to secure kNN query on the reduced-dimensional data. Specifically, in the secure dimensionality reduction phase: on the one hand, EPPQ integrates principal component analysis (PCA) for dimensionality reduction to minimize computational overhead. On the other hand, to address privacy concerns during the process of PCA, by incorporating differential privacy (DP), we propose the PrivacyPreserving Data Dimensionality Reduction Algorithm based on PCA (PDDRP). In the secure kNN query phase:for one thing, EPPQ facilitates the index of the reduced-dimensional data by k-d tree. To enhance index efficiency, we innovatively propose plaintexts-based distance calculation definitions (PDC definitions) and construct an efficient variant of k-d tree (Ek-d tree), for the first time. For another, the Paillier homomorphic encryption (PHE) technique is leveraged to safeguard privacy when outsourcing Ek-d tree to untrusted cloud servers. Additionally, for ciphertexts-based distance calculations and comparisons, we design the Secure Precomputed Distance protocol (SPCD) and Secure Comparison protocol (SCOM). Finally, we creatively present the Privacy-Preserving kNN Query Algorithm based on Ek-d tree (PKQKT) for efficient and secure kNN query. Comprehensive security analysis demonstrates that the EPPQ scheme meets the required security properties under the honest-but-curious model. Extensive experiments confirms that EPPQ achieves high computational efficiency and query accuracy.
KW - PCA
KW - differential privacy
KW - high-dimension
KW - k-d tree
KW - kNN
KW - privacy preservation
UR - https://www.scopus.com/pages/publications/105013769341
U2 - 10.1109/TSC.2025.3600124
DO - 10.1109/TSC.2025.3600124
M3 - 文章
AN - SCOPUS:105013769341
SN - 1939-1374
VL - 18
SP - 2851
EP - 2864
JO - IEEE Transactions on Services Computing
JF - IEEE Transactions on Services Computing
IS - 5
ER -