TY - JOUR
T1 - Data Level Privacy Preserving
T2 - A Stochastic Perturbation Approach Based on Differential Privacy
AU - Ma, Chuan
AU - Yuan, Long
AU - Han, Li
AU - Ding, Ming
AU - Bhaskar, Raghav
AU - Li, Jun
N1 - Publisher Copyright:
© 1989-2012 IEEE.
PY - 2023/4/1
Y1 - 2023/4/1
N2 - With the great amount of available data, especially collecting from the ubiquitous Internet of Things (IoT), the issue of privacy leakage arises increasingly concerns recently. To preserve the privacy of IoT datasets, traditional methods usually calibrate random noises on the data values to achieve differential privacy (DP). However, the amount of the calibrating noises should be carefully designed and a heedless value will definitely degrade the availability of datasets. Thus, in this work, we propose a stochastic perturbation method to sanitize the dataset, where the perturbation is obtained from the rest samples in the same dataset. In addition, we derive the expression of the utility level based on its unique framework and prove that the proposed algorithm can achieve the ϵ-DP. To show the effectiveness of the proposed algorithm, we conduct extensive experiments on real-life datasets by various functions, such as query answers and machine learning tasks. By comparing with the state-of-the-art methods, our proposed algorithm can achieve a better performance under the same privacy level.
AB - With the great amount of available data, especially collecting from the ubiquitous Internet of Things (IoT), the issue of privacy leakage arises increasingly concerns recently. To preserve the privacy of IoT datasets, traditional methods usually calibrate random noises on the data values to achieve differential privacy (DP). However, the amount of the calibrating noises should be carefully designed and a heedless value will definitely degrade the availability of datasets. Thus, in this work, we propose a stochastic perturbation method to sanitize the dataset, where the perturbation is obtained from the rest samples in the same dataset. In addition, we derive the expression of the utility level based on its unique framework and prove that the proposed algorithm can achieve the ϵ-DP. To show the effectiveness of the proposed algorithm, we conduct extensive experiments on real-life datasets by various functions, such as query answers and machine learning tasks. By comparing with the state-of-the-art methods, our proposed algorithm can achieve a better performance under the same privacy level.
KW - Differential privacy
KW - stochastic perturbation
KW - tabular dataset
UR - https://www.scopus.com/pages/publications/85122060491
U2 - 10.1109/TKDE.2021.3137047
DO - 10.1109/TKDE.2021.3137047
M3 - 文章
AN - SCOPUS:85122060491
SN - 1041-4347
VL - 35
SP - 3619
EP - 3631
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 4
ER -