Research on data quality and data cleaning: A survey

  • Zhi Mao Guo*
  • Ao Ying Zhou

  *Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

91 Scopus citations

Abstract

This paper surveys data quality, with an emphasis on data cleaning. It describes the importance of data quality and the metrics used to measure it, defines and classifies data cleaning problems, and details approaches to solving data quality problems. It also reviews how techniques from other research areas can be combined with data cleaning, introduces several previously proposed data cleaning frameworks, and discusses future research topics related to data cleaning.

Original language: English
Pages (from-to): 2076-2082
Number of pages: 7
Journal: Ruan Jian Xue Bao/Journal of Software
Volume: 13
Issue number: 11
State: Published - Nov 2002
Externally published: Yes

Keywords

  • Data cleaning
  • Data cleaning framework
  • Data integration
  • Data quality
  • Duplicate record
