An extensible system for data cleaning

R. H. Yu, Z. M. Guo, Z. P. Tian, A. Y. Zhou

Research output: Contribution to journal > Conference article > peer-review

Abstract

Organizational data is confronted with all kinds of data quality problems. The process of data cleaning therefore becomes crucial because of the "garbage in, garbage out" principle. However, it is not trivial to make the data cleaning process flexible enough. In this paper, we present an open and extensible framework for data cleaning. It gains its extensibility by employing innovative features such as a term model, a processing description file, and a rule&Dic base. A visual GUI environment is implemented, and workflow capability is provided in this system.

Original language: English
Pages (from-to): 189-192
Number of pages: 4
Journal: Journal of Shanghai University
Volume: 5
Issue number: SUPPL. SEPT.
State: Published - Sep 2001
Externally published: Yes
Event: 2nd International Conference on Computer and Information Technology (CIT'2001) - Shanghai, China
Duration: 12 Sep 2001 to 15 Sep 2001

Keywords

  • Data cleaning
  • Data preparation
  • Term model
