Title:
|
LATENT DATA ATTRIBUTE RELATIONSHIPS AND DATA ERROR REPAIRS |
Author(s):
|
C. Sophie Lee |
ISBN:
|
978-972-8924-97-3 |
Editors:
|
Hans Weghorn and Pedro Isaías |
Year:
|
2009 |
Edition:
|
V I, 2 |
Keywords:
|
Data Quality, Error Propagation, Data Repairs |
Type:
|
Full Paper |
First Page:
|
93 |
Last Page:
|
100 |
Language:
|
English |
Paper Abstract:
|
Business database systems process a vast amount of data on a daily basis. A typical system consists of waterfall-like
tables, where upstream data is transformed into downstream data through a set of well-defined logic. Data errors can
quickly propagate to the downstream dataspace and degrade the data quality of the entire system. Finding and replacing
such data errors can be a tedious and error-prone process if not modeled carefully. This paper proposes following the
Data Error Identification Model to model data propagation behavior, and it makes two major contributions to the
literature. First, it proposes modeling latent data relationships on top of traditional entity-relationship data models:
attribute-level reference relationships are identified, which leads to more precise data error identification. Second, it
identifies methods to repair the child dataspace when an invalid parent dataspace is repaired through three correction types. |