Title:
|
IDQMS: AN INTELLIGENT DATA QUALITY MANAGEMENT SYSTEM TOOL |
Author(s):
|
Aïcha Ben Salem and Faouzi Boufares |
ISBN:
|
978-989-8533-95-1 |
Editors:
|
Hans Weghorn |
Year:
|
2019 |
Edition:
|
Single |
Keywords:
|
Data Quality, Semantic Data Categorization, Semantic Dependencies, Data Cleaning |
Type:
|
Full Paper |
First Page:
|
3 |
Last Page:
|
10 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
Today, the quantity of data continues to increase; furthermore, the data are distributed and heterogeneous, from multiple
sources (structured, semi-structured and unstructured) and with different levels of quality. Therefore, it is very likely to
manipulate data without knowledge about their structures and their semantics. The subject covered in this paper aims at
assisting the user in its quality approach. The data must be related to its semantic meaning, data types, constraints,
comments and origin. We deal with the semantic schema recognition of a data source. It consists of categorizing the data
by assigning it to a category and possibly a sub-category, and secondly, of establishing relations between columns and
possibly discovering the semantics of the manipulated data source. These links detected between columns offer a better
understanding of the source and the alternatives for correcting data. |
|
|
|
|