Title: SOURCE IDENTIFICATION AND QUERY REWRITING IN OPEN XML DATA INTEGRATION SYSTEMS
Author(s): Francois Boisson, Michel Scholl, Imen Sebei, Dan Vodislav
ISSN: 1645-7641
Editors: Pedro Isaías
Year: 2007
Edition: V V, 1
Keywords: XML, heterogeneous data integration, ontology, query rewriting, source identification
Type: Journal Paper
First Page: 29
Last Page: 44
Language: English
Paper Abstract:
This paper presents OpenXView, a model for open, large-scale XML data integration systems,
characterized by the autonomy of users who publish XML data on a common topic. Autonomy implies
frequent and unpredictable changes to data and a high degree of structural heterogeneity. OpenXView
provides an original integration schema, based on a hybrid ontology/XML schema structure model.
We propose solutions for several important problems in such systems: easy access to data through a
simple query language over the common schema, simple management of data integration views when data
changes, and scalable query rewriting algorithms. This paper focuses on source identification for query
rewriting in OpenXView, i.e., the computation of combinations of sources that can answer a user query. It
proposes two algorithms for computing minimal source combinations, both of which scale with the number
of sources. The first is based on a general branch-and-bound strategy; the second is very efficient but
limited to queries with no more than 8 attributes, a bound that is sufficient in most applications.
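
To make the notion of a minimal source combination concrete, the following is a minimal sketch in Python. It is not either of the two algorithms proposed in the paper (the abstract does not give their details); it is a plain exhaustive enumeration by increasing combination size. It assumes, for illustration only, that each source is described by the set of query attributes it can provide, and that a combination answers the query when the union of those sets covers every query attribute. The function name and the data layout are hypothetical.

```python
from itertools import combinations


def minimal_source_combinations(query_attrs, sources):
    """Return the minimal combinations of sources that jointly cover query_attrs.

    query_attrs: iterable of attribute names requested by the query.
    sources: dict mapping a source name to the set of attributes it provides
             (an assumed, simplified description of a source).
    """
    attrs = sorted(query_attrs)
    # Encode each query attribute as one bit, so coverage tests are bitwise ORs.
    bit = {a: 1 << i for i, a in enumerate(attrs)}
    full = (1 << len(attrs)) - 1

    masks = {
        name: sum(bit[a] for a in provided if a in bit)
        for name, provided in sources.items()
    }
    # Sources that provide no query attribute can never be part of a minimal cover.
    names = sorted(name for name, mask in masks.items() if mask)

    minimal = []
    # A minimal cover needs at most one source per attribute.
    for size in range(1, len(attrs) + 1):
        for combo in combinations(names, size):
            cover = 0
            for name in combo:
                cover |= masks[name]
            if cover != full:
                continue
            # Smaller covers were found first, so a combination is minimal
            # exactly when it contains none of them.
            if any(set(prev) <= set(combo) for prev in minimal):
                continue
            minimal.append(combo)
    return minimal
```

With attributes encoded as bits, a query of at most 8 attributes fits in a single byte-sized mask, which hints at why a small attribute bound can make source identification cheap; whether the paper's second algorithm exploits this in the same way is an assumption.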