Title:
|
EXTENDED TREE-PATTERN CLUSTERING TECHNIQUES FOR MASSIVE XML STORAGES |
Author(s):
|
Weiyi Ho , Li Bai |
ISBN:
|
ISSN: 1646-3692 |
Editors:
|
Pedro IsaĆas and Marcin Paprzycki |
Year:
|
2006 |
Edition:
|
V I, 1 |
Keywords:
|
XML, Tree Pattern, Path Expression. |
Type:
|
Journal Paper |
First Page:
|
15 |
Last Page:
|
29 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
The Extensible Mark-up Language (XML) is an emerging standard for describing data on the Web. As the widespread activities of the Internet and the Web results in vast amounts of data to be generated everyday, the manipulation of such semi-structured textual data is however becoming an important issue in XML storage research. The unique feature of semi-structured data is generally suitable for storage in a tree-like data form. Locating data in this form is based on tree-pattern matching techniques. As a result, effectively evaluating path expression is the key to provide efficient access to such tree-like data storage. In this paper, we apply two novel signature based access methods, which can significantly extend the scope of tree-pattern cluster in order to navigate massive XML databases. We present the process of producing the signatures in details, and further provide the algorithms to demonstrate how they would work. We also show the advantages of using extended tree-pattern clustering techniques in handling large amounts of XML documents. |
|
|
|
|