Title:
|
FRACTAL DIMENSION OF TEXT DOCUMENTS: APPLICATION IN FRACTAL SUMMARIZATION |
Author(s):
|
M. Dolores Ruiz, Antonio B. Bailón |
ISBN:
|
972-8924-19-4 |
Editors:
|
Pedro Isaías, Miguel Baptista Nunes and Inmaculada J. Martínez |
Year:
|
2005 |
Edition:
|
V II, 2 |
Type:
|
Short Paper |
First Page:
|
349 |
Last Page:
|
353 |
Language:
|
English |
Paper Abstract:
|
The calculation of dimensions is a useful tool for quantifying the structural information of artificial and natural objects. There are several types of dimension [9], among them the Euclidean dimension and the Hausdorff-Besicovitch dimension. We work with the fractal dimension in the special case of text documents. There are many objects whose fractal dimension cannot be determined analytically, but estimators exist for those cases. We review some of them and choose the one best suited to our purpose: calculating the fractal dimension of text documents. Every day we search for new information on the web, and we find many documents whose pages contain a great amount of information. There is a strong demand for fast and precise automatic summarization. Many methods have been used in automatic extraction, but most of them do not take the hierarchical structure of the documents into account. A novel method that uses the structure of the document was introduced by Yang and Wang [15]. It is based on a fractal view method for controlling the information. It has some drawbacks, which we address with a new adaptation of the fractal view method. We also use the new concept of the fractal dimension of a text document to achieve better diversification of the extracted sentences. |