Title:
|
MINING RELAXED GRAPH PROPERTIES IN INTERNET |
Author(s):
|
Wilhelmiina Hämäläinen , Hannu Toivonen , Vladimir Poroshin |
ISBN:
|
972-99353-0-0 |
Editors:
|
Pedro Isaías and Nitya Karmakar |
Year:
|
2004 |
Edition:
|
1 |
Keywords:
|
graph, data mining, WWW. |
Type:
|
Full Paper |
First Page:
|
152 |
Last Page:
|
159 |
Language:
|
English |
Paper Abstract:
|
Many real-world datasets are represented as graphs. Classical graph properties found in the data, such as cliques or independent sets, can reveal new and interesting information. However, such properties can be either too rare or too trivial in a given context. By relaxing the criteria of the classical properties, we can find more, and totally new, patterns in the data. In this paper, we define relaxed graph properties and study their use in analyzing and processing graph-based data. In particular, we consider the problem of finding self-referring groups in the WWW, and give a general algorithm for mining all such patterns from a collection of WWW pages. We suggest that such self-referring groups can reveal web communities or other clustering in the WWW and can also facilitate compression of graph-structured data. |