Title:
|
CREATING A POLITE, ADAPTIVE AND SELECTIVE INCREMENTAL CRAWLER |
Author(s):
|
Christos Bouras , Vassilis Poulopoulos , Athena Thanou |
ISBN:
|
972-8924-02-X |
Editors:
|
Pedro Isaías and Miguel Baptista Nunes |
Year:
|
2005 |
Edition:
|
1 |
Keywords:
|
Crawler, Incremental and Adaptive Crawler, Data Mining, Crawling policies. |
Type:
|
Full Paper |
First Page:
|
307 |
Last Page:
|
314 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
The expansion of the World Wide Web has led to a chaotic state where the users of the internet have to face and overcome the major problem of discovering information. For the solution of this problem, many mechanisms were created based on crawlers who are browsing the www and downloading pages. In this paper we will describe a crawling mechanism which is created in order to support data mining and processing systems and to obtain a history of the webs content. A crawler has to be efficient and polite, trying not to harm or overload the pages it is visiting. Therefore, it is extremely important to follow specific rules when crawling. In addition to these rules, the mechanism we created includes a selective incremental algorithm, which is used to make the crawler more efficient and more polite in parallel. The structure and design of the mechanism is simple, but the experimental results showed us that this simplicity makes our crawler a very strong and stable mechanism. |
|
|
|
|