Title:
|
EVALUATION OF AUTOMATIC LINKING STRATEGIES FOR WIKIPEDIA PAGES |
Author(s):
|
Michael Granitzer, Mario Zechner, Christin Seifert, Josef Kolbitsch, Peter Kemper, Ronald Int Velt |
ISBN:
|
978-972-8924-68-3 |
Editors:
|
Pedro Isaías, Miguel Baptista Nunes and Dirk Ifenthaler |
Year:
|
2008 |
Edition:
|
Single |
Keywords:
|
Retrieval, Link Generation, Evaluation, Wikipedia |
Type:
|
Full Paper |
First Page:
|
243 |
Last Page:
|
250 |
Language:
|
English |
Full Contents:
|
click to download
|
Paper Abstract:
|
Wikipedia contains an enormous amount of human knowledge. The wide range of covered topics is hierarchically
organized in categories and strongly inter-linked. Its structure, its size and the fact that it is generated by humans are the
reasons for the attention Wikipedia receives from researchers in different fields. The link structure of
Wikipedia in particular is of great importance, not only for humans browsing the collection but also as a resource for bootstrapping
machine intelligence and the Semantic Web.
Motivated by the fact that manual creation and maintenance of hyperlinks is labor-intensive, this paper explores
properties for automatic link creation between Wikipedia pages. Focusing on ad-hoc linking approaches, we
evaluate linking strategies at the word level as well as at the document level using a standard test data set. As shown,
rather simple approaches yield reliable results and may be applicable in different application scenarios.
Disambiguation strategies based on standard IR techniques help to boost accuracy, delivering reasonable results. |