Title:
|
SIGNIFICANCE OF HTML TAGS FOR DOCUMENT INDEXING AND RETRIEVAL |
Author(s):
|
Byurhan Hyusein , Ahmed Patel |
ISBN:
|
972-98947-1-X |
Editors:
|
Pedro IsaĆas and Nitya Karmakar |
Year:
|
2003 |
Edition:
|
2 |
Keywords:
|
Information retrieval, term selection, tags, search engines, HTML. |
Type:
|
Short Paper |
First Page:
|
817 |
Last Page:
|
820 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
Indexing quality has an overwhelming effect on retrieval effectiveness of search engines. In the past few years it has become one of the major challenges in the search engines area, particularly the task of automatically assigning high-quality terms to Web documents, which remains elusive. High indexing and retrieval quality requires work on term selection algorithms. This paper investigates the feasibility of HTML tags to represent the contents of Web documents. Experiments were performed on the WT10g collection of a 1.69-million page corpus using many different combinations of term selection and information retrieval algorithms. |
|
|
|
|