Digital Library

cab1

 
Title:      FINDEX: EVALUATION OF TWO WEB SEARCH RESULT CATEGORIZATION ALGORITHMS
Author(s):      Mika Käki
ISBN:      ISSN: 1645-7641
Editors:      Pedro Isaías
Year:      2006
Edition:      V IV, 1
Keywords:      Web search, algorithm, categorization, clustering, information access, evaluation.
Type:      Journal Paper
First Page:      1
Last Page:      15
Language:      English
Cover:      no-img_eng.gif          
Full Contents:      click to dowload Download
Paper Abstract:      The vast amount of information in the web combined with the short queries submitted to the web search engines may cause the result ranking methods to fail. Result categorization methods have become popular in solving the problem. We have developed Findex search user interface and two simple algorithms for this purpose and we have showed their utility with experiments. However, the details of the algorithms have not been discussed. The algorithms are based on word frequencies in the result summaries (snippets) the web search engines typically return. After extracting the most frequent words and phrases the two algorithms use different ways to filter out uninteresting candidates and to merge the similar ones. We evaluated the algorithms heuristically and empirically. The results show that the algorithm utilizing the query term contexts produce more understandable category names, but the simpler statistical approach offers better coverage and performance. Both algorithms perform well under certain conditions and help users in accessing the search results.
   

Social Media Links

Search

Login