Digital Library

cab1

 
Title:      ORDINARY WEB PAGES AS A SOURCE FOR METADATA ACQUISITION FOR OPEN CORPUS USER MODELING
Author(s):      Michal Barla, Mária Bieliková
ISBN:      978-972-8939-25-0
Editors:      Bebo White, Pedro Isaías and Diana Andone
Year:      2010
Edition:      Single
Keywords:      Keyword extraction, user modeling, proxy
Type:      Full Paper
First Page:      227
Last Page:      233
Language:      English
Cover:      cover          
Full Contents:      click to dowload Download
Paper Abstract:      Personalization and adaptivity of the Web as we know of today is often “closed” within a particular web-based system. As a result there are only a few “personalized islands” within the whole Web. Spreading the personalization to the whole Web either via an enhanced proxy server or using an agent residing on a client-side brings a challenge how to determine metadata within an open corpus Web domain, which would allow for an efficient creation of overlayed user model. In this paper we present our approach to metadata acquisition for open corpus user modeling applicable on the “wild” Web, where we decided to take into account metadata in the form of keywords representing the visited web pages. We present the user modeling process (which is thus keyword-based) built on the top of an enhanced proxy server, capable of personalizing user browsing sessions via pluggable modules. The paper focuses on comparison of algorithms and third-party services which allow for extraction of required keywords from ordinary web pages, which is a crucial step of our user modeling approach.
   

Social Media Links

Search

Login