Digital Library

cab1

 
Title:      INFORMATION EXTRACTION ON E-MAIL TEXTS FOR PERSONAL INFORMATION MANAGEMENT DOMAIN
Author(s):      Kyungkoo Min , Hanmin Jung , Jungyun Seo
ISBN:      972-99353-0-0
Editors:      Pedro IsaĆ­as and Nitya Karmakar
Year:      2004
Edition:      2
Keywords:      Information Extraction, Lexico-Semantic Pattern, Personal Information Management, E-mail Texts.
Type:      Poster/Demonstration
First Page:      1247
Last Page:      1248
Language:      English
Cover:      cover          
Full Contents:      click to dowload Download
Paper Abstract:      Information extraction on free texts is a difficult application due to frequently omitted or colloquial words and phrases with various ambiguities. We introduce a three-leveled information extraction architecture, which consists of instance extraction, filtering, and ranking rules. The extraction rules find instance candidates using named entities and context-independent lexico-semantic patterns. With context-dependent patterns and slot names produced from the previous step, the filtering rules remove improper candidates, and the ranking rules score the remaining instances. Finally, top-ranked instances of each slot are assigned to multi-targets. Experimental result shows 93.6 F-measure on e-mail texts that have three targets (header, address, and schedule) for personal information management domain.
   

Social Media Links

Search

Login