INFORMATION EXTRACTION ON E-MAIL TEXTS FOR PERSONAL INFORMATION MANAGEMENT DOMAIN

Home

Document Info

Title:	INFORMATION EXTRACTION ON E-MAIL TEXTS FOR PERSONAL INFORMATION MANAGEMENT DOMAIN
Author(s):	Kyungkoo Min , Hanmin Jung , Jungyun Seo
ISBN:	972-99353-0-0
Editors:	Pedro Isaías and Nitya Karmakar
Year:	2004
Edition:	2
Keywords:	Information Extraction, Lexico-Semantic Pattern, Personal Information Management, E-mail Texts.
Type:	Poster/Demonstration
First Page:	1247
Last Page:	1248
Language:	English
Cover:
Full Contents:	click to dowload
Paper Abstract:	Information extraction on free texts is a difficult application due to frequently omitted or colloquial words and phrases with various ambiguities. We introduce a three-leveled information extraction architecture, which consists of instance extraction, filtering, and ranking rules. The extraction rules find instance candidates using named entities and context-independent lexico-semantic patterns. With context-dependent patterns and slot names produced from the previous step, the filtering rules remove improper candidates, and the ranking rules score the remaining instances. Finally, top-ranked instances of each slot are assigned to multi-targets. Experimental result shows 93.6 F-measure on e-mail texts that have three targets (header, address, and schedule) for personal information management domain.

	Go Back