Title: MINING POSTAL ADDRESSES
Author(s): José Carlos Cortizo Pérez, José María Gómez Hidalgo, Yaiza Temprado, Diego Martín, Federico Rodríguez
ISBN: 978-972-8924-63-8
Editors: Hans Weghorn and Ajith P. Abraham
Year: 2008
Edition: Single
Keywords: Text mining, Approximate Information Retrieval, Record linkage
Type: Short Paper
First Page: 67
Last Page: 72
Language: English
Paper Abstract:
This paper presents FuMaS (Fuzzy Matching System), a system for efficiently retrieving postal addresses from noisy queries. Fuzzy postal address retrieval has many possible applications, ranging from data warehouse deduplication to the correction of input forms and integration with online street directories.
The paper describes the system architecture along with a series of experiments performed using FuMaS. The experimental results show that FuMaS is effective on noisy postal addresses, retrieving almost 85% of them. This is a 15% improvement over the other systems tested in this set of experiments.