Title:
|
EXTENDING OPEN SOURCE SOFTWARE SOLUTIONS FOR CRM TEXT MINING |
Author(s):
|
Piotr Gawrysiak , Henryk Rybiński , Damian Gajda , Marcin Gołębski |
ISBN:
|
972-99353-0-0 |
Editors:
|
Pedro IsaĆas and Nitya Karmakar |
Year:
|
2004 |
Edition:
|
2 |
Keywords:
|
Text mining, open source, customer relationship management, document clustering. |
Type:
|
Short Paper |
First Page:
|
869 |
Last Page:
|
872 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
This paper is an overview of a Text Mining project carried out by the Warsaw University of Technology in the business analysis department of a Polish mass services company. This project has provided an excellent opportunity to test the usefulness of existing TM tools and algorithms for processing a large CRM database constituting a text corpus in Polish language. As these tools proved to be highly inadequate for the task, some new algorithms and methods have been devised, that would be specifically tailored to peculiarities of CRM data in general and Polish language in particular. Noteworthy, some of the solutions implemented were based on publicly available open source software, which was modified and extended by the project team, thus allowing one to evaluate the robustness of open source code for large scientific and business deployments. The paper describes briefly the Text Mining problem that had to be tackled and contains overview of the implemented solutions, together with information about obtained experimental results |
|
|
|
|