Title:
|
A NOVEL SEMANTIC APPROACH TO DOCUMENT COLLECTIONS |
Author(s):
|
Andrea Addis , Manuela Angioni , Giuliano Armano , Roberto Demontis , Franco Tuveri , Eloisa Vargiu |
ISBN:
|
ISSN: 1646-3692 |
Editors:
|
Pedro IsaĆas and Marcin Paprzycki |
Year:
|
2009 |
Edition:
|
V IV,2 |
Keywords:
|
Text Categorization, Document Collections, Intelligent Software Systems, Machine Learning. |
Type:
|
Journal Paper |
First Page:
|
73 |
Last Page:
|
85 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
Available document collections are more and more required for supervised text categorization tasks.
They are typically collections of documents classified by domain engineers. In this paper, we propose a
semantic text categorization approach able to automatically create document collections in which
documents are classified according to WordNet Domains taxonomy. Experiments have been performed
by training a classifier with an automatic document collection and comparing results with those obtained by training the same classifier with a document collection classified by domain engineers. Experimental
results point out that, on average, the performances of the automatic approach are quite similar to those
obtained on a document collection classified by hand. |
|
|
|
|