|
Title:
|
TOWARDS AN ERROR-FREE STEMMING |
|
Author(s):
|
Eiman Tamah Al-shammari |
|
ISBN:
|
978-972-8924-63-8 |
|
Editors:
|
Hans Weghorn and Ajith P. Abraham |
|
Year:
|
2008 |
|
Edition:
|
Single |
|
Keywords:
|
Pre-processing, Stemming, Algorithm, English |
|
Type:
|
Reflection Paper |
|
First Page:
|
160 |
|
Last Page:
|
163 |
|
Language:
|
English |
|
Paper Abstract:
|
Stemming is a fundamental pre-processing step for textual data that precedes information retrieval, text mining, and
natural language processing tasks. The common goal of stemming is to standardize words by reducing each word to its base form.
However, simply removing a word's suffix can cause stemming errors such as under-stemming or over-stemming.
More sophisticated stemmers tend to stem documents only weakly while relying on computationally expensive approaches such as
dictionary lookup. This paper presents a new dictionary-free, simple, and highly effective stemmer that reduces stemming
errors while also decreasing computational time and data storage. |
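
The two error types named in the abstract can be illustrated with a minimal sketch. This is not the stemmer proposed in the paper; the suffix list, the `naive_stem` function, and the `min_stem_len` parameter are hypothetical and chosen only to show how plain suffix removal goes wrong.

```python
# Hypothetical suffix list for illustration only; not taken from the paper.
SUFFIXES = ("ing", "ity", "al", "es", "ed", "e", "s")


def naive_stem(word: str, min_stem_len: int = 3) -> str:
    """Strip the first listed suffix that leaves at least min_stem_len characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem_len:
            return word[: -len(suffix)]
    return word


# Over-stemming: words with different meanings collapse to one stem.
for w in ("universe", "university", "universal"):
    print(w, "->", naive_stem(w))  # each prints "univers"

# Under-stemming: related forms of one word end up with different stems.
for w in ("alumnus", "alumni"):
    print(w, "->", naive_stem(w))  # "alumnu" and "alumni"
```

In this sketch, over-stemming merges unrelated words into a single index term, while under-stemming keeps related forms apart, which are the two failure modes the abstract attributes to simple suffix removal.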