Title:
|
TOWARDS AN ERROR-FREE STEMMING |
Author(s):
|
Eiman Tamah Al-shammari |
ISBN:
|
978-972-8924-63-8 |
Editors:
|
Hans Weghorn and Ajith P. Abraham |
Year:
|
2008 |
Edition:
|
Single |
Keywords:
|
Pre-processing, Stemming, Algorithm, English |
Type:
|
Reflection Paper |
First Page:
|
160 |
Last Page:
|
163 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
Stemming is a fundamental step in processing textual data preceding the tasks of information retrieval, text mining, and
natural language processing. The common goal of stemming is to standardize words by reducing a word to its base.
However, simply removing the suffix of the word can cause stemming errors such as under-stemming or over-stemming.
Sophisticated stemmers tend to weakly stem documents with very computationally expensive approaches such as
dictionary lookup. This paper presents a new dictionary free, simple, highly effective stemmer that can reduce stemming
errors in addition to decreasing computational time and data storage. |
|
|
|
|