Title:
|
EMOTION ANALYSIS USING SELF-TRAINING
ON MALAYSIAN CODE-MIXED TWITTER DATA |
Author(s):
|
Kathleen Swee Neo Tan, Tong Ming Lim and Yee Mei Lim |
ISBN:
|
978-989-8704-19-1 |
Editors:
|
Piet Kommers and Guo Chao Peng |
Year:
|
2020 |
Edition:
|
Single |
Keywords:
|
Emotion Analysis, Code-Mixed, Twitter, Self-Training, Sentiment Analysis |
Type:
|
Full |
First Page:
|
181 |
Last Page:
|
188 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
Microblogs such as Twitter has made available a vast resource of User Generated Content (UGC) on which emotion
analysis may be performed. Organizations increasingly value the opinions obtained from emotion analysis. These insights
help to drive decision-making activities and provide constructive inputs to engage their customers and services. In a
multiracial country such as Malaysia, it is common to find that tweets are written in mixed languages of Malay,
Malaysian slang and English. These tweets increase the complexity of the emotion analysis task, especially considering
that there is a serious lack of labeled data available in order to make use of supervised learning techniques. This paper
explores the use of self-training, a semi-supervised technique that only requires a small initial labeled dataset to conduct
emotion analysis of Malaysian code-mixed Twitter data. The results are promising as the accuracy achieved is higher
compared to the baseline models. |
|
|
|
|