Title:
|
APPLIED COSINE SIMILARITY ALGORITHM IN POLITICS: THE CASE OF MACEDONIAN PARLIAMENT |
Author(s):
|
Visar Shehu, Nuhi Besimi, Adrian Besimi |
ISBN:
|
978-989-8704-03-0 |
Editors:
|
Piet Kommers and Pedro Isaías |
Year:
|
2014 |
Edition:
|
Single |
Keywords:
|
Text mining, cosine similarity, data extraction, scraping, government, open data. |
Type:
|
Full Paper |
First Page:
|
170 |
Last Page:
|
176 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
This paper presents the application of the cosine similarity algorithm as a basis for grouping similar political representatives speeches in the Macedonian Parliament. In this paper we present both techniques related to information retrieval (data extraction, scraping and organization) from the official website of the Macedonian Parliament, as well as application of text mining algorithms with the purpose of extracting relevant and previously hidden information from the large text corpus. The paper also describes statistical approaches undertaken to transform speeches from their textual representation into quantitative representation. |
|
|
|
|