Title:
|
EXTRACTING DOMAIN-INDEPENDENT INFORMATION FROM THE WEB FOR PORTUGUESE USING REVERB |
Author(s):
|
Julio Cesar Batista Pires, Cedric Luiz de Carvalho |
ISBN:
|
978-989-8533-57-9 |
Editors:
|
Pedro Isaías |
Year:
|
2016 |
Edition:
|
Single |
Keywords:
|
IE, Open IE, Semantic Web, Information Extraction, Natural Language Processing |
Type:
|
Full Paper |
First Page:
|
103 |
Last Page:
|
110 |
Language:
|
English |
Paper Abstract:
|
People are increasingly connected to the web and search it for all kinds of things. Because the web is a huge source of information, they can find almost anything they want. However, web information is poorly organized and has little formal structure, which hampers machine processing and, in turn, makes information harder to access. Bringing structure to the web is therefore key to facilitating user search and navigation. A recent technique, Open Information Extraction, has been successfully applied to extracting structured information from the web, but mostly for web pages in English. This paper presents work focused on information extraction for Portuguese. |