Title:
|
EXTRACTION OF STRUCTURAL ELEMENTS
OF INVENTIONS FROM RUSSIAN-LANGUAGE PATENTS |
Author(s):
|
Dmitriy M. Korobkin, Sergey S. Vasiliev, Sergey A. Fomenkov and Vladimir I. Lobeyko |
ISBN:
|
978-989-8533-92-0 |
Editors:
|
Ajith P. Abraham and Jörg Roth |
Year:
|
2019 |
Edition:
|
Single |
Keywords:
|
Patents, Information Extraction, SAO, Claims, CAI-Systems, Shallow Parsing |
Type:
|
Full Paper |
First Page:
|
159 |
Last Page:
|
166 |
Language:
|
English |
Cover:
|
|
Full Contents:
|
click to dowload
|
Paper Abstract:
|
The paper investigates the issue of extracting structured data from Russian-language patents in the field of new technical
solution synthesis. The predicate-argument constructions that characterize the composition of the structural elements of
the inventions and the relations between them are the extracted entities. The paper also analyses existing natural language
processing tools as applied to patent processing. The authors have offered a new method for extracting
predicate-argument constructs taking into account the specificity of the text of patents based on shallow parsing and
segmentation of sentences. The value of the F1 metric with a rigorous and lax evaluation of data extraction is 63% and
79%, respectively, which suggests that the proposed approach is promising. The extracted structures are used to construct
the graph of the structural elements of the invention. |
|
|
|
|