Digital Library

cab1

 
Title:      EVALUATION OF NAMED ENTITY RECOGNITION FOR THE GERMAN E-COMMERCE DOMAIN
Author(s):      Sergej Denisov and Frederik S. Bäumer
ISBN:      978-989-8704-34-4
Editors:      Pedro Isaías and Hans Weghorn
Year:      2021
Edition:      Single
Type:      Full
First Page:      226
Last Page:      230
Language:      English
Cover:      cover          
Full Contents:      click to dowload Download
Paper Abstract:      Large online marketplaces offer search engines as an important navigation aid for their customers to navigate through the enormous number of different products and merchants. The quality of the search results depends to a large extent on the product information provided by the retailers. One way to improve search quality is to perform linguistic enhancement of the product data. For this, we use Named Entity Recognition to identify specific e-commerce entity types and give them higher weighting in the search. Typical entity types for the e-commerce domain are products, brands, and various product attributes. Recognition of e-commerce entity types for the German language remains a challenge due to the limited availability of existing resources and linguistic complexity. We address this challenge by acquiring data from two online e-commerce marketplaces to create six NER datasets based on German product titles and descriptions. Across these datasets, we evaluate the NER performance of the state-of-the-art models mBERT, GermanBERT, and XLM-RoBERTa. As a result, the XLM-RoBERTa model archived the best performance with an F1 score of 0.8611 averaged over all datasets.
   

Social Media Links

Search

Login