Smart Context Generation for Disambiguation to Wikipedia
Autor: | Andrey Sysoev, Irina Nikishina |
---|---|
Rok vydání: | 2018 |
Předmět: |
Russian language
Machine translation business.industry Computer science InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL Context (language use) 02 engineering and technology computer.software_genre Task (project management) 03 medical and health sciences 0302 clinical medicine 030221 ophthalmology & optometry 0202 electrical engineering electronic engineering information engineering Neural network architecture 020201 artificial intelligence & image processing Artificial intelligence business computer Natural language processing Meaning (linguistics) |
Zdroj: | Communications in Computer and Information Science ISBN: 9783030012038 |
Popis: | Wikification is a crucial NLP task that aims to identify entities in text and disambiguate their meaning. Being partially solved for English, the problem still remains fairly untouched for Russian. In this article we present a novel approach to Disambiguation to Wikipedia applied to the Russian language. Inspired by the Neural Machine Translation task our method implements encoder-decoder neural network architecture. It translates text tokens into concept embeddings that are subsequently used as context for disambiguation. In order to test our hypothesis we add our context features to GLOW system considered a baseline. Moreover, we present commonly available dataset for the Disambiguation to Wikipedia task. |
Databáze: | OpenAIRE |
Externí odkaz: |