Smart Context Generation for Disambiguation to Wikipedia

Autor: Andrey Sysoev, Irina Nikishina
Rok vydání: 2018
Předmět:
Zdroj: Communications in Computer and Information Science ISBN: 9783030012038
Popis: Wikification is a crucial NLP task that aims to identify entities in text and disambiguate their meaning. Being partially solved for English, the problem still remains fairly untouched for Russian. In this article we present a novel approach to Disambiguation to Wikipedia applied to the Russian language. Inspired by the Neural Machine Translation task our method implements encoder-decoder neural network architecture. It translates text tokens into concept embeddings that are subsequently used as context for disambiguation. In order to test our hypothesis we add our context features to GLOW system considered a baseline. Moreover, we present commonly available dataset for the Disambiguation to Wikipedia task.
Databáze: OpenAIRE