RuThes Thesaurus in Detecting Russian Paraphrases

Autor: Natalia V. Loukachevitch, Boris V. Dobrov, Valerie A. Mozharova, A. A. Pavlov, Aleksandr Shevelev
Rok vydání: 2017
Předmět:
Zdroj: Communications in Computer and Information Science ISBN: 9783319717456
DOI: 10.1007/978-3-319-71746-3_20
Popis: In this paper we study the contribution of semantic features to the detection of Russian paraphrases. The features were calculated on the Russian Thesaurus RuThes. First, we applied RuThes synonyms in clustering news articles, many of which had been created with rewriting (that is paraphrasing) of source news, and found significant improvement. Second, we applied several semantic similarity measures proposed for English thesaurus WordNet to RuThes thesaurus and utilized them for detecting Russian paraphrased sentences.
Databáze: OpenAIRE