Extracting relations from texts using vector language models and a neural network classifier

Autor: Maksim Shishaev, Vladimir Dikovitsky, Vadim Pimeshkov, Nikita Kuprikov, Mikhail Kuprikov, Viacheslav Shkodyrev
Jazyk: angličtina
Rok vydání: 2023
Předmět:
Zdroj: PeerJ Computer Science, Vol 9, p e1636 (2023)
Druh dokumentu: article
ISSN: 2376-5992
DOI: 10.7717/peerj-cs.1636
Popis: The article investigates the possibility of identifying the presence of SKOS (Simple Knowledge Organization System) relations between concepts represented by terms on the base of their vector representation in general natural language models. Several language models of the Word2Vec and GloVe families are considered, on the basis of which an artificial neural network (ANN) classifier of SKOS relations is formed. To train and test the efficiency of the classifier, datasets formed on the basis of the DBPedia and EuroVoc thesauri are used. The experiments performed have shown the high efficiency of the classifier trained using GloVe family models, while training it with use of Word2Vec models looks impossible in the bounds of considered ANN-based classifier architecture. Based on the results, a conclusion is made about the key role of taking into account the global context of the use of terms in the text for the possibility of identifying SKOS relations.
Databáze: Directory of Open Access Journals