A Comparison of Word Embeddings and N-gram Models for DBpedia Type and Invalid Entity Detection

Autor: Hanqing Zhou, Amal Zouaq, Diana Inkpen
Jazyk: angličtina
Rok vydání: 2018
Předmět:
Zdroj: Information, Vol 10, Iss 1, p 6 (2018)
Druh dokumentu: article
ISSN: 2078-2489
DOI: 10.3390/info10010006
Popis: This article presents and evaluates a method for the detection of DBpedia types and entities that can be used for knowledge base completion and maintenance. This method compares entity embeddings with traditional N-gram models coupled with clustering and classification. We tackle two challenges: (a) the detection of entity types, which can be used to detect invalid DBpedia types and assign DBpedia types for type-less entities; and (b) the detection of invalid entities in the resource description of a DBpedia entity. Our results show that entity embeddings outperform n-gram models for type and entity detection and can contribute to the improvement of DBpedia’s quality, maintenance, and evolution.
Databáze: Directory of Open Access Journals
Nepřihlášeným uživatelům se plný text nezobrazuje