NELL2RDF: Reading the Web, and Publishing it as Linked Data

Autor: Giménez-García, José M., Duarte, Maísa, Zimmermann, Antoine, Gravier, Christophe, Hruschke Jr., Estevam R., Maret, Pierre
Rok vydání: 2018
Předmět:
Druh dokumentu: Working Paper
Popis: NELL is a system that continuously reads the Web to extract knowledge in form of entities and relations between them. It has been running since January 2010 and extracted over 50,000,000 candidate statements. NELL's generated data comprises all the candidate statements together with detailed information about how it was generated. This information includes how each component of the system contributed to the extraction of the statement, as well as when that happened and how confident the system is in the veracity of the statement. However, the data is only available in an ad hoc CSV format that makes it difficult to exploit out of the context of NELL. In order to make it more usable for other communities, we adopt Linked Data principles to publish a more standardized, self-describing dataset with rich provenance metadata.
Databáze: arXiv