CrossOIE: Cross-Lingual Classifier for Open Information Extraction

Autor: Daniela Barreiro Claro, Marlo Souza, Rafael Glauber, Bruno Souza Cabral
Rok vydání: 2020
Předmět:
Zdroj: Lecture Notes in Computer Science ISBN: 9783030415044
PROPOR
DOI: 10.1007/978-3-030-41505-1_35
Popis: Open information extraction (Open IE) is the task of extracting open-domain assertions from natural language sentences. Considering the low availability of datasets and tools for this task in languages other than English, recently it has been proposed that multilingual resources can be used to improve Open IE methods for different languages. In this work, we present the CrossOIE, a multilingual publicly available relation tuple validity classifier that scores Open IE systems’ extractions based on their estimated quality and can be used to improve Open IE systems and assist in the creation of Open IE benchmarks for different languages. Experiments show that our model trained using a small corpus in English, Spanish, and Portuguese can trade recall performance for up to 27% improvement in precision. This result was also archived in a zero-shot scenario, demonstrating a successful knowledge transfer across the languages.
Databáze: OpenAIRE