A Vietnamese Dataset for Evaluating Machine Reading Comprehension

Autor:	Anh Hoang-Tu Nguyen, Kiet Van Nguyen, Vu Duc Nguyen, Ngan Luu-Thuy Nguyen
Rok vydání:	2020
Předmět:	Matching (statistics) Computer science Process (engineering) business.industry First language Vietnamese 02 engineering and technology computer.software_genre language.human_language Task (project management) Comprehension 03 medical and health sciences 0302 clinical medicine Benchmark (surveying) 030221 ophthalmology & optometry 0202 electrical engineering electronic engineering information engineering Question answering language 020201 artificial intelligence & image processing Artificial intelligence business computer Natural language processing
Zdroj:	COLING
DOI:	10.18653/v1/2020.coling-main.233
Popis:	Over 97 million inhabitants speak Vietnamese as the native language in the world. However, there are few research studies on machine reading comprehension (MRC) in Vietnamese, the task of understanding a document or text, and answering questions related to it. Due to the lack of benchmark datasets for Vietnamese, we present the Vietnamese Question Answering Dataset (UIT-ViQuAD), a new dataset for the low-resource language as Vietnamese to evaluate MRC models. This dataset comprises over 23,000 human-generated question-answer pairs based on 5,109 passages of 174 Vietnamese articles from Wikipedia. In particular, we propose a new process of dataset creation for Vietnamese MRC. Our in-depth analyses illustrate that our dataset requires abilities beyond simple reasoning like word matching and demands complicate reasoning such as single-sentence and multiple-sentence inferences. Besides, we conduct experiments on state-of-the-art MRC methods in English and Chinese as the first experimental models on UIT-ViQuAD, which will be compared to further models. We also estimate human performances on the dataset and compare it to the experimental results of several powerful machine models. As a result, the substantial differences between humans and the best model performances on the dataset indicate that improvements can be explored on UIT-ViQuAD through future research. Our dataset is freely available to encourage the research community to overcome challenges in Vietnamese MRC.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::9d23a83e692d390d0e7a697cb8ecdaf8 https://doi.org/10.18653/v1/2020.coling-main.233 Zobrazit plný text záznamu