Unsupervised Question Decomposition for Question Answering
Autor: | Patrick S. H. Lewis, Kyunghyun Cho, Wen-tau Yih, Ethan Perez, Douwe Kiela |
---|---|
Rok vydání: | 2020 |
Předmět: |
FOS: Computer and information sciences
Transduction (machine learning) Computer Science - Machine Learning Computer Science - Computation and Language business.industry Computer science Computer Science - Artificial Intelligence 02 engineering and technology Machine learning computer.software_genre Machine Learning (cs.LG) 03 medical and health sciences Fluency 0302 clinical medicine Artificial Intelligence (cs.AI) 030221 ophthalmology & optometry 0202 electrical engineering electronic engineering information engineering Question answering Leverage (statistics) 020201 artificial intelligence & image processing Artificial intelligence business computer Computation and Language (cs.CL) |
Zdroj: | EMNLP (1) |
DOI: | 10.48550/arxiv.2002.09758 |
Popis: | We aim to improve question answering (QA) by decomposing hard questions into simpler sub-questions that existing QA systems are capable of answering. Since labeling questions with decompositions is cumbersome, we take an unsupervised approach to produce sub-questions, also enabling us to leverage millions of questions from the internet. Specifically, we propose an algorithm for One-to-N Unsupervised Sequence transduction (ONUS) that learns to map one hard, multi-hop question to many simpler, single-hop sub-questions. We answer sub-questions with an off-the-shelf QA model and give the resulting answers to a recomposition model that combines them into a final answer. We show large QA improvements on HotpotQA over a strong baseline on the original, out-of-domain, and multi-hop dev sets. ONUS automatically learns to decompose different kinds of questions, while matching the utility of supervised and heuristic decomposition methods for QA and exceeding those methods in fluency. Qualitatively, we find that using sub-questions is promising for shedding light on why a QA system makes a prediction. Comment: EMNLP 2020 Camera-Ready. Code available at https://github.com/facebookresearch/UnsupervisedDecomposition |
Databáze: | OpenAIRE |
Externí odkaz: |