Ensemble-based method of answers retrieval for domain specific questions from text-based documentation
Autor: | Dmitriy Alexandrov, Iskander Safiulin, Denis A. Nasonov, Nikolay Butakov |
---|---|
Rok vydání: | 2019 |
Předmět: |
Information retrieval
Computer science business.industry media_common.quotation_subject computer.software_genre Chatbot Domain (software engineering) Task (project management) Set (abstract data type) Documentation Knowledge base General Earth and Planetary Sciences Quality (business) business Function (engineering) computer General Environmental Science media_common |
Zdroj: | Procedia Computer Science. 156:158-165 |
ISSN: | 1877-0509 |
Popis: | Many companies want or prefer to use chatbot systems to provide smart assistants for accompanying human specialists especially newbies with automatic consulting. Implementation of a really useful smart assistant for a specific domain requires a knowledge base for this domain, that often exists only in the form of text documentation and manuals. Lacks of properly built datasets and often expensiveness in resources and time to build one from scratch to apply data-driven methods with high quality. It motivates to seek a solution that can work without such data or require only a small amount of it though having reduced quality. The reformulation of the task into an information retrieval problem where the assistant responds with a piece of documentation instead of generated sentences may make the task easier but doesn’t solve the whole problem. It allows using of metrics-based methods with reduced search quality or data-driven methods which also needs a great amount of data. In this paper, we propose a new ensemble-based data-driven method that tries to learn a scoring function by combining independent functions from a predefined set. The method may substantially improve the quality of the search in comparison with pure metrics-based methods while requiring significantly less data for training than data-driven methods. |
Databáze: | OpenAIRE |
Externí odkaz: |