Ensemble-based method of answers retrieval for domain specific questions from text-based documentation

Autor: Dmitriy Alexandrov, Iskander Safiulin, Denis A. Nasonov, Nikolay Butakov
Rok vydání: 2019
Předmět:
Zdroj: Procedia Computer Science. 156:158-165
ISSN: 1877-0509
Popis: Many companies want or prefer to use chatbot systems to provide smart assistants for accompanying human specialists especially newbies with automatic consulting. Implementation of a really useful smart assistant for a specific domain requires a knowledge base for this domain, that often exists only in the form of text documentation and manuals. Lacks of properly built datasets and often expensiveness in resources and time to build one from scratch to apply data-driven methods with high quality. It motivates to seek a solution that can work without such data or require only a small amount of it though having reduced quality. The reformulation of the task into an information retrieval problem where the assistant responds with a piece of documentation instead of generated sentences may make the task easier but doesn’t solve the whole problem. It allows using of metrics-based methods with reduced search quality or data-driven methods which also needs a great amount of data. In this paper, we propose a new ensemble-based data-driven method that tries to learn a scoring function by combining independent functions from a predefined set. The method may substantially improve the quality of the search in comparison with pure metrics-based methods while requiring significantly less data for training than data-driven methods.
Databáze: OpenAIRE