Compact and Interpretable Dialogue State Representation with Genetic Sparse Distributed Memory
Autor: | Olivier Pietquin, Layla El Asri, Romain Laroche |
---|---|
Rok vydání: | 2016 |
Předmět: |
State-space representation
business.industry Computer science media_common.quotation_subject 02 engineering and technology Machine learning computer.software_genre Grid 03 medical and health sciences 0302 clinical medicine Scalability 030221 ophthalmology & optometry 0202 electrical engineering electronic engineering information engineering Reinforcement learning State space 020201 artificial intelligence & image processing Artificial intelligence Sparse distributed memory business Function (engineering) Representation (mathematics) computer media_common |
Zdroj: | Lecture Notes in Electrical Engineering ISBN: 9789811025846 IWSDS |
DOI: | 10.1007/978-981-10-2585-3_3 |
Popis: | User satisfaction is often considered as the objective that should be achieved by spoken dialogue systems. This is why the reward function of Spoken Dialogue Systems (SDS) trained by Reinforcement Learning (RL) is often designed to reflect user satisfaction. To do so, the state space representation should be based on features capturing user satisfaction characteristics such as the mean speech recognition confidence score for instance. On the other hand, for deployment in industrial systems there is a need for state representations that are understandable by system engineers. In this article, we propose to represent the state space using a Genetic Sparse Distributed Memory. This is a state aggregation method computing state prototypes which are selected so as to lead to the best linear representation of the value function in RL. To do so, previous work on Genetic Sparse Distributed Memory for classification is adapted to the Reinforcement Learning task and a new way of building the prototypes is proposed. The approach is tested on a corpus of dialogues collected with an appointment scheduling system. The results are compared to a grid-based linear parametrisation. It is shown that learning is accelerated and made more memory efficient. It is also shown that the framework is scalable in that it is possible to include many dialogue features in the representation, interpret the resulting policy and identify the most important dialogue features. |
Databáze: | OpenAIRE |
Externí odkaz: |