Gaze-dependent response activation in dialogue agent for cognitive-behavioral therapy.

Authors: Gabor-Siatkowska, Karolina, Stefaniak, Izabela, Janicki, Artur
Subject:
Source: Procedia Computer Science; 2024, Vol. 246, p2322-2331, 10p
Abstract: To provide additional support to psychiatric patients who have limited access to medical staff, our team has developed a therapeutic dialogue agent called Terabot. It operates in Polish and uses text-based emotion and intention recognition, taking the patient's emotions into account so that it can respond in an 'empathetic' way. During experiments conducted at the Institute of Psychiatry and Neurology in Warsaw, Poland, we observed several problems. In particular, when the patient's utterances were too short or too long, the dialogue system reacted inappropriately, either interrupting the patient's answer or causing very long pauses. With no input other than speech, the dialogue system could not time its answers well, which reduced comfort and disrupted the natural flow of dialogue during therapeutic sessions. Human communication research shows that a speaker tends to gaze at the interlocutor while finishing an utterance and directly afterwards. To address these problems, we analyzed the patients' fixation parameters in selected areas to find out where they gazed most as they ended their utterances. We found that the most fixations, and the longest fixation durations, occurred on Terabot's face. We concluded that a stream of real-time fixation data from these areas should serve as an additional input to our dialogue system. This information can help mark the end of an utterance explicitly for the dialogue system, so that better timing for Terabot's speech activation can be determined. This approach allows Terabot's response time to be personalized to suit patients' different response styles, making possible a more natural and human-like flow of conversation with our dialogue agent. [ABSTRACT FROM AUTHOR]
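Illustration (not part of the published abstract): the idea of using real-time fixation data on the agent's face to mark the end of an utterance can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the authors' implementation. The names Fixation, face_dwell and should_respond, the "face" area label, and all threshold values are hypothetical and chosen only for illustration; the abstract does not report any specific parameters.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Fixation:
    """One fixation event from an eye tracker (hypothetical record layout)."""
    start_s: float     # fixation onset, seconds since session start
    duration_s: float  # fixation length in seconds
    aoi: str           # area-of-interest label, e.g. "face"


def face_dwell(fixations: Iterable[Fixation], now_s: float,
               window_s: float, aoi: str = "face") -> float:
    """Total time spent fixating `aoi` within [now_s - window_s, now_s]."""
    window_start = now_s - window_s
    total = 0.0
    for f in fixations:
        if f.aoi != aoi:
            continue
        # Count only the part of the fixation that overlaps the window.
        overlap = min(f.start_s + f.duration_s, now_s) - max(f.start_s, window_start)
        if overlap > 0:
            total += overlap
    return total


def should_respond(fixations: Iterable[Fixation], now_s: float, silence_s: float,
                   *, window_s: float = 1.0, min_dwell_s: float = 0.4,
                   min_silence_s: float = 0.3, max_silence_s: float = 3.0) -> bool:
    """Decide whether the agent may start speaking.

    All thresholds are illustrative assumptions; in a personalized setup
    they would be tuned per patient.
    """
    if silence_s >= max_silence_s:
        return True   # fallback: avoid very long pauses even without gaze
    if silence_s < min_silence_s:
        return False  # patient is probably still speaking
    # A short pause plus sustained gaze on the agent's face suggests a finished turn.
    return face_dwell(fixations, now_s, window_s) >= min_dwell_s
```

In this sketch, a short pause combined with sustained gaze on the face area triggers the response early, while the max_silence_s fallback bounds the pause when no gaze cue arrives. Per-patient values for min_dwell_s and min_silence_s would be one way to reflect the individual response styles mentioned in the abstract.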
Database: Supplemental Index