Fusing Dialogue and Gaze From Discussions of 2D and 3D Scenes

Authors: Reynold Bailey, Bradley J. S. C. Olson, Preethi Vaidyanathan, Cecilia Ovesdotter Alm, Regina Wang
Publication year: 2019
Subject:
Source: ICMI (Adjunct)
DOI: 10.1145/3351529.3360661
Description: Conversation partners rely on inference using each other’s gaze and utterances to negotiate shared meaning. In contrast, dialogue systems still operate mostly with unimodal question or command and response interactions. To realize systems that can intuitively discuss and collaborate with humans, we should consider other sensory information. We begin to address this limitation with an innovative study that acquires, analyzes, and fuses interlocutors’ discussion and gaze. Introducing a discussion-based elicitation task, we collect gaze with remote and wearable eye trackers alongside dialogue as interlocutors come to consensus on questions about an on-screen 2D image and a real-world 3D scene. We analyze the visual-linguistic patterns, and also map the modalities onto the visual environment by extending a multimodal image region annotation framework using statistical machine translation for multimodal fusion, applying three ways of fusing speakers’ gaze and discussion.
Database: OpenAIRE