Zobrazeno 1 - 7
of 7
pro vyhledávání: '"Haydarov, Kilichbek"'
Autor:
Haydarov, Kilichbek, Shen, Xiaoqian, Madasu, Avinash, Salem, Mahmoud, Li, Li-Jia, Elsayed, Gamaleldin, Elhoseiny, Mohamed
We introduce Affective Visual Dialog, an emotion explanation and reasoning task as a testbed for research on understanding the formation of emotions in visually grounded conversations. The task involves three skills: (1) Dialog-based Question Answeri
Externí odkaz:
http://arxiv.org/abs/2308.16349
Video captioning aims to convey dynamic scenes from videos using natural language, facilitating the understanding of spatiotemporal information within our environment. Although there have been recent advances, generating detailed and enriched video d
Externí odkaz:
http://arxiv.org/abs/2304.04227
Autor:
Zhu, Deyao, Chen, Jun, Haydarov, Kilichbek, Shen, Xiaoqian, Zhang, Wenxuan, Elhoseiny, Mohamed
Asking insightful questions is crucial for acquiring knowledge and expanding our understanding of the world. However, the importance of questioning has been largely overlooked in AI research, where models have been primarily developed to answer quest
Externí odkaz:
http://arxiv.org/abs/2303.06594
Autor:
Haydarov, Kilichbek
Developing intelligent systems that can recognize and express human affects is essential to bridge the gap between human and artificial intelligence. This thesis explores the creative and emotional frontiers of artificial intelligence. Specifically,
Externí odkaz:
http://hdl.handle.net/10754/673850
Datasets that capture the connection between vision, language, and affection are limited, causing a lack of understanding of the emotional aspect of human intelligence. As a step in this direction, the ArtEmis dataset was recently introduced as a lar
Externí odkaz:
http://arxiv.org/abs/2204.07660
Autor:
Achlioptas, Panos, Ovsjanikov, Maks, Haydarov, Kilichbek, Elhoseiny, Mohamed, Guibas, Leonidas
We present a novel large-scale dataset and accompanying machine learning models aimed at providing a detailed understanding of the interplay between visual content, its emotional effect, and explanations for the latter in language. In contrast to mos
Externí odkaz:
http://arxiv.org/abs/2101.07396
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.