Kinetics and Scene Features for Intent Detection

Autor:	Ching-Yung Lin, Serena Yuan, Tianle Zhu, Ziyin Wang, Vishal Anand, Wenfeng Lyu, Raksha Ramesh
Rok vydání:	2020
Předmět:	Artificial neural network Research areas business.industry Computer science Cognitive neuroscience of visual object recognition computer.software_genre Activity recognition Information extraction Relational knowledge Video tracking Artificial intelligence Baseline (configuration management) business computer Natural language processing
Zdroj:	ICMI Companion
DOI:	10.1145/3395035.3425641
Popis:	We create multi-modal fusion models to predict relational classes within entities in free-form inputs such as unseen movies. Our approach identifies information rich features within individual sources -- emotion, text-attention, age, gender, and contextual background object tracking. These information are absorbed and contrasted from baseline fusion architectures. These five models then showcase future research areas on this challenging problem of relational knowledge extraction from movies and free-form multi-modal input sources. We find that, generally, the Kinetics model added with Attributes and Objects beat the baseline models.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::d24d78ebef9765f42b32e6127f724427 https://doi.org/10.1145/3395035.3425641 Zobrazit plný text záznamu