Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Yeyun Zou"'
Visual question answering (VQA) is a task that combines both the techniques of computer vision and natural language processing. It requires models to answer a text-based question according to the information contained in a visual. In recent years, th
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::26ae4bd739aeaba1ed32f0c3669e4bdf
http://arxiv.org/abs/2105.00421
http://arxiv.org/abs/2105.00421