Výsledky vyhledávání

Autor: Qiyu Xie, Yeyun Zou

Visual question answering (VQA) is a task that combines both the techniques of computer vision and natural language processing. It requires models to answer a text-based question according to the information contained in a visual. In recent years, th

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::26ae4bd739aeaba1ed32f0c3669e4bdf
http://arxiv.org/abs/2105.00421

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání