Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Sayedshayan Hashemi Hosseinabad"'
Publikováno v:
The Visual Computer. 37:119-131
With the advent of deep learning, multi-modal data have been of great interest. One of the multi-modal tasks which can be included in the computer vision domain is visual question answering (VQA). In VQA, a question and an image are entered into the