Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Barnwal, Rohit Prakash"'
We tackle the challenge of Visual Question Answering in multi-image setting for the ISVQA dataset. Traditional VQA tasks have focused on a single-image setting where the target answer is generated from a single image. Image set VQA, however, comprise
Externí odkaz:
http://arxiv.org/abs/2104.00107