Showing 1 - 1 of 1 for search: '"Naik, Nandita Shankar"'
Current visual question answering (VQA) models tend to be trained and evaluated on image-question pairs in isolation. However, the questions people ask are dependent on their informational needs and prior knowledge about the image content. To evaluat
External link:
http://arxiv.org/abs/2402.15002