Výsledky vyhledávání

Akademický článek

SceneGATE: Scene-Graph Based Co-Attention Networks for Text Visual Question Answering

Autor: Feiqi Cao, Siwen Luo, Felipe Nunez, Zean Wen, Josiah Poon, Soyeon Caren Han

Publikováno v: Robotics, Vol 12, Iss 4, p 114 (2023)

Visual Question Answering (VQA) models fail catastrophically on questions related to the reading of text-carrying images. However, TextVQA aims to answer questions by understanding the scene texts in an image–question context, such as the brand nam

Externí odkaz: https://doaj.org/article/7d186ce23f2d4b57bdfa7605718c5654

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání