Showing 1 - 10 of 110 for search: '"Ernest Valveny"'
Published in:
IEEE Access, Vol 10, Pp 72092-72106 (2022)
The open-ended question answering task of Text-VQA often requires reading and reasoning about rarely seen or completely unseen scene text content of an image. We address this zero-shot nature of the task by proposing the generalized use of external k
External link:
https://doaj.org/article/0aa82099de6941e4ad0f990d0ff5084f
This work explores a closure task in comics, a medium where visual and textual elements are intricately intertwined. Specifically, Text-cloze refers to the task of selecting the correct text to use in a comic panel, given its neighboring panels. Trad
External link:
http://arxiv.org/abs/2403.03719
Published in:
Lecture Notes in Computer Science ISBN: 9783031250682
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::4cba3c96ecf2fee792602e087c866df7
https://doi.org/10.1007/978-3-031-25069-9_16
Author:
Dimosthenis Karatzas, Ernest Valveny, Marçal Rusiñol, Ali Furkan Biten, Lluis Gomez, Andres Mafla, Rubèn Tito
Published in:
Pattern Recognition Letters. 150:242-249
This paper presents a new model for the task of scene text visual question answering, in which questions about a given image can only be answered by reading and understanding scene text that is present in it. The proposed model is based on an attenti
Published in:
Pattern Recognition Letters. 149:164-171
Images with visual and scene text content are ubiquitous in everyday life. However, current image interpretation systems are mostly limited to using only the visual features, neglecting to leverage the scene text content. In this paper, we propose to
Author:
Ernest Valveny, Ramon Vilanova
Published in:
EDULEARN Proceedings.
Published in:
2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
The open-ended question answering task of Text-VQA often requires reading and reasoning about rarely seen or completely unseen scene-text content of an image. We address this zero-shot nature of the problem by proposing the generalized use of externa
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::96ee2e538bbb532e8813296779f543c2
Published in:
Document Analysis and Recognition – ICDAR 2021 ISBN: 9783030863364
ICDAR (4)
In this report we present the results of the ICDAR 2021 edition of the Document Visual Question Answering Challenges. This edition complements the previous tasks on Single Document VQA and Document Collection VQA with a newly introduced one on Infographics VQA. Infogr
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::9aba9eef8f89b34e7e0d75b0014c8921
https://doi.org/10.1007/978-3-030-86337-1_42
Published in:
Document Analysis and Recognition – ICDAR 2021 ISBN: 9783030863302
ICDAR (2)
Current tasks and methods in Document Understanding aim to process documents as single elements. However, documents are usually organized in collections (historical records, purchase invoices) that provide context useful for their interpretation. T
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::03d5eae05b8e0b683f84b095c180b546
https://doi.org/10.1007/978-3-030-86331-9_50