Zobrazeno 1 - 5
of 5
pro vyhledávání: '"Vo, Duong T. D."'
In recent years, visual question answering (VQA) has attracted attention from the research community because of its highly potential applications (such as virtual assistance on intelligent cars, assistant devices for blind people, or information retr
Externí odkaz:
http://arxiv.org/abs/2305.04183
Autor:
Nguyen, Ngan Luu-Thuy, Nguyen, Nghia Hieu, Vo, Duong T. D, Tran, Khanh Quoc, Van Nguyen, Kiet
Visual Question Answering (VQA) is a challenging task of natural language processing (NLP) and computer vision (CV), attracting significant attention from researchers. English is a resource-rich language that has witnessed various developments in dat
Externí odkaz:
http://arxiv.org/abs/2302.11752
Recognizing handwriting images is challenging due to the vast variation in writing style across many people and distinct linguistic aspects of writing languages. In Vietnamese, besides the modern Latin characters, there are accent and letter marks to
Externí odkaz:
http://arxiv.org/abs/2211.05407
Image captioning is currently a challenging task that requires the ability to both understand visual information and use human language to describe this visual information in the image. In this paper, we propose an efficient way to improve the image
Externí odkaz:
http://arxiv.org/abs/2211.05405
Autor:
Nguyen, Ngan Luu-Thuy, Nguyen, Nghia Hieu, Vo, Duong T. D, Tran, Khanh Quoc, Van Nguyen, Kiet
Visual Question Answering (VQA) is a challenging task of natural language processing (NLP) and computer vision (CV), attracting significant attention from researchers. English is a resource-rich language that has witnessed various developments in dat
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ae5dd98798c9efae3b56294dc9db56eb