Search results - "Thao Minh Le"

VLSP 2021 - VieCap4H Challenge: Automatic Image Caption Generation for Healthcare Domain in Vietnamese

Authors: Xuan-Son Vu, Huyen Nguyen, Thanh-Son Nguyen, Long Hoang Dang, Thao Minh Le

Published in: VNU Journal of Science: Computer Science and Communication Engineering. 38

This paper presents VieCap4H, a grand data challenge on automatic image caption generation for the healthcare domain in Vietnamese. VieCap4H is held as part of the eighth annual workshop on Vietnamese Language and Speech Processing (VLSP 2021). The ta

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::a8b5e8fd6f722f5ea1e3933956799b46
https://doi.org/10.25073/2588-1086/vnucsce.341

Guiding Visual Question Answering with Attention Priors

Authors: Thao Minh Le, Vuong Le, Sunil Gupta, Svetha Venkatesh, Truyen Tran

The current success of modern visual reasoning systems is arguably attributed to cross-modality attention mechanisms. However, in deliberative reasoning such as in VQA, attention is unconstrained at each step, and thus may serve as a statistical pool

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::78b9745b00d24ca314152c95d6c67f5d
http://arxiv.org/abs/2205.12616

Video Dialog as Conversation About Objects Living in Space-Time

Authors: Hoang-Anh Pham, Thao Minh Le, Vuong Le, Tu Minh Phuong, Truyen Tran

Published in: Lecture Notes in Computer Science ISBN: 9783031198410

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::39c7011c1c589e4197d063d4da29d0fb
https://doi.org/10.1007/978-3-031-19842-7_41

From Deep Learning to Deep Reasoning

Authors: Hung Le, Thao Minh Le, Vuong Le, Truyen Tran

Published in: KDD

The rise of big data and big compute has brought modern neural networks to many walks of digital life, thanks to the relative ease of constructing large models that scale to the real world. Current successes of Transformers and self-supervised pretra

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::dc95bdff5c72da229fe2fb5a4d958e3c
https://doi.org/10.1145/3447548.3470803

GEFA: Early Fusion Approach in Drug-Target Affinity Prediction

Authors: Thin Nguyen, Thao Minh Le, Tri Minh Nguyen, Truyen Tran

Published in: IEEE/ACM Transactions on Computational Biology and Bioinformatics. 19(2)

Predicting the interaction between a compound and a target is crucial for rapid drug repurposing. Deep learning has been successfully applied to the drug-target affinity (DTA) problem. However, previous deep learning-based methods ignore modeling the dir

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cd60410974f2b2230e794f4e35e2d121
https://pubmed.ncbi.nlm.nih.gov/34197324

Object-Centric Representation Learning for Video Question Answering

Authors: Truyen Tran, Thao Minh Le, Vuong Le, Long Hoang Dang

Published in: IJCNN

Video question answering (Video QA) presents a powerful testbed for human-like intelligent behaviors. The task demands new capabilities to integrate video processing, language understanding, binding abstract linguistic concepts to concrete visual art

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8bb7ad4f2d481aeb52224caae45e9503

Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering

Authors: Truyen Tran, Thao Minh Le, Vuong Le, Long Hoang Dang

Published in: IJCAI

Video Question Answering (Video QA) is a powerful testbed to develop new AI capabilities. This task necessitates learning to reason about objects, relations, and events across visual and linguistic domains in space-time. High-level reasoning demands

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3a2e1fa420a7213ebaba196a9c002892

Hierarchical Conditional Relation Networks for Multimodal Video Question Answering

Authors: Thao Minh Le, Vuong Le, Truyen Tran, Svetha Venkatesh

Video QA challenges modelers on multiple fronts. Modeling video necessitates building not only spatio-temporal models for the dynamic visual channel but also multimodal structures for associated information channels such as subtitles or audio. Video

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5b4db2ced1c809cb99d35ea16b69151d
http://arxiv.org/abs/2010.10019

Dynamic Language Binding in Relational Visual Reasoning

Authors: Truyen Tran, Svetha Venkatesh, Thao Minh Le, Vuong Le

Published in: IJCAI

We present Language-binding Object Graph Network, the first neural reasoning method with dynamic relational structures across both visual and textual domains with applications in visual question answering. Relaxing the common assumption made by curre

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3f82d0467fa04352b312f534654fbc10
https://doi.org/10.24963/ijcai.2020/114
