Zobrazeno 1 - 10
of 9 149
pro vyhledávání: '"P. Ngan"'
Autor:
Ngan, Kinfung, Sun, Shuo
Solid-state quantum emitters, such as quantum dots, color centers, rare-earth dopants, and organic molecules, offer qubit systems that integrate well with chip-scale photonic and electronic devices. To fully harness their potential for quantum applic
Externí odkaz:
http://arxiv.org/abs/2412.09976
Existing Large Vision-Language Models (LVLMs) excel at matching concepts across multi-modal inputs but struggle with compositional concepts and high-level relationships between entities. This paper introduces Progressive multi-granular Vision-Languag
Externí odkaz:
http://arxiv.org/abs/2412.08125
Handling occlusion remains a significant challenge for video instance-level tasks like Multiple Object Tracking (MOT) and Video Instance Segmentation (VIS). In this paper, we propose a novel framework, Amodal-Aware Video Instance Segmentation (A2VIS)
Externí odkaz:
http://arxiv.org/abs/2412.01147
In this paper, we aimed to develop a neural parser for Vietnamese based on simplified Head-Driven Phrase Structure Grammar (HPSG). The existing corpora, VietTreebank and VnDT, had around 15% of constituency and dependency tree pairs that did not adhe
Externí odkaz:
http://arxiv.org/abs/2411.17270
Autor:
Pham, Trong Thang, Ho, Ngoc-Vuong, Bui, Nhat-Tan, Phan, Thinh, Brijesh, Patel, Adjeroh, Donald, Doretto, Gianfranco, Nguyen, Anh, Wu, Carol C., Nguyen, Hien, Le, Ngan
Developing an interpretable system for generating reports in chest X-ray (CXR) analysis is becoming increasingly crucial in Computer-aided Diagnosis (CAD) systems, enabling radiologists to comprehend the decisions made by these systems. Despite the g
Externí odkaz:
http://arxiv.org/abs/2411.15413
Natural Language Inference (NLI) is a task within Natural Language Processing (NLP) that holds value for various AI applications. However, there have been limited studies on Natural Language Inference in Vietnamese that explore the concept of joint m
Externí odkaz:
http://arxiv.org/abs/2411.13407
Autor:
Pham, Trong Thang, Nguyen, Tien-Phat, Ikebe, Yuki, Awasthi, Akash, Deng, Zhigang, Wu, Carol C., Nguyen, Hien, Le, Ngan
Medical eye-tracking data is an important information source for understanding how radiologists visually interpret medical images. This information not only improves the accuracy of deep learning models for X-ray analysis but also their interpretabil
Externí odkaz:
http://arxiv.org/abs/2411.05780
Autor:
Jianu, Tudor, Huang, Baoru, Nguyen, Hoan, Bhattarai, Binod, Do, Tuong, Tjiputra, Erman, Tran, Quang, Berthet-Rayne, Pierre, Le, Ngan, Fichera, Sebastiano, Nguyen, Anh
Endovascular surgical tool reconstruction represents an important factor in advancing endovascular tool navigation, which is an important step in endovascular surgery. However, the lack of publicly available datasets significantly restricts the devel
Externí odkaz:
http://arxiv.org/abs/2410.22224
Object tracking, especially animal tracking, is one of the key topics that attract a lot of attention due to its benefits of animal behavior understanding and monitoring. Recent state-of-the-art tracking methods are founded on deep learning architect
Externí odkaz:
http://arxiv.org/abs/2410.15518
Text-based VQA is a challenging task that requires machines to use scene texts in given images to yield the most appropriate answer for the given question. The main challenge of text-based VQA is exploiting the meaning and information from scene text
Externí odkaz:
http://arxiv.org/abs/2410.14132