Search results

Report

Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges

Authors: Van Dinh, Nguyen, Dang, Thanh Chi, Nguyen, Luan Thanh, Van Nguyen, Kiet

Vietnamese, a low-resource language, is typically categorized into three primary dialect groups that belong to Northern, Central, and Southern Vietnam. However, each province within these regions exhibits its own distinct pronunciation variations. De

External link: http://arxiv.org/abs/2410.03458

Report

FedMAC: Tackling Partial-Modality Missing in Federated Learning with Cross-Modal Aggregation and Contrastive Regularization

Authors: Nguyen, Manh Duong, Nguyen, Trung Thanh, Pham, Huy Hieu, Hoang, Trong Nghia, Nguyen, Phi Le, Huynh, Thanh Trung

Federated Learning (FL) is a method for training machine learning models using distributed data sources. It ensures privacy by allowing clients to collaboratively learn a shared global model while storing their data locally. However, a significant ch

External link: http://arxiv.org/abs/2410.03070

Report

ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering

Authors: Nguyen, Nghia Hieu, Quan, Tho Thanh, Nguyen, Ngan Luu-Thuy

Text-based VQA is a challenging task that requires machines to use scene texts in given images to yield the most appropriate answer for the given question. The main challenge of text-based VQA is exploiting the meaning and information from scene text

External link: http://arxiv.org/abs/2410.14132

Report

Representation Learning of Structured Data for Medical Foundation Models

Authors: Dwivedi, Vijay Prakash, Schlegel, Viktor, Liu, Andy T., Nguyen, Thanh-Tung, Kashyap, Abhinav Ramesh, Wei, Jeng, Yin, Wei-Hsian, Winkler, Stefan, Tan, Robby T.

Large Language Models (LLMs) have demonstrated remarkable performance across various domains, including healthcare. However, their ability to effectively represent structured non-textual data, such as the alphanumeric medical codes used in records li

External link: http://arxiv.org/abs/2410.13351

Report

AADNet: An End-to-End Deep Learning Model for Auditory Attention Decoding

Authors: Nguyen, Nhan Duc Thanh, Phan, Huy, Geirnaert, Simon, Mikkelsen, Kaare, Kidmose, Preben

Auditory attention decoding (AAD) is the process of identifying the attended speech in a multi-talker environment using brain signals, typically recorded through electroencephalography (EEG). Over the past decade, AAD has undergone continuous develop

External link: http://arxiv.org/abs/2410.13059

Report

Hiding-in-Plain-Sight (HiPS) Attack on CLIP for Targetted Object Removal from Images

Authors: Daw, Arka, Chung, Megan Hong-Thanh, Mahbub, Maria, Sadovnik, Amir

Machine learning models are known to be vulnerable to adversarial attacks, but traditional attacks have mostly focused on single-modalities. With the rise of large multi-modal models (LMMs) like CLIP, which combine vision and language capabilities, n

External link: http://arxiv.org/abs/2410.13010

Report

Implementing Derivations of Definite Logic Programs with Self-Attention Networks

Authors: Thuy, Phan Thi Thanh, Yamamoto, Akihiro

In this paper we propose that a restricted version of logical inference can be implemented with self-attention networks. We are aiming at showing that LLMs (Large Language Models) constructed with transformer networks can make logical inferences. We

External link: http://arxiv.org/abs/2410.11396

Report

Exploiting LLMs' Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval

Authors: Nguyen, Hai-Long, Nguyen, Tan-Minh, Nguyen, Duc-Minh, Vuong, Thi-Hai-Yen, Nguyen, Ha-Thanh, Phan, Xuan-Hieu

Statutory law retrieval is a typical problem in legal language processing, that has various practical applications in law engineering. Modern deep learning-based retrieval methods have achieved significant results for this problem. However, retrieval

External link: http://arxiv.org/abs/2410.12154

Report

Layer-of-Thoughts Prompting (LoT): Leveraging LLM-Based Retrieval with Constraint Hierarchies

Authors: Fungwacharakorn, Wachara, Thanh, Nguyen Ha, Zin, May Myo, Satoh, Ken

This paper presents a novel approach termed Layer-of-Thoughts Prompting (LoT), which utilizes constraint hierarchies to filter and refine candidate responses to a given query. By integrating these constraints, our method enables a structured retrieva

External link: http://arxiv.org/abs/2410.12153

Report

Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition

Authors: Le, Kha Nhat, Nguyen, Hoang-Tuan, Tran, Hung Tien, Ngo, Thanh Duc

Unsupervised domain adaptation (UDA) has become increasingly prevalent in scene text recognition (STR), especially where training and testing data reside in different domains. The efficacy of existing UDA approaches tends to degrade when there is a l

External link: http://arxiv.org/abs/2410.09913
