Showing 1 - 10 of 802 for search: '"Zhang, WenXuan"'
Fine-tuning large language models (LLMs) on human preferences, typically through reinforcement learning from human feedback (RLHF), has proven successful in enhancing their capabilities. However, ensuring the safety of LLMs during the fine-tuning rem…
External link:
http://arxiv.org/abs/2408.15313
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages
Author:
Zhang, Wenxuan, Chan, Hou Pong, Zhao, Yiran, Aljunied, Mahani, Wang, Jianyu, Liu, Chaoqun, Deng, Yue, Hu, Zhiqiang, Xu, Weiwen, Chia, Yew Ken, Li, Xin, Bing, Lidong
Large Language Models (LLMs) have shown remarkable abilities across various tasks, yet their development has predominantly centered on high-resource languages like English and Chinese, leaving low-resource languages underserved. To address this dispa…
External link:
http://arxiv.org/abs/2407.19672
In recent years, the neural-network quantum states method has been investigated to study the ground state and the time evolution of many-body quantum systems. Here we expand on the investigation and consider a quantum quench from the paramagnetic to…
External link:
http://arxiv.org/abs/2406.03381
As LLMs evolve on a daily basis, there is an urgent need for a trustworthy evaluation method that can provide robust evaluation results in a timely fashion. Currently, as static benchmarks are prone to contamination concerns, users tend to trust huma…
External link:
http://arxiv.org/abs/2405.20267
Author:
Zhang, Wenxuan, Mohamed, Youssef, Ghanem, Bernard, Torr, Philip H. S., Bibi, Adel, Elhoseiny, Mohamed
We propose and study a realistic Continual Learning (CL) setting where learning algorithms are granted a restricted computational budget per time step while training. We apply this setting to large-scale semi-supervised Continual Learning scenarios w…
External link:
http://arxiv.org/abs/2404.12766
Large language models (LLMs) have demonstrated multilingual capabilities, yet they are mostly English-centric due to imbalanced training corpora. Existing works leverage this phenomenon to improve their multilingual performances through translat…
External link:
http://arxiv.org/abs/2403.10258
As an effective alternative to direct fine-tuning on target tasks in specific languages, cross-lingual transfer addresses the challenge of limited training data by decoupling "task ability" and "language ability", fine-tuning on the target…
External link:
http://arxiv.org/abs/2402.18913
Large language models (LLMs) have demonstrated impressive capabilities across diverse languages. This study explores how LLMs handle multilingualism. Based on observed language ratio shifts among layers and the relationships between network structure…
External link:
http://arxiv.org/abs/2402.18815
Web agents powered by Large Language Models (LLMs) have demonstrated remarkable abilities in planning and executing multi-step interactions within complex web-based environments, fulfilling a wide range of web navigation tasks. Despite these advancem…
External link:
http://arxiv.org/abs/2402.15057
Author:
Schuessler, Christian, Zhang, Wenxuan, Bräunig, Johanna, Hoffmann, Marcel, Stelzig, Michael, Vossiek, Martin
In the fast-paced field of human-computer interaction (HCI) and virtual reality (VR), automatic gesture recognition has become increasingly essential. This is particularly true for the recognition of hand signs, providing an intuitive way to effortle…
External link:
http://arxiv.org/abs/2402.12800