Výsledky vyhledávání - "Luo, Haozheng"

Report

Decoupled Alignment for Robust Plug-and-Play Adaptation

Autor: Luo, Haozheng, Yu, Jiahao, Zhang, Wenxin, Li, Jialong, Hu, Jerry Yao-Chieh, Xing, Xinyu, Liu, Han

We introduce a low-resource safety enhancement method for aligning large language models (LLMs) without the need for supervised fine-tuning (SFT) or reinforcement learning from human feedback (RLHF). Our main idea is to exploit knowledge distillation

Externí odkaz: http://arxiv.org/abs/2406.01514

Zobrazit plný text záznamu

Report

Enhancing Jailbreak Attack Against Large Language Models through Silent Tokens

Autor: Yu, Jiahao, Luo, Haozheng, Hu, Jerry Yao-Chieh, Guo, Wenbo, Liu, Han, Xing, Xinyu

Along with the remarkable successes of Language language models, recent research also started to explore the security threats of LLMs, including jailbreaking attacks. Attackers carefully craft jailbreaking prompts such that a target LLM will respond

Externí odkaz: http://arxiv.org/abs/2405.20653

Zobrazit plný text záznamu

Report

Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action

Autor: Pan, Zhenyu, Luo, Haozheng, Li, Manling, Liu, Han

We present a Conversational Chain-of-Action (Conv-CoA) framework for Open-domain Conversational Question Answering (OCQA). Compared with literature, Conv-CoA addresses three major challenges: (i) unfaithful hallucination that is inconsistent with rea

Externí odkaz: http://arxiv.org/abs/2405.17822

Zobrazit plný text záznamu

Report

Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models

Autor: Pan, Zhenyu, Luo, Haozheng, Li, Manling, Liu, Han

We present a Chain-of-Action (CoA) framework for multimodal and retrieval-augmented Question-Answering (QA). Compared to the literature, CoA overcomes two major challenges of current QA applications: (i) unfaithful hallucination that is inconsistent

Externí odkaz: http://arxiv.org/abs/2403.17359

Zobrazit plný text záznamu

Report

SMUTF: Schema Matching Using Generative Tags and Hybrid Features

Autor: Zhang, Yu, Di, Mei, Luo, Haozheng, Xu, Chenwei, Tsai, Richard Tzong-Han

We introduce SMUTF, a unique approach for large-scale tabular data schema matching (SM), which assumes that supervised learning does not affect performance in open-domain tasks, thereby enabling effective cross-domain matching. This system uniquely c

Externí odkaz: http://arxiv.org/abs/2402.01685

Zobrazit plný text záznamu

Report

SciAnnotate: A Tool for Integrating Weak Labeling Sources for Sequence Labeling

Autor: Liu, Mengyang, Luo, Haozheng, Thong, Leonard, Li, Yinghao, Zhang, Chao, Song, Le

Weak labeling is a popular weak supervision strategy for Named Entity Recognition (NER) tasks, with the goal of reducing the necessity for hand-crafted annotations. Although there are numerous remarkable annotation tools for NER labeling, the subject

Externí odkaz: http://arxiv.org/abs/2208.10241

Zobrazit plný text záznamu

Report

MBGDT:Robust Mini-Batch Gradient Descent

Autor: Wang, Hanming, Luo, Haozheng, Wang, Yue

In high dimensions, most machine learning method perform fragile even there are a little outliers. To address this, we hope to introduce a new method with the base learner, such as Bayesian regression or stochastic gradient descent to solve the probl

Externí odkaz: http://arxiv.org/abs/2206.07139

Zobrazit plný text záznamu

Report

IGN : Implicit Generative Networks

Autor: Luo, Haozheng, Wu, Tianyi, Han, Colin Feiyu, Yan, Zhijun

In this work, we build recent advances in distributional reinforcement learning to give a state-of-art distributional variant of the model based on the IQN. We achieve this by using the GAN model's generator and discriminator function with the quanti

Externí odkaz: http://arxiv.org/abs/2206.05860

Zobrazit plný text záznamu

Report

IBERT: Idiom Cloze-style reading comprehension with Attention

Autor: Qin, Ruiyang, Luo, Haozheng, Fan, Zheheng, Ren, Ziang

Idioms are special fixed phrases usually derived from stories. They are commonly used in casual conversations and literary writings. Their meanings are usually highly non-compositional. The idiom cloze task is a challenge problem in Natural Language

Externí odkaz: http://arxiv.org/abs/2112.02994

Zobrazit plný text záznamu

Report

Open-Ended Multi-Modal Relational Reasoning for Video Question Answering

Autor: Luo, Haozheng, Qin, Ruiyang, Xu, Chenwei, Ye, Guo, Luo, Zening

In this paper, we introduce a robotic agent specifically designed to analyze external environments and address participants' questions. The primary focus of this agent is to assist individuals using language-based interactions within video-based scen

Externí odkaz: http://arxiv.org/abs/2012.00822

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání