Showing 31 - 40 of 365
for the search: '"Huang, Minlie"'
Author:
Zhang, Zhexin, Lei, Leqi, Wu, Lindong, Sun, Rui, Huang, Yongkang, Long, Chong, Liu, Xiao, Lei, Xuanyu, Tang, Jie, Huang, Minlie
With the rapid development of Large Language Models (LLMs), increasing attention has been paid to their safety concerns. Consequently, evaluating the safety of LLMs has become an essential task for facilitating the broad applications of LLMs. Neverth…
External link:
http://arxiv.org/abs/2309.07045
Multiple choice questions (MCQs) serve as a common yet important task format in the evaluation of large language models (LLMs). This work shows that modern LLMs are vulnerable to option position changes in MCQs due to their inherent "selection bias", …
External link:
http://arxiv.org/abs/2309.03882
Author:
Liu, Xiao, Yu, Hao, Zhang, Hanchen, Xu, Yifan, Lei, Xuanyu, Lai, Hanyu, Gu, Yu, Ding, Hangliang, Men, Kaiwen, Yang, Kejuan, Zhang, Shudan, Deng, Xiang, Zeng, Aohan, Du, Zhengxiao, Zhang, Chenhui, Shen, Sheng, Zhang, Tianjun, Su, Yu, Sun, Huan, Huang, Minlie, Dong, Yuxiao, Tang, Jie
Large Language Models (LLMs) are becoming increasingly smart and autonomous, targeting real-world pragmatic missions beyond traditional NLP tasks. As a result, there has been an urgent need to evaluate LLMs as agents on challenging tasks in interacti…
External link:
http://arxiv.org/abs/2308.03688
Emotional support conversation (ESC) aims to provide emotional support (ES) to improve one's mental state. Existing works stay at fitting grounded responses and responding strategies (e.g., question), which ignore the effect on ES and lack explicit g…
External link:
http://arxiv.org/abs/2307.07994
Existing evaluation metrics for natural language generation (NLG) tasks face the challenges on generalization ability and interpretability. Specifically, most of the well-performed metrics are required to train on evaluation datasets of specific NLG…
External link:
http://arxiv.org/abs/2307.06869
Large pre-trained language models achieve impressive results across many tasks. However, recent works point out that pre-trained language models may memorize a considerable fraction of their training data, leading to the privacy risk of information l…
External link:
http://arxiv.org/abs/2307.04401
Author:
Guan, Jian, Huang, Minlie
Despite the huge progress in myriad generation tasks, pretrained language models (LMs) such as GPT2 still tend to generate repetitive texts with maximization-based decoding algorithms for open-ended generation. We attribute their overestimation of to…
External link:
http://arxiv.org/abs/2307.01542
Knowledge Distillation (KD) is a promising technique for reducing the high computational demand of large language models (LLMs). However, previous KD methods are primarily applied to white-box classification models or training small models to imitate…
External link:
http://arxiv.org/abs/2306.08543
As a main field of artificial intelligence, natural language processing (NLP) has achieved remarkable success via deep neural networks. Plenty of NLP tasks have been addressed in a unified manner, with various tasks being associated with each other t…
External link:
http://arxiv.org/abs/2306.04459
It has always been an important yet challenging problem to control language models to avoid generating texts with undesirable attributes, such as toxic language and unnatural repetition. We introduce Click for controllable text generation, which need…
External link:
http://arxiv.org/abs/2306.03350