Showing 1 - 10 of 2,779 for search: '"Dai, Yong"'
Author:
Lin, Fan, Xie, Shuyi, Dai, Yong, Yao, Wenlin, Lang, Tianjiao, Xu, Zishan, Hu, Zhichao, Xiao, Xiao, Liu, Yuhong, Zhang, Yu
As Large Language Models (LLMs) grow increasingly adept at managing complex tasks, the evaluation set must keep pace with these advancements to ensure it remains sufficiently discriminative. Item Discrimination (ID) theory, which is widely used in ed
External link:
http://arxiv.org/abs/2409.18892
Author:
Zhu, Zhilin, Hong, Xiaopeng, Ma, Zhiheng, Zhuang, Weijun, Ma, Yaohui, Dai, Yong, Wang, Yaowei
Continual Test-Time Adaptation (CTTA) involves adapting a pre-trained source model to continually changing unsupervised target domains. In this paper, we systematically analyze the challenges of this task: online environment, unsupervised nature, and
External link:
http://arxiv.org/abs/2407.09367
Contemporary continual learning approaches typically select prompts from a pool, which function as supplementary inputs to a pre-trained model. However, this strategy is hindered by the inherent noise of its selection approach when handling increasin
External link:
http://arxiv.org/abs/2404.18060
We explore the self-play training procedure of large language models (LLMs) in a two-player adversarial language game called Adversarial Taboo. In this game, an attacker and a defender communicate around a target word only visible to the attacker. Th
External link:
http://arxiv.org/abs/2404.10642
Over the past decade, a series of unflagging efforts have been dedicated to developing highly expressive and controllable text-to-speech (TTS) systems. In general, the holistic TTS comprises two interconnected components: the frontend module and the
External link:
http://arxiv.org/abs/2404.09192
Tool-augmented large language models (LLMs) are attracting widespread attention for their ability to access up-to-date knowledge and alleviate hallucination issues. Nowadays, advanced closed-source LLMs (e.g., ChatGPT) have demonstrated surprising tool-usage ca
External link:
http://arxiv.org/abs/2402.16696
Recently, the Class-Agnostic Counting (CAC) problem has garnered increasing attention owing to its intriguing generality and superior efficiency compared to Category-Specific Counting (CSC). This paper proposes a novel ExpressCount to enhance zero-shot o
External link:
http://arxiv.org/abs/2402.05394
While vision-language pre-trained models (VL-PTMs) have advanced multimodal research in recent years, their mastery in a few languages like English restricts their applicability in broader communities. To this end, there is an increasing interest in
External link:
http://arxiv.org/abs/2401.17186
Author:
He, Hongliang, Yao, Wenlin, Ma, Kaixin, Yu, Wenhao, Dai, Yong, Zhang, Hongming, Lan, Zhenzhong, Yu, Dong
The rapid advancement of large language models (LLMs) has led to a new era marked by the development of autonomous applications in real-world scenarios, which drives innovation in creating advanced web agents. Existing web agents typically only handl
External link:
http://arxiv.org/abs/2401.13919
Author:
Feng, Zhangyin, Hu, Runyi, Liu, Liangxin, Zhang, Fan, Tang, Duyu, Dai, Yong, Feng, Xiaocheng, Li, Jiwei, Qin, Bing, Shi, Shuming
Autoregressive and diffusion models drive the recent breakthroughs in text-to-image generation. Despite their huge success in generating highly realistic images, a common shortcoming of these models is their high inference latency - autoregressive mode
External link:
http://arxiv.org/abs/2312.14988