Výsledky vyhledávání - "He, PengCheng"

Report

Switchable Decision: Dynamic Neural Generation Networks

Autor: Zhang, Shujian, Tanwisuth, Korawat, Gong, Chengyue, He, Pengcheng, Zhou, Mingyuan

Auto-regressive generation models achieve competitive performance across many different NLP tasks such as summarization, question answering, and classifications. However, they are also known for being slow in inference, which makes them challenging t

Externí odkaz: http://arxiv.org/abs/2405.04513

Zobrazit plný text záznamu

Report

Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective

Autor: Zhong, Ming, An, Chenxin, Chen, Weizhu, Han, Jiawei, He, Pengcheng

Large Language Models (LLMs) inherently encode a wealth of knowledge within their parameters through pre-training on extensive corpora. While prior research has delved into operations on these parameters to manipulate the underlying implicit knowledg

Externí odkaz: http://arxiv.org/abs/2310.11451

Zobrazit plný text záznamu

Report

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Autor: Li, Yixiao, Yu, Yifan, Liang, Chen, He, Pengcheng, Karampatziakis, Nikos, Chen, Weizhu, Zhao, Tuo

Quantization is an indispensable technique for serving Large Language Models (LLMs) and has recently found its way into LoRA fine-tuning. In this work we focus on the scenario where quantization and LoRA fine-tuning are applied together on a pre-trai

Externí odkaz: http://arxiv.org/abs/2310.08659

Zobrazit plný text záznamu

Report

Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling

Autor: Zheng, Huangjie, Wang, Zhendong, Yuan, Jianbo, Ning, Guanghan, He, Pengcheng, You, Quanzeng, Yang, Hongxia, Zhou, Mingyuan

Diffusion models excel at generating photo-realistic images but come with significant computational costs in both training and sampling. While various techniques address these computational challenges, a less-explored issue is designing an efficient

Externí odkaz: http://arxiv.org/abs/2310.06389

Zobrazit plný text záznamu

Report

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Autor: Chuang, Yung-Sung, Xie, Yujia, Luo, Hongyin, Kim, Yoon, Glass, James, He, Pengcheng

Despite their impressive capabilities, large language models (LLMs) are prone to hallucinations, i.e., generating content that deviates from facts seen during pretraining. We propose a simple decoding strategy for reducing hallucinations with pretrai

Externí odkaz: http://arxiv.org/abs/2309.03883

Zobrazit plný text záznamu

Report

Deep Reinforcement Learning from Hierarchical Preference Design

Autor: Bukharin, Alexander, Li, Yixiao, He, Pengcheng, Zhao, Tuo

Reward design is a fundamental, yet challenging aspect of reinforcement learning (RL). Researchers typically utilize feedback signals from the environment to handcraft a reward function, but this process is not always effective due to the varying sca

Externí odkaz: http://arxiv.org/abs/2309.02632

Zobrazit plný text záznamu

Report

Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection

Autor: Li, Zekun, Peng, Baolin, He, Pengcheng, Yan, Xifeng

Large Language Models (LLMs) have shown remarkable proficiency in following instructions, making them valuable in customer-facing applications. However, their impressive capabilities also raise concerns about the amplification of risks posed by adver

Externí odkaz: http://arxiv.org/abs/2308.10819

Zobrazit plný text záznamu

Report

Summaries, Highlights, and Action items: Design, implementation and evaluation of an LLM-powered meeting recap system

Autor: Asthana, Sumit, Hilleli, Sagih, He, Pengcheng, Halfaker, Aaron

Meetings play a critical infrastructural role in the coordination of work. In recent years, due to shift to hybrid and remote work, more meetings are moving to online Computer Mediated Spaces. This has led to new problems (e.g. more time spent in les

Externí odkaz: http://arxiv.org/abs/2307.15793

Zobrazit plný text záznamu

Report

LoSparse: Structured Compression of Large Language Models based on Low-Rank and Sparse Approximation

Autor: Li, Yixiao, Yu, Yifan, Zhang, Qingru, Liang, Chen, He, Pengcheng, Chen, Weizhu, Zhao, Tuo

Transformer models have achieved remarkable results in various natural language tasks, but they are often prohibitively large, requiring massive memories and computational resources. To reduce the size and complexity of these models, we propose LoSpa

Externí odkaz: http://arxiv.org/abs/2306.11222

Zobrazit plný text záznamu

Report

Interactive Editing for Text Summarization

Autor: Xie, Yujia, Wang, Xun, Chen, Si-Qing, Xiong, Wayne, He, Pengcheng

Summarizing lengthy documents is a common and essential task in our daily lives. Although recent advancements in neural summarization models can assist in crafting general-purpose summaries, human writers often have specific requirements that call fo

Externí odkaz: http://arxiv.org/abs/2306.03067

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání