Showing 1 - 10 of 523 results for search: '"Huang, Zeyu"'
Reinforcement Learning from Human Feedback aligns the outputs of Large Language Models with human values and preferences. Central to this process is the reward model (RM), which translates human feedback into training signals for optimising LLM behav…
External link:
http://arxiv.org/abs/2409.17407
The scaling of large language models (LLMs) has revolutionized their capabilities in various tasks, yet this growth must be matched with efficient computational strategies. The Mixture-of-Experts (MoE) architecture stands out for its ability to scale…
External link:
http://arxiv.org/abs/2408.06793
Interdisciplinary studies often require researchers to explore literature in diverse branches of knowledge. Yet, navigating through the highly scattered knowledge from unfamiliar disciplines poses a significant challenge. In this paper, we introduce…
External link:
http://arxiv.org/abs/2408.00447
In this paper, we introduce a novel method called FRI-Net for 2D floorplan reconstruction from 3D point cloud. Existing methods typically rely on corner regression or box regression, which lack consideration for the global shapes of rooms. To address…
External link:
http://arxiv.org/abs/2407.10687
Convolutional neural networks (CNNs), one of the key architectures of deep learning models, have achieved superior performance on many machine learning tasks such as image classification, video recognition, and power systems. Despite their success, C…
External link:
http://arxiv.org/abs/2407.11031
Large Language Models (LLMs) excel in fluency but risk producing inaccurate content, called "hallucinations." This paper outlines a standardized process for categorizing fine-grained hallucination types and proposes an innovative framework--the Progr…
External link:
http://arxiv.org/abs/2407.00488
Mixture-of-experts (MoE) is gaining increasing attention due to its unique properties and remarkable performance, especially for language tasks. By sparsely activating a subset of parameters for each token, MoE architecture could increase the model s…
External link:
http://arxiv.org/abs/2406.18219
Author:
Du, Wenyu, Cheng, Shuang, Luo, Tongxu, Qiu, Zihan, Huang, Zeyu, Cheung, Ka Chun, Cheng, Reynold, Fu, Jie
Language models (LMs) exhibit impressive performance and generalization capabilities. However, LMs struggle with the persistent challenge of catastrophic forgetting, which undermines their long-term sustainability in continual learning (CL). Existing…
External link:
http://arxiv.org/abs/2406.17245
Author:
Du, Wenyu, Luo, Tongxu, Qiu, Zihan, Huang, Zeyu, Shen, Yikang, Cheng, Reynold, Guo, Yike, Fu, Jie
LLMs are computationally expensive to pre-train due to their large scale. Model growth emerges as a promising approach by leveraging smaller models to accelerate the training of larger ones. However, the viability of these model growth methods in eff…
External link:
http://arxiv.org/abs/2405.15319
In this paper, we introduce a new method for the task of interaction transfer. Given an example interaction between a source object and an agent, our method can automatically infer both surface and spatial relationships for the agent and target objec…
External link:
http://arxiv.org/abs/2405.03221