Search results

Report

LRHP: Learning Representations for Human Preferences via Preference Pairs

Authors: Wang, Chenglong; Gan, Yang; Huo, Yifu; Mu, Yongyu; He, Qiaozhi; Yang, Murun; Xiao, Tong; Zhang, Chunliang; Liu, Tongran; Zhu, Jingbo

To improve human-preference alignment training, current research has developed numerous preference datasets consisting of preference pairs labeled as "preferred" or "dispreferred". These preference pairs are typically used to encode human preferences

External link: http://arxiv.org/abs/2410.04503

Report

RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data

Authors: Wang, Chenglong; Gan, Yang; Huo, Yifu; Mu, Yongyu; Yang, Murun; He, Qiaozhi; Xiao, Tong; Zhang, Chunliang; Liu, Tongran; Du, Quan; Yang, Di; Zhu, Jingbo

Large vision-language models (LVLMs) often fail to align with human preferences, leading to issues like generating misleading content without proper visual context (also known as hallucination). A promising solution to this problem is using human-pre

External link: http://arxiv.org/abs/2408.12109

Report

Cross-layer Attention Sharing for Large Language Models

Authors: Mu, Yongyu; Wu, Yuzhang; Fan, Yuchun; Wang, Chenglong; Li, Hengyu; He, Qiaozhi; Yang, Murun; Xiao, Tong; Zhu, Jingbo

As large language models (LLMs) evolve, the increase in model depth and parameter number leads to substantial redundancy. To enhance the efficiency of the attention mechanism, previous works primarily compress the KV cache or group attention heads, w

External link: http://arxiv.org/abs/2408.01890

Report

CTC-based Non-autoregressive Speech Translation

Authors: Xu, Chen; Liu, Xiaoqian; Liu, Xiaowen; Sun, Qingxuan; Zhang, Yuhao; Yang, Murun; Dong, Qianqian; Ko, Tom; Wang, Mingxuan; Xiao, Tong; Ma, Anxiang; Zhu, Jingbo

Combining end-to-end speech translation (ST) and non-autoregressive (NAR) generation is promising in language and speech processing for their advantages of less error propagation and low latency. In this paper, we investigate the potential of connect

External link: http://arxiv.org/abs/2305.17358
