Zobrazeno 1 - 4
of 4
pro vyhledávání: '"Yang, Murun"'
Autor:
Wang, Chenglong, Gan, Yang, Huo, Yifu, Mu, Yongyu, He, Qiaozhi, Yang, Murun, Xiao, Tong, Zhang, Chunliang, Liu, Tongran, Zhu, Jingbo
To improve human-preference alignment training, current research has developed numerous preference datasets consisting of preference pairs labeled as "preferred" or "dispreferred". These preference pairs are typically used to encode human preferences
Externí odkaz:
http://arxiv.org/abs/2410.04503
Autor:
Wang, Chenglong, Gan, Yang, Huo, Yifu, Mu, Yongyu, Yang, Murun, He, Qiaozhi, Xiao, Tong, Zhang, Chunliang, Liu, Tongran, Du, Quan, Yang, Di, Zhu, Jingbo
Large vision-language models (LVLMs) often fail to align with human preferences, leading to issues like generating misleading content without proper visual context (also known as hallucination). A promising solution to this problem is using human-pre
Externí odkaz:
http://arxiv.org/abs/2408.12109
Autor:
Mu, Yongyu, Wu, Yuzhang, Fan, Yuchun, Wang, Chenglong, Li, Hengyu, He, Qiaozhi, Yang, Murun, Xiao, Tong, Zhu, Jingbo
As large language models (LLMs) evolve, the increase in model depth and parameter number leads to substantial redundancy. To enhance the efficiency of the attention mechanism, previous works primarily compress the KV cache or group attention heads, w
Externí odkaz:
http://arxiv.org/abs/2408.01890
Autor:
Xu, Chen, Liu, Xiaoqian, Liu, Xiaowen, Sun, Qingxuan, Zhang, Yuhao, Yang, Murun, Dong, Qianqian, Ko, Tom, Wang, Mingxuan, Xiao, Tong, Ma, Anxiang, Zhu, Jingbo
Combining end-to-end speech translation (ST) and non-autoregressive (NAR) generation is promising in language and speech processing for their advantages of less error propagation and low latency. In this paper, we investigate the potential of connect
Externí odkaz:
http://arxiv.org/abs/2305.17358