Výsledky vyhledávání

Report

Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye

Autor: Sun, Yirong, Zhu, Dawei, Chen, Yanjun, Xiao, Erjia, Chen, Xinghao, Shen, Xiaoyu

Large language models (LLMs) have excelled in various NLP tasks, including machine translation (MT), yet most studies focus on sentence-level translation. This work investigates the inherent capability of instruction-tuned LLMs for document-level tra

Externí odkaz: http://arxiv.org/abs/2410.20941

Zobrazit plný text záznamu

Report

AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories

Autor: Song, Yifan, Xiong, Weimin, Zhao, Xiutian, Zhu, Dawei, Wu, Wenhao, Wang, Ke, Li, Cheng, Peng, Wei, Li, Sujian

Fine-tuning on agent-environment interaction trajectory data holds significant promise for surfacing generalized agent capabilities in open-source large language models (LLMs). In this work, we introduce AgentBank, by far the largest trajectory tunin

Externí odkaz: http://arxiv.org/abs/2410.07706

Zobrazit plný text záznamu

Report

To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimodal Large Language Models

Autor: Lin, Junyan, Chen, Haoran, Zhu, Dawei, Shen, Xiaoyu

In recent years, multimodal large language models (MLLMs) have garnered significant attention from both industry and academia. However, there is still considerable debate on constructing MLLM architectures, particularly regarding the selection of app

Externí odkaz: http://arxiv.org/abs/2410.06765

Zobrazit plný text záznamu

Report

The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models

Autor: Chen, Yanjun, Zhu, Dawei, Sun, Yirong, Chen, Xinghao, Zhang, Wei, Shen, Xiaoyu

Reinforcement Learning from Human Feedback significantly enhances Natural Language Processing by aligning language models with human expectations. A critical factor in this alignment is the strength of reward models used during training. This study e

Externí odkaz: http://arxiv.org/abs/2410.06554

Zobrazit plný text záznamu

Report

From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks

Autor: Stephan, Andreas, Zhu, Dawei, Aßenmacher, Matthias, Shen, Xiaoyu, Roth, Benjamin

To reduce the need for human annotations, large language models (LLMs) have been proposed as judges of the quality of other candidate models. LLM judges are typically evaluated by measuring the correlation with human judgments on generation tasks suc

Externí odkaz: http://arxiv.org/abs/2409.04168

Zobrazit plný text záznamu

Report

Assessing 'Implicit' Retrieval Robustness of Large Language Models

Autor: Shen, Xiaoyu, Blloshmi, Rexhina, Zhu, Dawei, Pei, Jiahuan, Zhang, Wei

Retrieval-augmented generation has gained popularity as a framework to enhance large language models with external knowledge. However, its effectiveness hinges on the retrieval robustness of the model. If the model lacks retrieval robustness, its per

Externí odkaz: http://arxiv.org/abs/2406.18134

Zobrazit plný text záznamu

Report

EERPD: Leveraging Emotion and Emotion Regulation for Improving Personality Detection

Autor: Li, Zheng, Zhu, Dawei, Ma, Qilong, Xiong, Weimin, Li, Sujian

Personality is a fundamental construct in psychology, reflecting an individual's behavior, thinking, and emotional patterns. Previous researches have made some progress in personality detection, primarily by utilizing the whole text to predict person

Externí odkaz: http://arxiv.org/abs/2406.16079

Zobrazit plný text záznamu

Report

InternLM-Law: An Open Source Chinese Legal Large Language Model

Autor: Fei, Zhiwei, Zhang, Songyang, Shen, Xiaoyu, Zhu, Dawei, Wang, Xiao, Cao, Maosong, Zhou, Fengzhe, Li, Yining, Zhang, Wenwei, Lin, Dahua, Chen, Kai, Ge, Jidong

While large language models (LLMs) have showcased impressive capabilities, they struggle with addressing legal queries due to the intricate complexities and specialized expertise required in the legal field. In this paper, we introduce InternLM-Law,

Externí odkaz: http://arxiv.org/abs/2406.14887

Zobrazit plný text záznamu

Report

Long Context Alignment with Short Instructions and Synthesized Positions

Autor: Wu, Wenhao, Wang, Yizhong, Fu, Yao, Yue, Xiang, Zhu, Dawei, Li, Sujian

Effectively handling instructions with extremely long context remains a challenge for Large Language Models (LLMs), typically necessitating high-quality long data and substantial computational resources. This paper introduces Step-Skipping Alignment

Externí odkaz: http://arxiv.org/abs/2405.03939

Zobrazit plný text záznamu

Report

Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice?

Autor: Zhu, Dawei, Chen, Pinzhen, Zhang, Miaoran, Haddow, Barry, Shen, Xiaoyu, Klakow, Dietrich

Traditionally, success in multilingual machine translation can be attributed to three key factors in training data: large volume, diverse translation directions, and high quality. In the current practice of fine-tuning large language models (LLMs) fo

Externí odkaz: http://arxiv.org/abs/2404.14122

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání