Showing 1 - 10 of 1,048 for search: '"Li, Chengming"'
Author:
Li, Zixuan, Xiong, Jing, Ye, Fanghua, Zheng, Chuanyang, Wu, Xun, Lu, Jianqiao, Wan, Zhongwei, Liang, Xiaodan, Li, Chengming, Sun, Zhenan, Kong, Lingpeng, Wong, Ngai
We present UncertaintyRAG, a novel approach for long-context Retrieval-Augmented Generation (RAG) that utilizes Signal-to-Noise Ratio (SNR)-based span uncertainty to estimate similarity between text chunks. This span uncertainty enhances model calibration…
External link:
http://arxiv.org/abs/2410.02719
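The snippet names the core idea (an SNR-based span-uncertainty score used as a chunk-similarity signal) but not the estimator itself. Below is a minimal Python sketch of how such a score might be computed, assuming spans are scored by token log-probabilities from a base LM; the function names and the similarity rule are illustrative assumptions, not the paper's actual method:

```python
import math
from typing import List

def span_snr_uncertainty(token_logprobs: List[float]) -> float:
    """SNR of a span's token log-probabilities: |mean| over std.

    A stable, confident span (high SNR) is read as low uncertainty;
    a noisy span (low SNR) as high uncertainty.
    """
    n = len(token_logprobs)
    mean = sum(token_logprobs) / n
    std = math.sqrt(sum((lp - mean) ** 2 for lp in token_logprobs) / n)
    return abs(mean) / (std + 1e-8)  # epsilon guards zero-variance spans

def chunk_similarity(logprobs_a: List[float], logprobs_b: List[float]) -> float:
    """Hypothetical rule: chunks whose span-uncertainty scores are close
    are treated as similar for retrieval; maps the gap into (0, 1]."""
    diff = abs(span_snr_uncertainty(logprobs_a) - span_snr_uncertainty(logprobs_b))
    return 1.0 / (1.0 + diff)

# Example: two spans scored by a base LM (log-probabilities are made up).
print(chunk_similarity([-0.2, -0.3, -0.25], [-1.5, -0.1, -2.0]))
```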
Author:
Luo, Jing, Luo, Run, Chen, Longze, Zhu, Liang, Ao, Chang, Li, Jiaming, Chen, Yukun, Cheng, Xin, Yang, Wen, Su, Jiayuan, Li, Chengming, Yang, Min
While closed-source Large Language Models (LLMs) demonstrate strong mathematical problem-solving abilities, open-source models continue to struggle with such tasks. To bridge this gap, we propose a data augmentation approach and introduce PersonaMath…
External link:
http://arxiv.org/abs/2410.01504
With the rapid development of deep learning methods, there have been many breakthroughs in the field of text classification. Models developed for this task have been shown to achieve high accuracy. However, most of these models are trained using labeled data…
External link:
http://arxiv.org/abs/2409.13787
The success of Large Language Models (LLMs) relies heavily on the huge amount of pre-training data learned in the pre-training phase. The opacity of the pre-training process and the training data causes the results of many benchmark tests to become…
External link:
http://arxiv.org/abs/2409.01790
Since the invention of GPT-2 (1.5B) in 2019, large language models (LLMs) have transitioned from specialized models to versatile foundation models. LLMs exhibit impressive zero-shot ability; however, they require fine-tuning on local datasets and significant…
External link:
http://arxiv.org/abs/2408.10691
Large Language Models (LLMs) have demonstrated exceptional performance across various natural language processing tasks, yet they occasionally tend to yield content that is factually inaccurate or discordant with the expected output, a phenomenon empirically…
External link:
http://arxiv.org/abs/2408.08769
Author:
Chen, Guhong, Fan, Liyang, Gong, Zihan, Xie, Nan, Li, Zixuan, Liu, Ziqiang, Li, Chengming, Qu, Qiang, Ni, Shiwen, Yang, Min
In this paper, we present a simulation system called AgentCourt that simulates the entire courtroom process. The judge, plaintiff's lawyer, defense lawyer, and other participants are autonomous agents driven by large language models (LLMs). Our core…
External link:
http://arxiv.org/abs/2408.08089
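The abstract describes the architecture: courtroom roles realized as autonomous LLM-driven agents taking turns in a shared proceeding. A minimal sketch of such a loop, assuming a generic `llm(prompt) -> str` chat wrapper; the roles, prompts, and `run_trial` driver are hypothetical illustrations, not AgentCourt's actual code:

```python
from dataclasses import dataclass, field
from typing import Callable, List

# `LLM` stands in for any chat-completion call; its interface here
# (prompt string in, reply string out) is an assumption.
LLM = Callable[[str], str]

@dataclass
class CourtAgent:
    role: str                      # e.g. "judge", "plaintiff's lawyer"
    llm: LLM
    memory: List[str] = field(default_factory=list)

    def speak(self, transcript: str) -> str:
        prompt = (
            f"You are the {self.role} in a simulated courtroom.\n"
            f"Transcript so far:\n{transcript}\n"
            f"Respond in character with your next statement."
        )
        utterance = self.llm(prompt)
        self.memory.append(utterance)  # agents accumulate case experience
        return utterance

def run_trial(agents: List[CourtAgent], rounds: int = 3) -> str:
    """Round-robin courtroom loop: each agent reacts to the shared transcript."""
    transcript = ""
    for _ in range(rounds):
        for agent in agents:
            transcript += f"{agent.role}: {agent.speak(transcript)}\n"
    return transcript
```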
Author:
Hu, Yuxuan, Tan, Minghuan, Zhang, Chenwei, Li, Zixuan, Liang, Xiaodan, Yang, Min, Li, Chengming, Hu, Xiping
Empathetic response generation is designed to comprehend the emotions of others and select the most appropriate strategies to assist them in resolving emotional challenges. Empathy can be categorized into cognitive empathy and affective empathy…
External link:
http://arxiv.org/abs/2407.21048
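The snippet outlines a pipeline (recognize the emotion, select a support strategy, generate the reply) spanning the cognitive/affective empathy split it names. A hedged sketch of such a two-stage pipeline, again assuming a generic `llm(prompt) -> str` call; the strategy table and prompts are illustrative placeholders, not the paper's method:

```python
from typing import Callable

LLM = Callable[[str], str]  # generic chat call; the interface is an assumption

# Illustrative strategy table mixing the two empathy types the abstract names.
STRATEGIES = {
    "sadness": "affective: acknowledge and validate the feeling",
    "anxiety": "cognitive: reframe the problem and suggest a next step",
}

def empathetic_reply(llm: LLM, utterance: str) -> str:
    """Two-stage pipeline: recognize the emotion, then condition the
    reply on an emotion-appropriate support strategy."""
    emotion = llm(f"Name the dominant emotion in one word: {utterance!r}").strip().lower()
    strategy = STRATEGIES.get(emotion, "affective: acknowledge the feeling")
    return llm(
        f"The speaker feels {emotion}. Using the strategy '{strategy}', "
        f"write a short empathetic reply to: {utterance!r}"
    )
```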
Author:
Liang, Feng, Zhang, Zhen, Lu, Haifeng, Li, Chengming, Leung, Victor C. M., Guo, Yanyi, Hu, Xiping
With rapidly increasing distributed deep learning workloads in large-scale data centers, efficient distributed deep learning framework strategies for resource allocation and workload scheduling have become the key to high-performance deep learning…
External link:
http://arxiv.org/abs/2406.08115
Author:
Liu, Ziqiang, Fang, Feiteng, Feng, Xi, Du, Xinrun, Zhang, Chenhao, Wang, Zekun, Bai, Yuelin, Zhao, Qixuan, Fan, Liyang, Gan, Chengguang, Lin, Hongquan, Li, Jiaming, Ni, Yuansheng, Wu, Haihong, Narsupalli, Yaswanth, Zheng, Zhigang, Li, Chengming, Hu, Xiping, Xu, Ruifeng, Chen, Xiaojun, Yang, Min, Liu, Jiaheng, Liu, Ruibo, Huang, Wenhao, Zhang, Ge, Ni, Shiwen
The rapid advancements in the development of multimodal large language models (MLLMs) have consistently led to new breakthroughs on various benchmarks. In response, numerous challenging and comprehensive benchmarks have been proposed to more accurately…
External link:
http://arxiv.org/abs/2406.05862