Search results - "Song, Feifan"

Report

Learning Spatial Similarity Distribution for Few-shot Object Counting

Authors: Xu, Yuanwu; Song, Feifan; Zhang, Haofeng

Few-shot object counting aims to count the number of objects in a query image that belong to the same class as the given exemplar images. Existing methods compute the similarity between the query image and exemplars in the 2D spatial domain and perfo

External link: http://arxiv.org/abs/2405.11770

Report

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment

Authors: Song, Feifan; Yu, Bowen; Lang, Hao; Yu, Haiyang; Huang, Fei; Wang, Houfeng; Li, Yongbin

Alignment with human preference prevents large language models (LLMs) from generating misleading or toxic content while requiring high-cost human feedback. Assuming resources of human annotation are limited, there are two different ways of allocating

External link: http://arxiv.org/abs/2403.11124

Report

ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization

Authors: Song, Feifan; Fan, Yuxuan; Zhang, Xin; Wang, Peiyi; Wang, Houfeng

Large Language Models (LLMs) rely on Human Preference Alignment (HPA) to ensure the generation of safe content. Due to the heavy cost associated with fine-tuning, fine-tuning-free methods have emerged, typically modifying LLM decoding with external a

External link: http://arxiv.org/abs/2402.09320

Report

Making Large Language Models Better Reasoners with Alignment

Authors: Wang, Peiyi; Li, Lei; Chen, Liang; Song, Feifan; Lin, Binghuai; Cao, Yunbo; Liu, Tianyu; Sui, Zhifang

Reasoning is a cognitive process of using evidence to reach a sound conclusion. The reasoning capability is essential for large language models (LLMs) to serve as the brain of the artificial general intelligence agent. Recent studies reveal that fine

External link: http://arxiv.org/abs/2309.02144

Report

Preference Ranking Optimization for Human Alignment

Authors: Song, Feifan; Yu, Bowen; Li, Minghao; Yu, Haiyang; Huang, Fei; Li, Yongbin; Wang, Houfeng

Large language models (LLMs) often contain misleading content, emphasizing the need to align them with human values to ensure secure AI systems. Reinforcement learning from human feedback (RLHF) has been employed to achieve this alignment. However, i

External link: http://arxiv.org/abs/2306.17492

Report

API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs

Authors: Li, Minghao; Zhao, Yingxiu; Yu, Bowen; Song, Feifan; Li, Hangyu; Yu, Haiyang; Li, Zhoujun; Huang, Fei; Li, Yongbin

Recent research has demonstrated that Large Language Models (LLMs) can enhance their capabilities by utilizing external tools. However, three pivotal questions remain unanswered: (1) How effective are current LLMs in utilizing tools? (2) How can we e

External link: http://arxiv.org/abs/2304.08244

Report

A Unified Framework for Multi-intent Spoken Language Understanding with prompting

Authors: Song, Feifan; Huang, Lianzhe; Wang, Houfeng

Multi-intent Spoken Language Understanding has great potential for widespread implementation. Jointly modeling Intent Detection and Slot Filling in it provides a channel to exploit the correlation between intents and slots. However, current approache

External link: http://arxiv.org/abs/2210.03337

Report

Interacting with Non-Cooperative User: A New Paradigm for Proactive Dialogue Policy

Authors: Lei, Wenqiang; Zhang, Yao; Song, Feifan; Liang, Hongru; Mao, Jiaxin; Lv, Jiancheng; Yang, Zhenglu; Chua, Tat-Seng

Proactive dialogue system is able to lead the conversation to a goal topic and has advantaged potential in bargain, persuasion and negotiation. Current corpus-based learning manner limits its practical application in real-world scenarios. To this end

External link: http://arxiv.org/abs/2204.07433
