Výsledky vyhledávání

Report

Human-like object concept representations emerge naturally in multimodal large language models

Autor: Du, Changde, Fu, Kaicheng, Wen, Bincheng, Sun, Yi, Peng, Jie, Wei, Wei, Gao, Ying, Wang, Shengpei, Zhang, Chuncheng, Li, Jinpeng, Qiu, Shuang, Chang, Le, He, Huiguang

The conceptualization and categorization of natural objects in the human mind have long intrigued cognitive scientists and neuroscientists, offering crucial insights into human perception and cognition. Recently, the rapid development of Large Langua

Externí odkaz: http://arxiv.org/abs/2407.01067

Zobrazit plný text záznamu

Report

ROPO: Robust Preference Optimization for Large Language Models

Autor: Liang, Xize, Chen, Chao, Qiu, Shuang, Wang, Jie, Wu, Yue, Fu, Zhihang, Shi, Zhihao, Wu, Feng, Ye, Jieping

Preference alignment is pivotal for empowering large language models (LLMs) to generate helpful and harmless responses. However, the performance of preference alignment is highly sensitive to the prevalent noise in the preference data. Recent efforts

Externí odkaz: http://arxiv.org/abs/2404.04102

Zobrazit plný text záznamu

Report

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

Autor: Wang, Haoxiang, Lin, Yong, Xiong, Wei, Yang, Rui, Diao, Shizhe, Qiu, Shuang, Zhao, Han, Zhang, Tong

Fine-grained control over large language models (LLMs) remains a significant challenge, hindering their adaptability to diverse user needs. While Reinforcement Learning from Human Feedback (RLHF) shows promise in aligning LLMs, its reliance on scalar

Externí odkaz: http://arxiv.org/abs/2402.18571

Zobrazit plný text záznamu

Report

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

Autor: Yang, Rui, Pan, Xiaoman, Luo, Feng, Qiu, Shuang, Zhong, Han, Yu, Dong, Chen, Jianshu

We consider the problem of multi-objective alignment of foundation models with human preferences, which is a critical step towards helpful and harmless AI systems. However, it is generally costly and unstable to fine-tune large foundation models usin

Externí odkaz: http://arxiv.org/abs/2402.10207

Zobrazit plný text záznamu

Report

A Temporal-Spectral Fusion Transformer with Subject-specific Adapter for Enhancing RSVP-BCI Decoding

Autor: Li, Xujin, Wei, Wei, Qiu, Shuang, He, Huiguang

The Rapid Serial Visual Presentation (RSVP)-based Brain-Computer Interface (BCI) is an efficient technology for target retrieval using electroencephalography (EEG) signals. The performance improvement of traditional decoding methods relies on a subst

Externí odkaz: http://arxiv.org/abs/2401.06340

Zobrazit plný text záznamu

Report

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

Autor: Qiu, Shuang, Dai, Ziyu, Zhong, Han, Wang, Zhaoran, Yang, Zhuoran, Zhang, Tong

This paper investigates posterior sampling algorithms for competitive reinforcement learning (RL) in the context of general function approximations. Focusing on zero-sum Markov games (MGs) under two critical settings, namely self-play and adversarial

Externí odkaz: http://arxiv.org/abs/2310.19861

Zobrazit plný text záznamu

Report

StairNetV3: Depth-aware Stair Modeling using Deep Learning

Autor: Wang, Chen, Pei, Zhongcai, Qiu, Shuang, Wang, Yachun, Tang, Zhiyong

Vision-based stair perception can help autonomous mobile robots deal with the challenge of climbing stairs, especially in unfamiliar environments. To address the problem that current monocular vision methods are difficult to model stairs accurately w

Externí odkaz: http://arxiv.org/abs/2308.06715

Zobrazit plný text záznamu

Kniha

Gender and Family Practices : Living Apart Together Relationships in China. [elektronicky zdroj]

Autor: Qiu, Shuang

Externí odkaz: Kolekce e-knih KNAV (Registrovani uzivatele: plny text online 5 minut, dalsi pristup na vyzadani. Registered users: full text online 5 minutes, further access on request.)

Report

On the Value of Myopic Behavior in Policy Reuse

Autor: Xu, Kang, Bai, Chenjia, Qiu, Shuang, He, Haoran, Zhao, Bin, Wang, Zhen, Li, Wei, Li, Xuelong

Leveraging learned strategies in unfamiliar scenarios is fundamental to human intelligence. In reinforcement learning, rationally reusing the policies acquired from other tasks or human experts is critical for tackling problems that are difficult to

Externí odkaz: http://arxiv.org/abs/2305.17623

Zobrazit plný text záznamu

Akademický článek

Cold exposure-induced plasma exosomes impair bone mass by inhibiting autophagy

Publikováno v: Journal of Nanobiotechnology, Vol 22, Iss 1, Pp 1-21 (2024)

Abstract Recently, environmental temperature has been shown to regulate bone homeostasis. However, the mechanisms by which cold exposure affects bone mass remain unclear. In our present study, we observed that exposure to cold temperature (CT) decrea

Externí odkaz: https://doaj.org/article/3b2ab9d30b6147afb6c8b61824265322

Zobrazit plný text záznamu

Plný text ve formátu HTML

Vyhledávací nástroje:

Upřesnit hledání