Showing 1 - 10 of 2,399 for search: '"Qiu, Shuang"'
Author:
Du, Changde, Fu, Kaicheng, Wen, Bincheng, Sun, Yi, Peng, Jie, Wei, Wei, Gao, Ying, Wang, Shengpei, Zhang, Chuncheng, Li, Jinpeng, Qiu, Shuang, Chang, Le, He, Huiguang
The conceptualization and categorization of natural objects in the human mind have long intrigued cognitive scientists and neuroscientists, offering crucial insights into human perception and cognition. Recently, the rapid development of Large Langua
External link:
http://arxiv.org/abs/2407.01067
Author:
Liang, Xize, Chen, Chao, Qiu, Shuang, Wang, Jie, Wu, Yue, Fu, Zhihang, Shi, Zhihao, Wu, Feng, Ye, Jieping
Preference alignment is pivotal for empowering large language models (LLMs) to generate helpful and harmless responses. However, the performance of preference alignment is highly sensitive to the prevalent noise in the preference data. Recent efforts
External link:
http://arxiv.org/abs/2404.04102
Author:
Wang, Haoxiang, Lin, Yong, Xiong, Wei, Yang, Rui, Diao, Shizhe, Qiu, Shuang, Zhao, Han, Zhang, Tong
Fine-grained control over large language models (LLMs) remains a significant challenge, hindering their adaptability to diverse user needs. While Reinforcement Learning from Human Feedback (RLHF) shows promise in aligning LLMs, its reliance on scalar
External link:
http://arxiv.org/abs/2402.18571
We consider the problem of multi-objective alignment of foundation models with human preferences, which is a critical step towards helpful and harmless AI systems. However, it is generally costly and unstable to fine-tune large foundation models usin
External link:
http://arxiv.org/abs/2402.10207
A Temporal-Spectral Fusion Transformer with Subject-specific Adapter for Enhancing RSVP-BCI Decoding
The Rapid Serial Visual Presentation (RSVP)-based Brain-Computer Interface (BCI) is an efficient technology for target retrieval using electroencephalography (EEG) signals. The performance improvement of traditional decoding methods relies on a subst
External link:
http://arxiv.org/abs/2401.06340
This paper investigates posterior sampling algorithms for competitive reinforcement learning (RL) in the context of general function approximations. Focusing on zero-sum Markov games (MGs) under two critical settings, namely self-play and adversarial
External link:
http://arxiv.org/abs/2310.19861
Vision-based stair perception can help autonomous mobile robots deal with the challenge of climbing stairs, especially in unfamiliar environments. To address the problem that current monocular vision methods are difficult to model stairs accurately w
External link:
http://arxiv.org/abs/2308.06715
Leveraging learned strategies in unfamiliar scenarios is fundamental to human intelligence. In reinforcement learning, rationally reusing the policies acquired from other tasks or human experts is critical for tackling problems that are difficult to
External link:
http://arxiv.org/abs/2305.17623
Author:
Li-Min Lei, Fu-Xing-Zi Li, Xiao Lin, Feng Xu, Su-Kang Shan, Bei Guo, Ming-Hui Zheng, Ke-Xin Tang, Yi Wang, Qiu-Shuang Xu, Wen-Lu Ouyang, Jia-Yue Duan, Yun-Yun Wu, Ye-Chi Cao, Zhi-Ang Zhou, Si-Yang He, Yan-Lin Wu, Xi Chen, Zheng-Jun Lin, Yi Pan, Ling-Qing Yuan, Zhi-Hong Li
Published in:
Journal of Nanobiotechnology, Vol 22, Iss 1, Pp 1-21 (2024)
Recently, environmental temperature has been shown to regulate bone homeostasis. However, the mechanisms by which cold exposure affects bone mass remain unclear. In our present study, we observed that exposure to cold temperature (CT) decrea
External link:
https://doaj.org/article/3b2ab9d30b6147afb6c8b61824265322