Zobrazeno 1 - 10
of 110
pro vyhledávání: '"Yang, Kaige"'
Autor:
Yang, Kaige, Lu, Yunqi, Yue, Zhiguang, Jin, Sanjun, Wang, Ping, Liu, Chaoqi, Wang, Lijun, Yin, Qingqiang, Dang, Xiaowei, Guo, Hongwei, Chang, Juan
Publikováno v:
In Journal of Functional Foods November 2024 122
Publikováno v:
In Journal of Pharmaceutical and Biomedical Analysis 15 January 2025 253
Publikováno v:
In Analytica Chimica Acta 29 May 2024 1305
Autor:
Yang, Kaige
Transfer reinforcement learning aims to improve the sample efficiency of solving unseen new tasks by leveraging experiences obtained from previous tasks. We consider the setting where all tasks (MDPs) share the same environment dynamic except reward
Externí odkaz:
http://arxiv.org/abs/2101.02230
Autor:
Yang, Kaige1 (AUTHOR), Xie, Rongli2 (AUTHOR), Xiao, Guohui1 (AUTHOR), Zhao, Zhifeng3 (AUTHOR), Ding, Min2 (AUTHOR), Lin, Tingyu1 (AUTHOR), Tsang, Yiu Sing1 (AUTHOR), Chen, Ying4 (AUTHOR) bichatlion@163.com, Xu, Dan4 (AUTHOR) stephanie.xud@hotmail.com, Fei, Jian1,2,5 (AUTHOR) feijian@hotmail.com
Publikováno v:
Journal of Translational Medicine. 4/11/2024, Vol. 22 Issue 1, p1-16. 16p.
Autor:
Yang, Kaige, Toni, Laura
Upper Confidence Bound (UCB) is arguably the most commonly used method for linear multi-arm bandit problems. While conceptually and computationally simple, this method highly relies on the confidence bounds, failing to strike the optimal exploration-
Externí odkaz:
http://arxiv.org/abs/2006.03000
Autor:
Zhu, Lin, Chen, Yao, Miao, Maohua, Liang, Hong, Xi, Jianya, Wang, Yan, Yang, Kaige, Wang, Ziliang, Yuan, Wei
Publikováno v:
In Ecotoxicology and Environmental Safety 1 September 2023 262
We consider a stochastic linear bandit problem with multiple users, where the relationship between users is captured by an underlying graph and user preferences are represented as smooth signals on the graph. We introduce a novel bandit algorithm whe
Externí odkaz:
http://arxiv.org/abs/1907.05632
We provide a theoretical analysis of the representation learning problem aimed at learning the latent variables (design matrix) $\Theta$ of observations $Y$ with the knowledge of the coefficient matrix $X$. The design matrix is learned under the assu
Externí odkaz:
http://arxiv.org/abs/1902.03720
Autor:
Vu, Hoang Dung, Chai, Kok Soon, Keating, Bryan, Tursynbek, Nurislam, Xu, Boyan, Yang, Kaige, Yang, Xiaoyan, Zhang, Zhenjie
Refrigeration and chiller optimization is an important and well studied topic in mechanical engineering, mostly taking advantage of physical models, designed on top of over-simplified assumptions, over the equipments. Conventional optimization techni
Externí odkaz:
http://arxiv.org/abs/1812.00679