Search results

Report

Learn Dynamic-Aware State Embedding for Transfer Learning

Author: Yang, Kaige

Transfer reinforcement learning aims to improve the sample efficiency of solving unseen new tasks by leveraging experiences obtained from previous tasks. We consider the setting where all tasks (MDPs) share the same environment dynamic except reward

External link: http://arxiv.org/abs/2101.02230

Academic article

The integration of single-cell and bulk RNA-seq atlas reveals ERS-mediated acinar cell damage in acute pancreatitis.

Authors: Yang, Kaige¹ (AUTHOR); Xie, Rongli² (AUTHOR); Xiao, Guohui¹ (AUTHOR); Zhao, Zhifeng³ (AUTHOR); Ding, Min² (AUTHOR); Lin, Tingyu¹ (AUTHOR); Tsang, Yiu Sing¹ (AUTHOR); Chen, Ying⁴ (AUTHOR) bichatlion@163.com; Xu, Dan⁴ (AUTHOR) stephanie.xud@hotmail.com; Fei, Jian¹,²,⁵ (AUTHOR) feijian@hotmail.com

Published in: Journal of Translational Medicine. 4/11/2024, Vol. 22 Issue 1, p1-16. 16p.

Report

Differentiable Linear Bandit Algorithm

Authors: Yang, Kaige; Toni, Laura

Upper Confidence Bound (UCB) is arguably the most commonly used method for linear multi-arm bandit problems. While conceptually and computationally simple, this method highly relies on the confidence bounds, failing to strike the optimal exploration-

External link: http://arxiv.org/abs/2006.03000

Academic article

Prenatal exposures to isoflavones and neurobehavioral development in children at 2 and 4 years of age: A birth cohort study

Authors: Zhu, Lin; Chen, Yao; Miao, Maohua; Liang, Hong; Xi, Jianya; Wang, Yan; Yang, Kaige; Wang, Ziliang; Yuan, Wei

Published in: Ecotoxicology and Environmental Safety, 1 September 2023, 262

Report

Laplacian-regularized graph bandits: Algorithms and theoretical analysis

Authors: Yang, Kaige; Dong, Xiaowen; Toni, Laura

We consider a stochastic linear bandit problem with multiple users, where the relationship between users is captured by an underlying graph and user preferences are represented as smooth signals on the graph. We introduce a novel bandit algorithm whe

External link: http://arxiv.org/abs/1907.05632

Report

Error Analysis on Graph Laplacian Regularized Estimator

Authors: Yang, Kaige; Dong, Xiaowen; Toni, Laura

We provide a theoretical analysis of the representation learning problem aimed at learning the latent variables (design matrix) $\Theta$ of observations $Y$ with the knowledge of the coefficient matrix $X$. The design matrix is learned under the assu

External link: http://arxiv.org/abs/1902.03720

Report

Data Driven Chiller Plant Energy Optimization with Domain Knowledge

Authors: Vu, Hoang Dung; Chai, Kok Soon; Keating, Bryan; Tursynbek, Nurislam; Xu, Boyan; Yang, Kaige; Yang, Xiaoyan; Zhang, Zhenjie

Refrigeration and chiller optimization is an important and well studied topic in mechanical engineering, mostly taking advantage of physical models, designed on top of over-simplified assumptions, over the equipments. Conventional optimization techni

External link: http://arxiv.org/abs/1812.00679
