Zobrazeno 1 - 10
of 11 773
pro vyhledávání: '"SHEN, Yi"'
Autor:
Shen, Yi, Huang, Hanyan
Offline reinforcement learning has received extensive attention from scholars because it avoids the interaction between the agent and the environment by learning a policy through a static dataset. However, general reinforcement learning methods canno
Externí odkaz:
http://arxiv.org/abs/2411.04534
Strong substrate reflections and complex scattering effects present significant challenges for diffraction tomography in metrology and inspection applications. To address these issues, we introduce a reflection-mode diffraction tomography technique f
Externí odkaz:
http://arxiv.org/abs/2411.04369
Autor:
Sun, Xingwu, Chen, Yanfeng, Huang, Yiqing, Xie, Ruobing, Zhu, Jiaqi, Zhang, Kai, Li, Shuaipeng, Yang, Zhen, Han, Jonny, Shu, Xiaobo, Bu, Jiahao, Chen, Zhongzhi, Huang, Xuemeng, Lian, Fengzong, Yang, Saiyong, Yan, Jianfeng, Zeng, Yuyuan, Ren, Xiaoqin, Yu, Chao, Wu, Lulu, Mao, Yue, Xia, Jun, Yang, Tao, Zheng, Suncong, Wu, Kan, Jiao, Dian, Xue, Jinbao, Zhang, Xipeng, Wu, Decheng, Liu, Kai, Wu, Dengpeng, Xu, Guanghui, Chen, Shaohua, Chen, Shuang, Feng, Xiao, Hong, Yigeng, Zheng, Junqiang, Xu, Chengcheng, Li, Zongwei, Kuang, Xiong, Hu, Jianglu, Chen, Yiqi, Deng, Yuchi, Li, Guiyang, Liu, Ao, Zhang, Chenchen, Hu, Shihui, Zhao, Zilong, Wu, Zifan, Ding, Yao, Wang, Weichao, Liu, Han, Wang, Roberts, Fei, Hao, Yu, Peijie, Zhao, Ze, Cao, Xun, Wang, Hai, Xiang, Fusheng, Huang, Mengyuan, Xiong, Zhiyuan, Hu, Bin, Hou, Xuebin, Jiang, Lei, Ma, Jianqiang, Wu, Jiajia, Deng, Yaping, Shen, Yi, Wang, Qian, Liu, Weijie, Liu, Jie, Chen, Meng, Dong, Liang, Jia, Weiwen, Chen, Hu, Liu, Feifei, Yuan, Rui, Xu, Huilin, Yan, Zhenxiang, Cao, Tengfei, Hu, Zhichao, Feng, Xinhua, Du, Dong, Yu, Tinghao, Tao, Yangyu, Zhang, Feng, Zhu, Jianchen, Xu, Chengzhong, Li, Xirui, Zha, Chong, Ouyang, Wen, Xia, Yinben, Li, Xiang, He, Zekun, Chen, Rongpeng, Song, Jiawei, Chen, Ruibin, Jiang, Fan, Zhao, Chongqing, Wang, Bo, Gong, Hao, Gan, Rong, Hu, Winston, Kang, Zhanhui, Yang, Yong, Liu, Yuhong, Wang, Di, Jiang, Jie
In this paper, we introduce Hunyuan-Large, which is currently the largest open-source Transformer-based mixture of experts model, with a total of 389 billion parameters and 52 billion activation parameters, capable of handling up to 256K tokens. We c
Externí odkaz:
http://arxiv.org/abs/2411.02265
Autor:
Hagiwara, Kenta, Chen, Ying-Jiun, Go, Dongwook, Tan, Xin Liang, Grytsiuk, Sergii, Yang, Kui-Hon Ou, Shu, Guo-Jiun, Chien, Jing, Shen, Yi-Hsin, Huang, Xiang-Lin, Chou, Fang-Cheng, Cojocariu, Iulia, Feyer, Vitaliy, Lin, Minn-Tsong, Blügel, Stefan, Schneider, Claus Michael, Mokrousov, Yuriy, Tusche, Christian
Chirality is ubiquitous in nature and manifests in a wide range of phenomena including chemical reactions, biological processes, and quantum transport of electrons. In quantum materials, the chirality of fermions, given by the relative directions bet
Externí odkaz:
http://arxiv.org/abs/2410.20607
Autor:
Sivakumar, Kavinayan P., Shen, Yi, Bell, Zachary, Nivison, Scott, Chen, Boyuan, Zavlanos, Michael M.
In this paper, we study an inverse reinforcement learning problem that involves learning the reward function of a learning agent using trajectory data collected while this agent is learning its optimal policy. To address this problem, we propose an i
Externí odkaz:
http://arxiv.org/abs/2410.14135
Autor:
Konti, Xenia, Riess, Hans, Giannopoulos, Manos, Shen, Yi, Pencina, Michael J., Economou-Zavlanos, Nicoleta J., Zavlanos, Michael M.
In this paper, we address the challenge of heterogeneous data distributions in cross-silo federated learning by introducing a novel algorithm, which we term Cross-silo Robust Clustered Federated Learning (CS-RCFL). Our approach leverages the Wasserst
Externí odkaz:
http://arxiv.org/abs/2410.07039
Advancements in deep multi-agent reinforcement learning (MARL) have positioned it as a promising approach for decision-making in cooperative games. However, it still remains challenging for MARL agents to learn cooperative strategies for some game en
Externí odkaz:
http://arxiv.org/abs/2410.03997
Search-based motion planning algorithms have been widely utilized for unmanned aerial vehicles (UAVs). However, deploying these algorithms on real UAVs faces challenges due to limited onboard computational resources. The algorithms struggle to find s
Externí odkaz:
http://arxiv.org/abs/2410.01230
The core of quantum metrology lies in utilizing entanglement to enhance measurement precision beyond standard quantum limit. Here, we utilize the Floquet-engineered two-axis twisting (TAT) and turn dynamics to generate non-Gaussian states for quantum
Externí odkaz:
http://arxiv.org/abs/2409.08524
A recent analysis of the LHCb data [Phys. Rev. D 105 (2022) L031503] obtained a sizable negative effective range for the $X(3872)$. This has attracted intensive discussions on whether $X(3872)$ can be deemed as a $D\bar{D}^*$ molecular state. This wo
Externí odkaz:
http://arxiv.org/abs/2409.06409