Showing 1 - 10 of 4,559 for search: '"Vinitsky A"'
Author:
Yu, Chao, Lu, Hong, Gao, Jiaxuan, Tan, Qixin, Yang, Xinting, Wang, Yu, Wu, Yi, Vinitsky, Eugene
Designing reward functions is a core component of reinforcement learning but can be challenging for truly complex behavior. Reinforcement Learning from Human Feedback (RLHF) has been used to alleviate this challenge by replacing a hand-coded reward f
External link:
http://arxiv.org/abs/2410.17233
Multi-agent learning algorithms have been successful at generating superhuman planning in various games but have had limited impact on the design of deployed multi-agent planners. A key bottleneck in applying these techniques to multi-agent planning
External link:
http://arxiv.org/abs/2408.01584
Author:
Orte, Peter
Published in:
Pushkin Review / Пушкинский вестник, 2020 Jan 01. 22/23, 193-196.
External link:
https://www.jstor.org/stable/48674699
Author:
Cornelisse, Daphne, Vinitsky, Eugene
A central challenge for autonomous vehicles is coordinating with humans. Therefore, incorporating realistic human agents is essential for scalable training and evaluation of autonomous driving systems in simulation. Simulation agents are typically de
External link:
http://arxiv.org/abs/2403.19648
Author:
Peschio, Joe
Published in:
The Slavic and East European Journal, 2019 Oct 01. 63(3), 431-433.
External link:
https://www.jstor.org/stable/45408551
Author:
Jang, Kathy, Lichtlé, Nathan, Vinitsky, Eugene, Shah, Adit, Bunting, Matthew, Nice, Matthew, Piccoli, Benedetto, Seibold, Benjamin, Work, Daniel B., Monache, Maria Laura Delle, Sprinkle, Jonathan, Lee, Jonathan W., Bayen, Alexandre M.
In this article, we explore the technical details of the reinforcement learning (RL) algorithms that were deployed in the largest field test of automated vehicles designed to smooth traffic flow in history as of 2023, uncovering the challenges and br
External link:
http://arxiv.org/abs/2402.17050
Author:
Lee, Jonathan W., Wang, Han, Jang, Kathy, Hayat, Amaury, Bunting, Matthew, Alanqary, Arwa, Barbour, William, Fu, Zhe, Gong, Xiaoqian, Gunter, George, Hornstein, Sharon, Kreidieh, Abdul Rahman, Lichtlé, Nathan, Nice, Matthew W., Richardson, William A., Shah, Adit, Vinitsky, Eugene, Wu, Fangyu, Xiang, Shengquan, Almatrudi, Sulaiman, Althukair, Fahd, Bhadani, Rahul, Carpio, Joy, Chekroun, Raphael, Cheng, Eric, Chiri, Maria Teresa, Chou, Fang-Chieh, Delorenzo, Ryan, Gibson, Marsalis, Gloudemans, Derek, Gollakota, Anish, Ji, Junyi, Keimer, Alexander, Khoudari, Nour, Mahmood, Malaika, Mahmood, Mikail, Matin, Hossein Nick Zinat, Mcquade, Sean, Ramadan, Rabie, Urieli, Daniel, Wang, Xia, Wang, Yanbing, Xu, Rita, Yao, Mengsha, You, Yiling, Zachár, Gergely, Zhao, Yibo, Ameli, Mostafa, Baig, Mirza Najamuddin, Bhaskaran, Sarah, Butts, Kenneth, Gowda, Manasi, Janssen, Caroline, Lee, John, Pedersen, Liam, Wagner, Riley, Zhang, Zimo, Zhou, Chang, Work, Daniel B., Seibold, Benjamin, Sprinkle, Jonathan, Piccoli, Benedetto, Monache, Maria Laura Delle, Bayen, Alexandre M.
The CIRCLES project aims to reduce instabilities in traffic flow, which are naturally occurring phenomena due to human driving behavior. These "phantom jams" or "stop-and-go waves," are a significant source of wasted energy. Toward this goal, the CIRC
External link:
http://arxiv.org/abs/2402.17043
This review outlines the main results which show the dual nature of the chemical bond in the diatomic beryllium molecule in the ground $X^1\Sigma_g^+$ state. It has been shown that the beryllium atoms are covalently bound at low-lying vibrational energy
External link:
http://arxiv.org/abs/2311.07378
Author:
Mediratta, Ishita, Jiang, Minqi, Parker-Holder, Jack, Dennis, Michael, Vinitsky, Eugene, Rocktäschel, Tim
A key challenge in training generally-capable agents is the design of training tasks that facilitate broad generalization and robustness to environment variations. This challenge motivates the problem setting of Unsupervised Environment Design (UED),
External link:
http://arxiv.org/abs/2308.10797
Author:
Golburt, Luba
Published in:
The Russian Review, 2017 Jan 01. 76(1), 135-136.
External link:
https://www.jstor.org/stable/45097280