Zobrazeno 1 - 10
of 13 377
pro vyhledávání: '"Restelli A"'
The comb-like spectrum added to laser light by an electro-optic modulator (EOM) finds use in a wide range of applications including coherent optical communication, laser frequency and phase stabilization, and atomic spectroscopy. In some cases a side
Externí odkaz:
http://arxiv.org/abs/2412.05411
Policy search methods are crucial in reinforcement learning, offering a framework to address continuous state-action and partially observable problems. However, the complexity of exploring vast policy spaces can lead to significant inefficiencies. Re
Externí odkaz:
http://arxiv.org/abs/2411.09900
Autor:
Liu, Puze, Günster, Jonas, Funk, Niklas, Gröger, Simon, Chen, Dong, Bou-Ammar, Haitham, Jankowski, Julius, Marić, Ante, Calinon, Sylvain, Orsula, Andrej, Olivares-Mendez, Miguel, Zhou, Hongyi, Lioutikov, Rudolf, Neumann, Gerhard, Zhalehmehrabi, Amarildo Likmeta Amirhossein, Bonenfant, Thomas, Restelli, Marcello, Tateo, Davide, Liu, Ziyuan, Peters, Jan
Machine learning methods have a groundbreaking impact in many application domains, but their application on real robotic platforms is still limited. Despite the many challenges associated with combining machine learning technology with robotics, robo
Externí odkaz:
http://arxiv.org/abs/2411.05718
Achieving the no-regret property for Reinforcement Learning (RL) problems in continuous state and action-space environments is one of the major open problems in the field. Existing solutions either work under very specific assumptions or achieve boun
Externí odkaz:
http://arxiv.org/abs/2410.24071
Policy evaluation via Monte Carlo (MC) simulation is at the core of many MC Reinforcement Learning (RL) algorithms (e.g., policy gradient methods). In this context, the designer of the learning system specifies an interaction budget that the agent us
Externí odkaz:
http://arxiv.org/abs/2410.13463
Autor:
Monaco, Vito Alessandro, Riva, Antonio, Sabbioni, Luca, Bisi, Lorenzo, Vittori, Edoardo, Pinciroli, Marco, Trapletti, Michele, Restelli, Marcello
In recent years, the popularity of artificial intelligence has surged due to its widespread application in various fields. The financial sector has harnessed its advantages for multiple purposes, including the development of automated trading systems
Externí odkaz:
http://arxiv.org/abs/2410.23294
Dealing with Partially Observable Markov Decision Processes is notably a challenging task. We face an average-reward infinite-horizon POMDP setting with an unknown transition model, where we assume the knowledge of the observation model. Under this a
Externí odkaz:
http://arxiv.org/abs/2410.01331
Publikováno v:
Il Foro Italiano, 2016 Jul 01. 139(7/8), 2409/2410-2413/2414.
Externí odkaz:
https://www.jstor.org/stable/44875766
Autor:
Tu, J., Restelli, A., Tsui, T. -C., Weber, K., Spielman, I. B., Rolston, S. L., Porto, J. V., Subhankar, S.
The Pound-Drever-Hall (PDH) technique is routinely used to stabilize the frequency of a laser to a reference cavity. The electronic sideband (ESB) locking scheme, a PDH variant, helps bridge the frequency difference between the quantized frequencies
Externí odkaz:
http://arxiv.org/abs/2409.08764
Autor:
Genalti, Gianmarco, Mussi, Marco, Gatti, Nicola, Restelli, Marcello, Castiglioni, Matteo, Metelli, Alberto Maria
Rested and Restless Bandits are two well-known bandit settings that are useful to model real-world sequential decision-making problems in which the expected reward of an arm evolves over time due to the actions we perform or due to the nature. In thi
Externí odkaz:
http://arxiv.org/abs/2409.05980