Výsledky vyhledávání

Report

Spectrally-Pure Optical Serrodyne Modulation for Continuously-Tunable Laser Offset Locking

Autor: Hildebrand, Roame A., Restelli, Alessandro, Wang, Wance, Goham, Connor, Britton, Joseph W.

The comb-like spectrum added to laser light by an electro-optic modulator (EOM) finds use in a wide range of applications including coherent optical communication, laser frequency and phase stabilization, and atomic spectroscopy. In some cases a side

Externí odkaz: http://arxiv.org/abs/2412.05411

Zobrazit plný text záznamu

Report

Statistical Analysis of Policy Space Compression Problem

Autor: Molaei, Majid, Restelli, Marcello, Metelli, Alberto Maria, Papini, Matteo

Policy search methods are crucial in reinforcement learning, offering a framework to address continuous state-action and partially observable problems. However, the complexity of exploring vast policy spaces can lead to significant inefficiencies. Re

Externí odkaz: http://arxiv.org/abs/2411.09900

Zobrazit plný text záznamu

Report

A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics

Machine learning methods have a groundbreaking impact in many application domains, but their application on real robotic platforms is still limited. Despite the many challenges associated with combining machine learning technology with robotics, robo

Externí odkaz: http://arxiv.org/abs/2411.05718

Zobrazit plný text záznamu

Report

Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs

Autor: Maran, Davide, Metelli, Alberto Maria, Papini, Matteo, Restelli, Marcello

Achieving the no-regret property for Reinforcement Learning (RL) problems in continuous state and action-space environments is one of the major open problems in the field. Existing solutions either work under very specific assumptions or achieve boun

Externí odkaz: http://arxiv.org/abs/2410.24071

Zobrazit plný text záznamu

Report

Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach

Autor: Poiani, Riccardo, Nobili, Nicole, Metelli, Alberto Maria, Restelli, Marcello

Policy evaluation via Monte Carlo (MC) simulation is at the core of many MC Reinforcement Learning (RL) algorithms (e.g., policy gradient methods). In this context, the designer of the learning system specifies an interaction budget that the agent us

Externí odkaz: http://arxiv.org/abs/2410.13463

Zobrazit plný text záznamu

Report

Exploiting Risk-Aversion and Size-dependent fees in FX Trading with Fitted Natural Actor-Critic

Autor: Monaco, Vito Alessandro, Riva, Antonio, Sabbioni, Luca, Bisi, Lorenzo, Vittori, Edoardo, Pinciroli, Marco, Trapletti, Michele, Restelli, Marcello

In recent years, the popularity of artificial intelligence has surged due to its widespread application in various fields. The financial sector has harnessed its advantages for multiple purposes, including the development of automated trading systems

Externí odkaz: http://arxiv.org/abs/2410.23294

Zobrazit plný text záznamu

Report

Efficient Learning of POMDPs with Known Observation Model in Average-Reward Setting

Autor: Russo, Alessio, Metelli, Alberto Maria, Restelli, Marcello

Dealing with Partially Observable Markov Decision Processes is notably a challenging task. We face an average-reward infinite-horizon POMDP setting with an unknown transition model, where we assume the knowledge of the observation model. Under this a

Externí odkaz: http://arxiv.org/abs/2410.01331

Zobrazit plný text záznamu

Akademický článek

sezione I civile; sentenza 26 maggio 2016, n. 10937; Pres. Dogliotti, Est. Nazzicone, P.M. Patrone (concl. conf.); Restelli (Avv. Rapanà) c. Manzoni e altri (Avv. Bernardini). Conferma App. Milano 19 agosto 2010

Publikováno v: Il Foro Italiano, 2016 Jul 01. 139(7/8), 2409/2410-2413/2414.

Externí odkaz: https://www.jstor.org/stable/44875766

Zobrazit plný text záznamu

Report

Quadrature amplitude modulation for electronic sideband Pound-Drever-Hall locking

Autor: Tu, J., Restelli, A., Tsui, T. -C., Weber, K., Spielman, I. B., Rolston, S. L., Porto, J. V., Subhankar, S.

The Pound-Drever-Hall (PDH) technique is routinely used to stabilize the frequency of a laser to a reference cavity. The electronic sideband (ESB) locking scheme, a PDH variant, helps bridge the frequency difference between the quantized frequencies

Externí odkaz: http://arxiv.org/abs/2409.08764

Zobrazit plný text záznamu

Report

Bridging Rested and Restless Bandits with Graph-Triggering: Rising and Rotting

Autor: Genalti, Gianmarco, Mussi, Marco, Gatti, Nicola, Restelli, Marcello, Castiglioni, Matteo, Metelli, Alberto Maria

Rested and Restless Bandits are two well-known bandit settings that are useful to model real-world sequential decision-making problems in which the expected reward of an arm evolves over time due to the actions we perform or due to the nature. In thi

Externí odkaz: http://arxiv.org/abs/2409.05980

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání