Search results

Report

No-regret Exploration in Shuffle Private Reinforcement Learning

Authors: Bai, Shaojie; Talebi, Mohammad Sadegh; Zhao, Chengcheng; Cheng, Peng; Chen, Jiming

Differential privacy (DP) has recently been introduced into episodic reinforcement learning (RL) to formally address user privacy concerns in personalized services. Previous work mainly focuses on two trust models of DP: the central model, where a ce

External link: http://arxiv.org/abs/2411.11647

Report

BlueME: Robust Underwater Robot-to-Robot Communication Using Compact Magnetoelectric Antennas

Authors: Talebi, Mehron; Mahmud, Sultan; Khalifa, Adam; Islam, Md Jahidul

We present the design, development, and experimental validation of BlueME, a compact magnetoelectric (ME) antenna array system for underwater robot-to-robot communication. BlueME employs ME antennas operating at their natural mechanical resonance fre

External link: http://arxiv.org/abs/2411.09241

Report

Single replica spin-glass phase detection using field variation and machine learning

Authors: Talebi, Ali; Bagherikalhor, Mahsa; Askari, Behrouz; Jafari, G. Reza

The Sherrington-Kirkpatrick spin-glass model used the replica symmetry method to find the phase transition of the system. In 1979-1980, Parisi proposed a solution based on replica symmetry breaking (RSB), which allowed him to identify the underlying

External link: http://arxiv.org/abs/2411.04567

Report

Risk-sensitive Affine Control Synthesis for Stationary LTI Systems

Authors: Hu, Yang; Talebi, Shahriar; Li, Na

To address deviations from expected performance in stochastic systems, we propose a risk-sensitive control synthesis method to minimize certain risk measures over the limiting stationary distribution. Specifically, we extend Worst-case Conditional Va

External link: http://arxiv.org/abs/2410.17581

Report

Uniform Ergodicity and Ergodic-Risk Constrained Policy Optimization

Authors: Talebi, Shahriar; Li, Na

In stochastic systems, risk-sensitive control balances performance with resilience to less likely events. Although existing methods rely on finite-horizon risk criteria, this paper introduces limiting-risk criteria that capture long-term cum

External link: http://arxiv.org/abs/2409.10767

Report

Tractable Offline Learning of Regular Decision Processes

Authors: Deb, Ahana; Cipollone, Roberto; Jonsson, Anders; Ronca, Alessandro; Talebi, Mohammad Sadegh

This work studies offline Reinforcement Learning (RL) in a class of non-Markovian environments called Regular Decision Processes (RDPs). In RDPs, the unknown dependency of future observations and rewards from the past interactions can be captured by

External link: http://arxiv.org/abs/2409.02747

Report

Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models

Authors: Chong, Chun Jie; Hou, Chenxi; Yao, Zhihao; Talebi, Seyed Mohammadjavad Seyed

Web-based Large Language Model (LLM) services have been widely adopted and have become an integral part of our Internet experience. Third-party plugins enhance the functionalities of LLM by enabling access to real-world data and services. However, th

External link: http://arxiv.org/abs/2408.07004

Report

Opinion dynamics on switching networks

Author: Talebi, Amirreza

We study opinion dynamics over a directed multilayer network. In particular, we consider networks in which the impact of neighbors of agents on their opinions is proportional to their in-degree. Agents update their opinions over time to coordinate wi

External link: http://arxiv.org/abs/2407.17749

Report

Simulation in discrete choice models evaluation: SDCM, a simulation tool for performance evaluation of DCMs

Author: Talebi, Amirreza

Discrete choice models (DCMs) have been widely utilized in various scientific fields, especially economics, for many years. These models consider a stochastic environment influencing each decision maker's choices. Extensive research has shown that th

External link: http://arxiv.org/abs/2407.17014

Report

How to Shrink Confidence Sets for Many Equivalent Discrete Distributions?

Authors: Maillard, Odalric-Ambrym; Talebi, Mohammad Sadegh

We consider the situation when a learner faces a set of unknown discrete distributions $(p_k)_{k\in \mathcal K}$ defined over a common alphabet $\mathcal X$, and can build for each distribution $p_k$ an individual high-probability confidence set than

External link: http://arxiv.org/abs/2407.15662
