Výsledky vyhledávání - "Rinaldo D'Alessandro"

Report

Statistical Inference for Temporal Difference Learning with Linear Function Approximation

Autor: Wu, Weichen, Li, Gen, Wei, Yuting, Rinaldo, Alessandro

Statistical inference with finite-sample validity for the value function of a given policy in Markov decision processes (MDPs) is crucial for ensuring the reliability of reinforcement learning. Temporal Difference (TD) learning, arguably the most wid

Externí odkaz: http://arxiv.org/abs/2410.16106

Zobrazit plný text záznamu

Report

2-Rectifications are Enough for Straight Flows: A Theoretical Insight into Wasserstein Convergence

Autor: Roy, Saptarshi, Bansal, Vansh, Sarkar, Purnamrita, Rinaldo, Alessandro

Diffusion models have emerged as a powerful tool for image generation and denoising. Typically, generative models learn a trajectory between the starting noise distribution and the target data distribution. Recently Liu et al. (2023b) designed a nove

Externí odkaz: http://arxiv.org/abs/2410.14949

Zobrazit plný text záznamu

Report

Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts

Autor: Nguyen, Huy, Ho, Nhat, Rinaldo, Alessandro

The softmax gating function is arguably the most popular choice in mixture of experts modeling. Despite its widespread use in practice, the softmax gating may lead to unnecessary competition among experts, potentially causing the undesirable phenomen

Externí odkaz: http://arxiv.org/abs/2405.13997

Zobrazit plný text záznamu

Report

On Least Square Estimation in Softmax Gating Mixture of Experts

Autor: Nguyen, Huy, Ho, Nhat, Rinaldo, Alessandro

Mixture of experts (MoE) model is a statistical machine learning design that aggregates multiple expert networks using a softmax gating function in order to form a more intricate and expressive model. Despite being commonly used in several applicatio

Externí odkaz: http://arxiv.org/abs/2402.02952

Zobrazit plný text záznamu

Report

On the estimation of persistence intensity functions and linear representations of persistence diagrams

Autor: Wu, Weichen, Kim, Jisu, Rinaldo, Alessandro

The prevailing statistical approach to analyzing persistence diagrams is concerned with filtering out topological noise. In this paper, we adopt a different viewpoint and aim at estimating the actual distribution of a random persistence diagram, whic

Externí odkaz: http://arxiv.org/abs/2310.11982

Zobrazit plný text záznamu

Report

Inference for Projection Parameters in Linear Regression: beyond $d = o(n^{1/2})$

Autor: Chang, Woonyoung, Kuchibhotla, Arun Kumar, Rinaldo, Alessandro

We consider the problem of inference for projection parameters in linear regression with increasing dimensions. This problem has been studied under a variety of assumptions in the literature. The classical asymptotic normality result for the least sq

Externí odkaz: http://arxiv.org/abs/2307.00795

Zobrazit plný text záznamu

Report

Multilayer random dot product graphs: Estimation and online change point detection

Autor: Wang, Fan, Li, Wanshan, Padilla, Oscar Hernan Madrid, Yu, Yi, Rinaldo, Alessandro

We study the multilayer random dot product graph (MRDPG) model, an extension of the random dot product graph to multilayer networks. To estimate the edge probabilities, we deploy a tensor-based methodology and demonstrate its superiority over existin

Externí odkaz: http://arxiv.org/abs/2306.15286

Zobrazit plný text záznamu

Report

Dual Induction CLT for High-dimensional m-dependent Data

Autor: Bong, Heejong, Kuchibhotla, Arun Kumar, Rinaldo, Alessandro

We derive novel and sharp high-dimensional Berry--Esseen bounds for the sum of $m$-dependent random vectors over the class of hyper-rectangles exhibiting only a poly-logarithmic dependence in the dimension. Our results hold under minimal assumptions,

Externí odkaz: http://arxiv.org/abs/2306.14299

Zobrazit plný text záznamu

Report

High-probability sample complexities for policy evaluation with linear function approximation

Autor: Li, Gen, Wu, Weichen, Chi, Yuejie, Ma, Cong, Rinaldo, Alessandro, Wei, Yuting

This paper is concerned with the problem of policy evaluation with linear function approximation in discounted infinite horizon Markov decision processes. We investigate the sample complexities required to guarantee a predefined estimation error of t

Externí odkaz: http://arxiv.org/abs/2305.19001

Zobrazit plný text záznamu

Report

Divide and Conquer Dynamic Programming: An Almost Linear Time Change Point Detection Methodology in High Dimensions

Autor: Li, Wanshan, Wang, Daren, Rinaldo, Alessandro

We develop a novel, general and computationally efficient framework, called Divide and Conquer Dynamic Programming (DCDP), for localizing change points in time series data with high-dimensional features. DCDP deploys a class of greedy algorithms that

Externí odkaz: http://arxiv.org/abs/2301.10942

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání