Showing 1 - 10 of 578 for search: '"Wainwright, Martin J."'
We provide a non-asymptotic analysis of the linear instrumental variable estimator allowing for the presence of exogenous covariates. In addition, we introduce a novel measure of the strength of an instrument that can be used to derive non-asymptoti
External link:
http://arxiv.org/abs/2410.02015
We study a class of structured Markov Decision Processes (MDPs) known as Exo-MDPs. They are characterized by a partition of the state space into two components: the exogenous states evolve stochastically in a manner not affected by the agent's action
External link:
http://arxiv.org/abs/2409.14557
We study best-response type learning dynamics for two player zero-sum matrix games. We consider two settings that are distinguished by the type of information that each player has about the game and their opponent's strategy. The first setting is the
External link:
http://arxiv.org/abs/2407.20128
Author:
Yan, Yuling; Wainwright, Martin J.
Longitudinal or panel data can be represented as a matrix with rows indexed by units and columns indexed by time. We consider inferential questions associated with the missing data version of panel data induced by staggered adoption. We propose a com
External link:
http://arxiv.org/abs/2401.13665
Author:
Duan, Yaqi; Wainwright, Martin J.
We introduce a novel framework for analyzing reinforcement learning (RL) in continuous state-action spaces, and use it to prove fast rates of convergence in both off-line and on-line settings. Our analysis highlights two key stability properties, rel
External link:
http://arxiv.org/abs/2401.05233
We study regression adjustment with general function class approximations for estimating the average treatment effect in the design-based setting. Standard regression adjustment involves bias due to sample re-use, and this bias leads to behavior that
External link:
http://arxiv.org/abs/2311.10076
Key challenges in running a retail business include how to select products to present to consumers (the assortment problem), and how to price products (the pricing problem) to maximize revenue or profit. Instead of considering these problems in isola
External link:
http://arxiv.org/abs/2309.08634
We study semi-parametric estimation of the population mean when data is observed missing at random (MAR) in the $n < p$ "inconsistency regime", in which neither the outcome model nor the propensity/missingness model can be estimated consistently. Con
External link:
http://arxiv.org/abs/2309.01362
Anecdotally, using an estimated propensity score is superior to the true propensity score in estimating the average treatment effect based on observational data. However, this claim comes with several qualifications: it holds only if propensity score
External link:
http://arxiv.org/abs/2303.17102
Estimation problems with constrained parameter spaces arise in various settings. In many of these problems, the observations available to the statistician can be modelled as arising from the noisy realization of the image of a random linear operator;
External link:
http://arxiv.org/abs/2303.12613