On the Distribution of Penalized Maximum Likelihood Estimators: The LASSO, SCAD, and Thresholding

Autor:	Benedikt M. Poetscher, Hannes Leeb
Jazyk:	angličtina
Rok vydání:	2007
Předmět:	FOS: Computer and information sciences Statistics and Probability SCAD Uniform consistency Penalized maximum likelihood LASSO thresholding post-model-selection estimator finite-sample distribution asymptotic distribution estimation of distribution uniform consistency Asymptotic distribution Machine Learning (stat.ML) Feature selection Mathematics - Statistics Theory Statistics Theory (math.ST) Methodology (stat.ME) Finite-sample distribution Lasso (statistics) Statistics - Machine Learning primary Estimation of distribution Statistics FOS: Mathematics Applied mathematics jel:C2 Limit (mathematics) Post-model-selection estimator Statistics - Methodology Mathematics Numerical Analysis Model selection Mathematical statistics Estimator jel:C13 M-estimator Thresholding Distribution (mathematics) Statistics Probability and Uncertainty Oracle property Scad 62J07 62F12 62E15
Popis:	We study the distributions of the LASSO, SCAD, and thresholding estimators, in finite samples and in the large-sample limit. The asymptotic distributions are derived for both the case where the estimators are tuned to perform consistent model selection and for the case where the estimators are tuned to perform conservative model selection. Our findings complement those of Knight and Fu [K. Knight, W. Fu, Asymptotics for lasso-type estimators, Annals of Statistics 28 (2000) 1356–1378] and Fan and Li [J. Fan, R. Li, Variable selection via non-concave penalized likelihood and its oracle properties, Journal of the American Statistical Association 96 (2001) 1348–1360]. We show that the distributions are typically highly non-normal regardless of how the estimator is tuned, and that this property persists in large samples. The uniform convergence rate of these estimators is also obtained, and is shown to be slower than n−1/2 in case the estimator is tuned to perform consistent model selection. An impossibility result regarding estimation of the estimators’ distribution function is also provided.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::34824f71e70bfb090e6630ef9c8940e4 http://arxiv.org/abs/0711.0660 Zobrazit plný text záznamu