On the Distribution of Penalized Maximum Likelihood Estimators: The LASSO, SCAD, and Thresholding
Autor: | Benedikt M. Poetscher, Hannes Leeb |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2007 |
Předmět: |
FOS: Computer and information sciences
Statistics and Probability SCAD Uniform consistency Penalized maximum likelihood LASSO thresholding post-model-selection estimator finite-sample distribution asymptotic distribution estimation of distribution uniform consistency Asymptotic distribution Machine Learning (stat.ML) Feature selection Mathematics - Statistics Theory Statistics Theory (math.ST) Methodology (stat.ME) Finite-sample distribution Lasso (statistics) Statistics - Machine Learning primary Estimation of distribution Statistics FOS: Mathematics Applied mathematics jel:C2 Limit (mathematics) Post-model-selection estimator Statistics - Methodology Mathematics Numerical Analysis Model selection Mathematical statistics Estimator jel:C13 M-estimator Thresholding Distribution (mathematics) Statistics Probability and Uncertainty Oracle property Scad 62J07 62F12 62E15 |
Popis: | We study the distributions of the LASSO, SCAD, and thresholding estimators, in finite samples and in the large-sample limit. The asymptotic distributions are derived for both the case where the estimators are tuned to perform consistent model selection and for the case where the estimators are tuned to perform conservative model selection. Our findings complement those of Knight and Fu [K. Knight, W. Fu, Asymptotics for lasso-type estimators, Annals of Statistics 28 (2000) 1356–1378] and Fan and Li [J. Fan, R. Li, Variable selection via non-concave penalized likelihood and its oracle properties, Journal of the American Statistical Association 96 (2001) 1348–1360]. We show that the distributions are typically highly non-normal regardless of how the estimator is tuned, and that this property persists in large samples. The uniform convergence rate of these estimators is also obtained, and is shown to be slower than n−1/2 in case the estimator is tuned to perform consistent model selection. An impossibility result regarding estimation of the estimators’ distribution function is also provided. |
Databáze: | OpenAIRE |
Externí odkaz: |