Influence of single observations on the choice of the penalty parameter in ridge regression

Autor: Hellton, Kristoffer, Lingjærde, Camilla, De Bin, Riccardo
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: 65th Annual Meeting of the German Association for Medical Informatics, Biometry and Epidemiology (GMDS), Meeting of the Central European Network (CEN: German Region, Austro-Swiss Region and Polish Region) of the International Biometric Society (IBS); 20200906-20200909; Berlin; DOCAbstr. 43 /20210226/
DOI: 10.3205/20gmds112
Popis: Penalized regression methods such as ridge regression heavily rely on the choice of a tuning or penalty parameter, which is often computed via cross-validation. Discrepancies in the value of the penalty parameter may lead to substantial differences in regression coefficient estimates and predictions. In this paper, we investigate the effect of single observations on the optimal choice of the tuning parameter, showing how the presence of influential points can change it dramatically. We distinguish between points as ``expanders'' and ``shrinkers'', based on their effect on the model complexity. Our approach supplies a visual exploratory tool to identify influential points, naturally implementable for high-dimensional data where traditional approaches usually fail. Applications to simulated and real data examples, both low- and high-dimensional, are presented. The visual tool is implemented in the R package influridge.
27 pages, 6 figures
Databáze: OpenAIRE