LASSO type Penalized Spline Regression for Binary Data
Autor: | Muhammad Abu Shadeque Mullah, Andrea Benedetti, James A. Hanley |
---|---|
Rok vydání: | 2020 |
Předmět: |
Medicine (General)
Epidemiology Health Informatics 01 natural sciences Generalized linear mixed model Normal distribution 010104 statistics & probability 03 medical and health sciences symbols.namesake 0302 clinical medicine R5-920 Lasso (statistics) Humans Applied mathematics Computer Simulation 0101 mathematics Mathematics Penalized splines Generalized linear mixed models Bayes Theorem Markov chain Monte Carlo Least absolute shrinkage and selection operator (LASSO) Markov Chains Spline (mathematics) 030228 respiratory system Ridge regression Binary data Linear Models symbols Curve fitting Monte Carlo Method Smoothing Research Article |
Zdroj: | BMC Medical Research Methodology, Vol 21, Iss 1, Pp 1-14 (2021) BMC Medical Research Methodology |
DOI: | 10.21203/rs.3.rs-36792/v1 |
Popis: | Background Generalized linear mixed models (GLMMs), typically used for analyzing correlated data, can also be used for smoothing by considering the knot coefficients from a regression spline as random effects. The resulting models are called semiparametric mixed models (SPMMs). Allowing the random knot coefficients to follow a normal distribution with mean zero and a constant variance is equivalent to using a penalized spline with a ridge regression type penalty. We introduce the least absolute shrinkage and selection operator (LASSO) type penalty in the SPMM setting by considering the coefficients at the knots to follow a Laplace double exponential distribution with mean zero. Methods We adopt a Bayesian approach and use the Markov Chain Monte Carlo (MCMC) algorithm for model fitting. Through simulations, we compare the performance of curve fitting in a SPMM using a LASSO type penalty to that of using ridge penalty for binary data. We apply the proposed method to obtain smooth curves from data on the relationship between the amount of pack years of smoking and the risk of developing chronic obstructive pulmonary disease (COPD). Results The LASSO penalty performs as well as ridge penalty for simple shapes of association and outperforms the ridge penalty when the shape of association is complex or linear. Conclusion We demonstrated that LASSO penalty captured complex dose-response association better than the Ridge penalty in a SPMM. |
Databáze: | OpenAIRE |
Externí odkaz: |