Quantile correlation-based variable selection
Autor: | Yuanyuan Lin, Wenlu Tang, Nian-Sheng Tang, Jinhan Xie |
---|---|
Rok vydání: | 2021 |
Předmět: |
Statistics and Probability
False discovery rate Economics and Econometrics 05 social sciences Feature selection 01 natural sciences Correlation 010104 statistics & probability 0502 economics and business Statistics 0101 mathematics Statistics Probability and Uncertainty High dimensionality Social Sciences (miscellaneous) 050205 econometrics Mathematics Quantile |
DOI: | 10.6084/m9.figshare.14192095.v1 |
Popis: | This article is concerned with identifying important features in high-dimensional data analysis, especially when there are complex relationships among predictors. Without any specification of an actual model, we first introduce a multiple testing procedure based on the quantile correlation to select important predictors in high dimensionality. The quantile-correlation statistic is able to capture a wide range of dependence. A stepwise procedure is studied for further identifying important variables. Moreover, a sure independent screening based on the quantile correlation is developed in handling ultrahigh dimensional data. It is computationally efficient and easy to implement. We establish the theoretical properties under mild conditions. Numerical studies including simulation studies and real data analysis contain supporting evidence that the proposal performs reasonably well in practical settings. |
Databáze: | OpenAIRE |
Externí odkaz: |