Asymptotic properties of distance-weighted discrimination and its bias correction for high-dimension, low-sample-size data

Autor: Makoto Aoshima, Kento Egashira, Kazuyoshi Yata
Rok vydání: 2021
Předmět:
Zdroj: Japanese Journal of Statistics and Data Science. 4:821-840
ISSN: 2520-8764
2520-8756
Popis: While distance-weighted discrimination (DWD) was proposed to improve the support vector machine in high-dimensional settings, it is known that the DWD is quite sensitive to the imbalanced ratio of sample sizes. In this paper, we study asymptotic properties of the DWD in high-dimension, low-sample-size (HDLSS) settings. We show that the DWD includes a huge bias caused by a heterogeneity of covariance matrices as well as sample imbalance. We propose a bias-corrected DWD (BC-DWD) and show that the BC-DWD can enjoy consistency properties about misclassification rates. We also consider the weighted DWD (WDWD) and propose an optimal choice of weights in the WDWD. Finally, we discuss performances of the BC-DWD and the WDWD with the optimal weights in numerical simulations and actual data analyses.
Databáze: OpenAIRE