ST-PLS: a multi-directional nearest shrunken centroid type classifier via PLS

Autor: J. Aarøe, Trygve Almøy, Are H. Aastveit, Solve Sæbø
Rok vydání: 2007
Předmět:
Zdroj: Journal of Chemometrics. 22:54-62
ISSN: 1099-128X
0886-9383
Popis: The nearest shrunken centroid (NSC) Classifier is successfully applied for class prediction in a wide range of studies based on microarray data. The contribution from seemingly irrelevant variables to the classifier is minimized by the so-called soft-thresholding property of the approach. In this paper, we first show that for the two-class prediction problem, the NSC Classifier is similar to a one-component discriminant partial least squares (PLS) model with soft-shrinkage of the loading weights. Then we introduce the soft-threshold-PLS (ST-PLS) as a general discriminant-PLS model with soft-thresholding of the loading weights of multiple latent components. This method is especially suited for classification and variable selection when the number of variables is large compared to the number of samples, which is typical for gene expression data. A characteristic feature of ST-PLS is the ability to identify important variables in multiple directions in the variable space. Both the ST-PLS and the NSC classifiers are applied to four real data sets. The results indicate that ST-PLS performs better than the shrunken centroid approach if there are several directions in the variable space which are important for classification, and there are strong dependencies between subsets of variables. Copyright © 2007 John Wiley & Sons, Ltd.
Databáze: OpenAIRE