Dimensionality Reduction and Microarray Data.

Autor: Barth, Timothy J., Griebel, Michael, Keyes, David E., Nieminen, Risto M., Roose, Dirk, Schlick, Tamar, Gorban, Alexander N., Kégl, Balázs, Wunsch, Donald C., Zinovyev, Andrei Y., Elizondo, David A., Passow, Benjamin N., Birkenhead, Ralph, Huemer, Andreas
Zdroj: Principal Manifolds for Data Visualization & Dimension Reduction; 2007, p293-308, 16p
Abstrakt: Microarrays are being currently used for the expression levels of thousands of genes simultaneously. They present new analytical challenges because they have a very high input dimension and a very low sample size. It is highly complex to analyse multi-dimensional data with complex geometry and to identify low-dimensional "principal objects" that relate to the optimal projection while losing the least amount of information. Several methods have been proposed for dimensionality reduction of microarray data. Some of these methods include principal component analysis and principal manifolds. This article presents a comparison study of the performance of the linear principal component analysis and the non linear local tangent space alignment principal manifold methods on such a problem. Two microarray data sets will be used in this study. A classification model will be created using fully dimensional and dimensionality reduced data sets. To measure the amount of information lost with the two dimensionality reduction methods, the level of performance of each of the methods will be measured in terms of level of generalisation obtained by the classification models on previously unseen data sets. These results will be compared with the ones obtained using the fully dimensional data sets. [ABSTRACT FROM AUTHOR]
Databáze: Supplemental Index