Numerical considerations and a new implementation for ICS

Authors: Archimbaud, Aurore; Drmač, Zlatko; Nordhausen, Klaus; Radojičić, Una; Ruiz-Gazen, Anne
Year of publication: 2022
Subject:
Source: SIAM Journal on Mathematics of Data Science, 5, 97-121, 2023
Document type: Working Paper
DOI: 10.1137/22M1498759
Description: Invariant Coordinate Selection (ICS) is a multivariate data transformation and a dimension reduction method that can be useful in many different contexts. It can be used for outlier detection or cluster identification, and can be seen as an independent component or a non-Gaussian component analysis method. The usual implementation of ICS is based on a joint diagonalization of two scatter matrices, and may be numerically unstable in some ill-conditioned situations. We focus on one-step M-scatter matrices and propose a new implementation of ICS based on a pivoted QR factorization of the centered data set. This factorization avoids the direct computation of the scatter matrices and their inverse and brings numerical stability to the algorithm. Furthermore, the row and column pivoting leads to a rank revealing procedure that allows computation of ICS when the scatter matrices are not full rank. Several artificial and real data sets illustrate the interest of using the new implementation compared to the original one.
Database: arXiv
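
Illustrative sketch (not part of the catalog record, and not the authors' implementation): the description contrasts a joint diagonalization of two scatter matrices with a route through a QR factorization of the centered data. The Python code below is a minimal sketch of that contrast under several assumptions that the record itself does not state: the scatter pair is taken to be the sample covariance (COV) and the fourth-moment scatter COV4 with the normalization 1/(n(p+2)), the data are assumed to have full column rank, and plain unpivoted QR from NumPy is used, so the rank-revealing pivoting mentioned in the description is omitted. The function names `ics_classical` and `ics_qr` are hypothetical.

```python
import numpy as np


def ics_classical(X):
    """ICS scores via explicit scatter matrices (assumed pair: COV and COV4)."""
    n, p = X.shape
    Xc = X - X.mean(axis=0)
    # First scatter: sample covariance, formed explicitly.
    S1 = Xc.T @ Xc / (n - 1)
    # Squared Mahalanobis distances, using the explicit inverse of S1.
    d2 = np.einsum("ij,jk,ik->i", Xc, np.linalg.inv(S1), Xc)
    # Second scatter: COV4, a covariance weighted by the distances.
    S2 = (Xc * d2[:, None]).T @ Xc / (n * (p + 2))
    # Joint diagonalization: whiten with S1^{-1/2}, then eigendecompose.
    w, V = np.linalg.eigh(S1)
    S1_inv_sqrt = V @ np.diag(w ** -0.5) @ V.T
    rho, U = np.linalg.eigh(S1_inv_sqrt @ S2 @ S1_inv_sqrt)
    order = np.argsort(rho)[::-1]  # decreasing generalized eigenvalues
    return rho[order], Xc @ S1_inv_sqrt @ U[:, order]


def ics_qr(X):
    """Same quantities from a thin QR of the centered data (no pivoting)."""
    n, p = X.shape
    Xc = X - X.mean(axis=0)
    Q, _ = np.linalg.qr(Xc)  # Xc = Q R, with Q of shape (n, p)
    # Since S1 = R^T R / (n - 1), the Mahalanobis distances depend on Q only.
    d2 = (n - 1) * np.sum(Q ** 2, axis=1)
    # The generalized eigenproblem reduces to a symmetric p x p problem in Q.
    M = (n - 1) / (n * (p + 2)) * (Q * d2[:, None]).T @ Q
    rho, U = np.linalg.eigh(M)
    order = np.argsort(rho)[::-1]
    # Scores follow from Q directly; neither S1, S2 nor an inverse is formed.
    return rho[order], np.sqrt(n - 1) * Q @ U[:, order]
```

On well-conditioned data the two functions should return the same generalized eigenvalues and the same scores up to the sign of each column. The QR variant never forms a scatter matrix or an inverse, which is the general idea the description attributes to the new implementation; the pivoting and rank-revealing steps are left out of this sketch.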