One-Layer vs. Two-Layer SOM in the Context of Outlier Identification: A Simulation Study

Authors: Gabriel Antonio Valverde Castilla, José Manuel Mira McWilliams, Beatriz González-Pérez
Language: English
Publication year: 2021
Subject:
Source: Applied Sciences, Vol 11, Iss 14, p 6241 (2021)
Document type: article
ISSN: 2076-3417
DOI: 10.3390/app11146241
Description: In this work, we applied a stochastic simulation methodology to quantify the detection power for outlying mixture components of a stochastic model when applying a reduced-dimension clustering technique such as Self-Organizing Maps (SOMs). The essential feature of SOMs, besides dimensional reduction into a discrete map, is the conservation of topology. In SOMs, two forms of learning are applied: competitive, by sequential allocation of sample observations to a winning node in the map, and cooperative, by the update of the weights of the winning node and its neighbors. Cooperative learning is what achieves the conservation of topology from the original data space to the reduced (typically 2D) map. Here, we compared the performance of one- and two-layer SOMs in the outlier representation task. The same stratified sampling was applied to both the one-layer and two-layer SOMs to estimate the detection power for the outlying mixture component, although stratification is only relevant for the two-layer setting. Two distance measures between points in the map were defined to quantify the conservation of topology. The results of the experiment showed that the two-layer setting was more efficient in outlier detection while maintaining the basic properties of the SOM, including an adequate representation of the distances from the outlier component to the remaining ones.
Database: Directory of Open Access Journals
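
The two forms of learning named in the description, competitive allocation to a winning node and cooperative updating of that node and its neighbors, can be illustrated with a minimal one-layer SOM sketch. This is not the authors' implementation: the function names, the 3x3 grid, the Gaussian neighborhood, the linear decay schedules, and the toy two-component data are all assumptions chosen for illustration, and the two-layer setting studied in the article is not covered.

```python
import math
import random


def best_matching_unit(weights, x):
    """Competitive step: return the grid position of the node closest to x."""
    return min(
        weights,
        key=lambda node: sum((wi - xi) ** 2 for wi, xi in zip(weights[node], x)),
    )


def train_som(data, rows=3, cols=3, epochs=20, lr0=0.5, sigma0=1.5, seed=0):
    """Train a small one-layer SOM (hypothetical defaults); returns node weights."""
    rng = random.Random(seed)
    dim = len(data[0])
    # One weight vector per map node, initialised uniformly in [0, 1).
    weights = {
        (r, c): [rng.random() for _ in range(dim)]
        for r in range(rows)
        for c in range(cols)
    }
    n_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        for x in rng.sample(data, len(data)):  # shuffled pass over the sample
            frac = step / n_steps
            lr = lr0 * (1.0 - frac)                # linearly decaying learning rate
            sigma = sigma0 * (1.0 - frac) + 1e-3   # shrinking neighborhood radius
            bmu = best_matching_unit(weights, x)   # competitive learning
            # Cooperative learning: the winner and its grid neighbors move
            # toward x, weighted by a Gaussian of their distance on the map.
            for node, w in weights.items():
                grid_d2 = (node[0] - bmu[0]) ** 2 + (node[1] - bmu[1]) ** 2
                h = math.exp(-grid_d2 / (2.0 * sigma ** 2))
                for i in range(dim):
                    w[i] += lr * h * (x[i] - w[i])
            step += 1
    return weights


# Toy mixture (assumed data): a large component plus a small outlying one.
data_rng = random.Random(1)
bulk = [[data_rng.gauss(0.3, 0.05), data_rng.gauss(0.3, 0.05)] for _ in range(50)]
outliers = [[data_rng.gauss(0.9, 0.02), data_rng.gauss(0.9, 0.02)] for _ in range(5)]
som = train_som(bulk + outliers)

# Map positions assigned to a typical bulk point and a typical outlier point.
print(best_matching_unit(som, [0.3, 0.3]), best_matching_unit(som, [0.9, 0.9]))
```

Whether the outlying component ends up on its own node depends on the map size, the decay schedules, and the share of outliers in the sample, which is the kind of dependence a simulation study of detection power would quantify.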