Parallelized Kernel Patch Clustering

Autor: Stefan Faußer, Friedhelm Schwenker
Rok vydání: 2010
Předmět:
Zdroj: Artificial Neural Networks in Pattern Recognition ISBN: 9783642121586
ANNPR
Popis: Kernel based clustering methods allow to unsupervised partition samples in feature space but have a quadratic computation time O(n2) where n are the number of samples. Therefore these methods are generally ineligible for large datasets. In this paper we propose a meta-algorithm that performs parallelized clusterings of subsets of the samples and merges them repeatedly. The algorithm is able to use many Kernel based clustering methods where we mainly emphasize on Kernel Fuzzy C-Means and Relational Neural Gas. We show that the computation time of this algorithm is basicly linear, i.e. O(n). Further we statistically evaluate the performance of this meta-algorithm on a real-life dataset, namely the Enron Emails.
Databáze: OpenAIRE