Convexity-based clustering criteria: theory, algorithms, and applications in statistics

Autor: Hans-Hermann Bock
Rok vydání: 2004
Předmět:
Zdroj: Statistical Methods and Applications. 12:293-317
ISSN: 1613-981X
1618-2510
DOI: 10.1007/s10260-003-0069-8
Popis: This paper deals with the construction of optimum partitions ${\cal B} = (B_1,...,B_m)$ of $I\hspace{-4.0pt}R^p$ for a clustering criterion which is based on a convex function of the class centroids $E[X\vert X\in B_i]$ as a generalization of the classical SSQ clustering criterion for n data points. We formulate a dual optimality problem involving two sets of variables and derive a maximum-support-plane (MSP) algorithm for constructing a (sub-)optimum partition as a generalized k-means algorithm. We present various modifications of the basic criterion and describe the corresponding MSP algorithm. It is shown that the method can also be used for solving optimality problems in classical statistics (maximizing Csiszar’s $\phi$ -divergence) and for simultaneous classification of the rows and columns of a contingency table.
Databáze: OpenAIRE