Convexity-based clustering criteria: theory, algorithms, and applications in statistics
Author: | Hans-Hermann Bock |
---|---|
Year of publication: | 2004 |
Subject: | |
Source: | Statistical Methods and Applications. 12:293-317 |
ISSN: | 1613-981X, 1618-2510 |
DOI: | 10.1007/s10260-003-0069-8 |
Description: | This paper deals with the construction of optimum partitions $\mathcal{B} = (B_1,\ldots,B_m)$ of $\mathbb{R}^p$ for a clustering criterion which is based on a convex function of the class centroids $E[X \mid X \in B_i]$, as a generalization of the classical SSQ clustering criterion for $n$ data points. We formulate a dual optimality problem involving two sets of variables and derive a maximum-support-plane (MSP) algorithm for constructing a (sub-)optimum partition as a generalized k-means algorithm. We present various modifications of the basic criterion and describe the corresponding MSP algorithm. It is shown that the method can also be used for solving optimality problems in classical statistics (maximizing Csiszár's $\phi$-divergence) and for simultaneous classification of the rows and columns of a contingency table. |
Database: | OpenAIRE |
External link: |
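
The description above characterizes the MSP algorithm as a generalized k-means iteration. The following is only a minimal sketch of that idea for a finite sample, not the paper's own formulation. It assumes the empirical criterion has the form $\sum_i (|C_i|/n)\,\phi(\bar{x}_{C_i})$ with $\phi$ convex, and it alternates between (a) assigning each point to the class whose support plane (tangent plane of $\phi$ at the support point $z_i$) is largest and (b) moving each $z_i$ to its class centroid. The names `phi`, `grad_phi`, `criterion`, and `msp_sketch` are hypothetical, and $\phi(z) = \lVert z \rVert^2$ is chosen purely for illustration.

```python
import numpy as np

def phi(z):
    # Assumed convex function (illustrative choice): squared Euclidean norm.
    return np.sum(z * z, axis=-1)

def grad_phi(z):
    # Gradient of the assumed phi.
    return 2.0 * z

def criterion(X, labels, m):
    # Assumed empirical criterion: sum_i (|C_i| / n) * phi(centroid of C_i).
    n = len(X)
    total = 0.0
    for i in range(m):
        members = X[labels == i]
        if len(members):
            total += len(members) / n * phi(members.mean(axis=0))
    return total

def msp_sketch(X, z, n_iter=50):
    # X: (n, p) data array; z: (m, p) initial support points (hypothetical input).
    m = len(z)
    z = np.asarray(z, dtype=float).copy()
    labels = None
    for _ in range(n_iter):
        # Support plane at z_i: t_i(x) = phi(z_i) + grad_phi(z_i) . (x - z_i)
        g = grad_phi(z)
        planes = phi(z) - np.sum(g * z, axis=1) + X @ g.T  # shape (n, m)
        # Maximum-support-plane assignment: pick the largest plane per point.
        new_labels = planes.argmax(axis=1)
        if labels is not None and np.array_equal(new_labels, labels):
            break  # partition is stationary
        labels = new_labels
        # Move each support point to the centroid of its class (skip empty classes).
        for i in range(m):
            members = X[labels == i]
            if len(members):
                z[i] = members.mean(axis=0)
    return labels, z
```

With the squared norm, $t_i(x) = 2 z_i^\top x - \lVert z_i \rVert^2 = \lVert x \rVert^2 - \lVert x - z_i \rVert^2$, so choosing the largest support plane is the same as choosing the nearest centroid, and the sketch reduces to ordinary k-means. Swapping in another convex `phi` with its gradient changes the partition rule while leaving the loop unchanged.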