Evaluate the number of clusters in finite mixture models with the penalized histogram difference criterion

Autor: Siliang Zhang, Weilu Lin, Yonghong Wang, Yingping Zhuang
Rok vydání: 2013
Předmět:
Zdroj: Journal of Process Control. 23:1052-1062
ISSN: 0959-1524
DOI: 10.1016/j.jprocont.2013.06.008
Popis: Aimed at the determination of the number of mixtures for finite mixture models (FMMs), in this work, a new method called the penalized histogram difference criterion (PHDC) is proposed and evaluated with other criteria such as Akaike information criterion (AIC), the minimum message length (MML), the information complexity (ICOMP) and the evidence of data criterion (EDC). The new method, which calculates the penalized histogram difference between the data generated from estimated FMMs and those for modeling purpose, turns out to be better than others for data with complicate mixtures patterns. It is demonstrated in this work that the PHDC can determine the optimal number of clusters of the FMM. Furthermore, the estimated FMMs asymptotically approximate the true model. The utility of the new method is demonstrated through synthetic data sets analysis and the batch-wise comparison of citric acid fermentation processes.
Databáze: OpenAIRE