A pareto ensemble based spectral clustering framework
Autor: | Juanjuan Luo, Huadong Ma, Dongqing Zhou |
---|---|
Rok vydání: | 2020 |
Předmět: |
Divide and conquer algorithms
Optimization problem Computer science Pareto principle Evolutionary algorithm Initialization 02 engineering and technology Spectral clustering Computational Mathematics Similarity (network science) Artificial Intelligence 020204 information systems 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Cluster analysis Engineering (miscellaneous) Algorithm Information Systems |
Zdroj: | Complex & Intelligent Systems. 7:495-509 |
ISSN: | 2198-6053 2199-4536 |
DOI: | 10.1007/s40747-020-00215-7 |
Popis: | Similarity matrix has a significant effect on the performance of the spectral clustering, and how to determine the neighborhood in the similarity matrix effectively is one of its main difficulties. In this paper, a “divide and conquer” strategy is proposed to model the similarity matrix construction task by adopting Multiobjective evolutionary algorithm (MOEA). The whole procedure is divided into two phases, phase I aims to determine the nonzero entries of the similarity matrix, and Phase II aims to determine the value of the nonzero entries of the similarity matrix. In phase I, the main contribution is that we model the task as a biobjective dynamic optimization problem, which optimizes the diversity and the similarity at the same time. It makes each individual determine one nonzero entry for each sample, and the encoding length decreases toO(N) in contrast with the non-ensemble multiobjective spectral clustering. In addition, a specific initialization operator and diversity preservation strategy are proposed during this phase. In phase II, three ensemble strategies are designed to determine the value of the nonzero value of the similarity matrix. Furthermore, this Pareto ensemble framework is extended to semi-supervised clustering by transforming the semi-supervised information to constraints. In contrast with the previous multiobjective evolutionary-based spectral clustering algorithms, the proposed Pareto ensemble-based framework makes a balance between time cost and the clustering accuracy, which is demonstrated in the experiments section. |
Databáze: | OpenAIRE |
Externí odkaz: |