Clustering Large Data Set: An Applied Comparative Study
Autor: | Laura Bocci, Isabella Mingo |
---|---|
Rok vydání: | 2011 |
Předmět: |
Clustering high-dimensional data
Computer science Correlation clustering Single-linkage clustering computer.software_genre Field (computer science) Large data set mixed clustering statistical source Data set work-flexibility CURE data clustering algorithm cluster analysis computational statistic Consensus clustering Data mining Cluster analysis computer |
Zdroj: | Advanced Statistical Methods for the Analysis of Large Data-Sets ISBN: 9783642210365 |
Popis: | The aim of this paper is to analyze different strategies to cluster large data sets derived from social context. For the purpose of clustering, trials on effective and efficient methods for large databases have only been carried out in recent years due to the emergence of the field of data mining. In this paper a sequential approach based on multiobjective genetic algorithm as clustering technique is proposed. The proposed strategy is applied to a real-life data set consisting of approximately 1.5 million workers and the results are compared with those obtained by other methods to find out an unambiguous partitioning of data. |
Databáze: | OpenAIRE |
Externí odkaz: |