Large-scale spectral clustering for managing big data in healthcare operations

Autor: Liu, Maoqing, Fard, Nasser, Sadeghzadeh, Keivan
Zdroj: International Journal of Big Data Intelligence; 2017, Vol. 4 Issue: 3 p195-207, 13p
Abstrakt: Healthcare industries have access to a large volume and variety of data about patients' behaviours, diseases, and treatments. There is a significant need for a data-driven system to discover patterns for better understanding of the impact of human risk behaviours on numerous diseases. In order to discover and extract interesting knowledge and pattern from large amount of data, a data mining process for discovering knowledge from unprocessed and raw healthcare data is studied. Methods for analysis of big data, and the role and types of clustering methods are presented. An in-depth analysis of spectral clustering method as a superior clustering algorithm for big healthcare data is presented. The spectral clustering algorithm is applied to a large healthcare data from the behavioural risk factor surveillance system (BRFSS), by partitioning the untrained data to at least four clusters. The MATLAB®R2011b programming environment is utilised as a calculation tool in the experimental design and analysis. The experimental results and analysis, and the implementation process are discussed and the data processing is presented. Sensitivity analysis for both parameters of the spectral clustering are performed to determine their influence on the clustering results.
Databáze: Supplemental Index