An approach to remove duplication records in healthcare dataset based on Mimic Deep Neural Network (MDNN) and Chaotic Whale Optimization (CWO)

Autor: M. D. Anto Praveena, B. Bharathi
Rok vydání: 2021
Předmět:
Zdroj: Concurrent Engineering. 29:58-67
ISSN: 1531-2003
1063-293X
DOI: 10.1177/1063293x21992014
Popis: Duplication of data in an application will become an expensive factor. These replication of data need to be checked and if it is needed it has to be removed from the dataset as it occupies huge volume of data in the storage space. The cloud is the main source of data storage and all organizations are already started to move their dataset into the cloud since it is cost effective, storage space, data security and data Privacy. In the healthcare sector, storing the duplicated records leads to wrong prediction. Also uploading same files by many users, data storage demand will be occurred. To address those issues, this paper proposes an Optimal Removal of Deduplication (ORD) in heart disease data using hybrid trust based neural network algorithm. In ORD scheme, the Chaotic Whale Optimization (CWO) algorithm is used for trust computation of data using multiple decision metrics. The computed trust values and the nature of the data’s are sequentially applied to the training process by the Mimic Deep Neural Network (MDNN). It classify the data is a duplicate or not. Hence the duplicates files are identified and they were removed from the data storage. Finally, the simulation evaluates to examine the proposed MDNN based model and simulation results show the effectiveness of ORD scheme in terms of data duplication removal. From the simulation result it is found that the model’s accuracy, sensitivity and specificity was good.
Databáze: OpenAIRE