A double-layered fully automated insomnia identification model employing synthetic data generation using MCSA and CTGAN with single-channel EEG signals

Autor: Steffi Philip Mulamoottil, T. Vigneswaran
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Scientific Reports, Vol 14, Iss 1, Pp 1-16 (2024)
Druh dokumentu: article
ISSN: 2045-2322
DOI: 10.1038/s41598-024-74706-9
Popis: Abstract Insomnia was diagnosed by analyzing sleep stages obtained during polysomnography (PSG) recording. The state-of-the-art insomnia detection models that used physiological signals in PSG were successful in classification. However, the sleep stages of unbalanced data in small-time intervals were fed for classification in previous studies. This can be avoided by analyzing the insomnia detection structure in different frequency bands with artificially generated data from the existing one at the preprocessing and post-processing stages. Hence, the paper proposes a double-layered augmentation model using Modified Conventional Signal Augmentation (MCSA) and a Conditional Tabular Generative Adversarial Network (CTGAN) to generate synthetic signals from raw EEG and synthetic data from extracted features, respectively, in creating training data. The presented work is independent of sleep stage scoring and provides double-layered data protection with the utility of augmentation methods. It is ideally suited for real-time detection using a single-channel EEG provides better mobility and comfort while recording. The work analyzes each augmentation layer’s performance individually, and better accuracy was observed when merging both. It also evaluates the augmentation performance in various frequency bands, which are decomposed using discrete wavelet transform, and observed that the alpha band contributes more to detection. The classification is performed using Decision Tree (DT), Ensembled Bagged Decision Tree (EBDT), Gradient Boosting (GB), Random Forest (RF), and Stacking classifier (SC), attaining the highest classification accuracy of 94% using RF with a greater Area Under Curve (AUC) value of 0.97 compared to the existing works and is best suited for small datasets.
Databáze: Directory of Open Access Journals
Nepřihlášeným uživatelům se plný text nezobrazuje