Data-driven discovery of probable Alzheimer's disease and related dementia subphenotypes using electronic health records.
Autor: | Xu J; Department of Population Health Sciences Information Technologies and Services, Weill Cornell Medicine New York New York USA., Wang F; Department of Population Health Sciences Information Technologies and Services, Weill Cornell Medicine New York New York USA., Xu Z; Department of Population Health Sciences Information Technologies and Services, Weill Cornell Medicine New York New York USA., Adekkanattu P; Information Technologies and Services, Weill Cornell Medicine New York New York USA., Brandt P; Biomedical Informatics and Medical Education University of Washington Seattle Washington USA., Jiang G; Department of Health Sciences Research Mayo Clinic Rochester Minnesota USA., Kiefer RC; Department of Health Sciences Research Mayo Clinic Rochester Minnesota USA., Luo Y; Feinberg School of Medicine Northwestern University Chicago Illinois USA., Mao C; Feinberg School of Medicine Northwestern University Chicago Illinois USA., Pacheco JA; Feinberg School of Medicine Northwestern University Chicago Illinois USA., Rasmussen LV; Feinberg School of Medicine Northwestern University Chicago Illinois USA., Zhang Y; Department of Population Health Sciences Information Technologies and Services, Weill Cornell Medicine New York New York USA., Isaacson R; Department of Population Health Sciences Information Technologies and Services, Weill Cornell Medicine New York New York USA., Pathak J; Department of Population Health Sciences Information Technologies and Services, Weill Cornell Medicine New York New York USA. |
---|---|
Jazyk: | angličtina |
Zdroj: | Learning health systems [Learn Health Syst] 2020 Sep 10; Vol. 4 (4), pp. e10246. Date of Electronic Publication: 2020 Sep 10 (Print Publication: 2020). |
DOI: | 10.1002/lrh2.10246 |
Abstrakt: | Introduction: We sought to assess longitudinal electronic health records (EHRs) using machine learning (ML) methods to computationally derive probable Alzheimer's Disease (AD) and related dementia subphenotypes. Methods: A retrospective analysis of EHR data from a cohort of 7587 patients seen at a large, multi-specialty urban academic medical center in New York was conducted. Subphenotypes were derived using hierarchical clustering from 792 probable AD patients (cases) who had received at least one diagnosis of AD using their clinical data. The other 6795 patients, labeled as controls, were matched on age and gender with the cases and randomly selected in the ratio of 9:1. Prediction models with multiple ML algorithms were trained on this cohort using 5-fold cross-validation. XGBoost was used to rank the variable importance. Results: Four subphenotypes were computationally derived. Subphenotype A (n = 273; 28.2%) had more patients with cardiovascular diseases; subphenotype B (n = 221; 27.9%) had more patients with mental health illnesses, such as depression and anxiety; patients in subphenotype C (n = 183; 23.1%) were overall older (mean (SD) age, 79.5 (5.4) years) and had the most comorbidities including diabetes, cardiovascular diseases, and mental health disorders; and subphenotype D (n = 115; 14.5%) included patients who took anti-dementia drugs and had sensory problems, such as deafness and hearing impairment.The 0-year prediction model for AD risk achieved an area under the receiver operating curve (AUC) of 0.764 (SD: 0.02); the 6-month model, 0.751 (SD: 0.02); the 1-year model, 0.752 (SD: 0.02); the 2-year model, 0.749 (SD: 0.03); and the 3-year model, 0.735 (SD: 0.03), respectively. Based on variable importance, the top-ranked comorbidities included depression, stroke/transient ischemic attack, hypertension, anxiety, mobility impairments, and atrial fibrillation. The top-ranked medications included anti-dementia drugs, antipsychotics, antiepileptics, and antidepressants. Conclusions: Four subphenotypes were computationally derived that correlated with cardiovascular diseases and mental health illnesses. ML algorithms based on patient demographics, diagnosis, and treatment demonstrated promising results in predicting the risk of developing AD at different time points across an individual's lifespan. Competing Interests: The authors declare no conflict of interest. (© 2020 The Authors. Learning Health Systems published by Wiley Periodicals LLC on behalf of the University of Michigan.) |
Databáze: | MEDLINE |
Externí odkaz: |