An Adversarial Domain Separation Framework for Septic Shock Early Prediction Across EHR Systems
Autor: | Farzaneh Khoshnevisan, Min Chi |
---|---|
Rok vydání: | 2020 |
Předmět: |
FOS: Computer and information sciences
Computer Science - Machine Learning Computer science Process (engineering) 02 engineering and technology 010501 environmental sciences Machine learning computer.software_genre 01 natural sciences Domain (software engineering) Machine Learning (cs.LG) Adversarial system 0202 electrical engineering electronic engineering information engineering Neural and Evolutionary Computing (cs.NE) Representation (mathematics) Invariant (computer science) 0105 earth and related environmental sciences Data collection business.industry Computer Science - Neural and Evolutionary Computing Missing data Recurrent neural network 020201 artificial intelligence & image processing Artificial intelligence business computer |
Zdroj: | IEEE BigData |
DOI: | 10.48550/arxiv.2010.13952 |
Popis: | Modeling patient disease progression using Electronic Health Records (EHRs) is critical to assist clinical decision making. While most of prior work has mainly focused on developing effective disease progression models using EHRs collected from an individual medical system, relatively little work has investigated building robust yet generalizable diagnosis models across different systems. In this work, we propose a general domain adaptation (DA) framework that tackles two categories of discrepancies in EHRs collected from different medical systems: one is caused by heterogeneous patient populations (covariate shift) and the other is caused by variations in data collection procedures (systematic bias). Prior research in DA has mainly focused on addressing covariate shift but not systematic bias. In this work, we propose an adversarial domain separation framework that addresses both categories of discrepancies by maintaining one globally-shared invariant latent representation across all systems} through an adversarial learning process, while also allocating a domain-specific model for each system to extract local latent representations that cannot and should not be unified across systems. Moreover, our proposed framework is based on variational recurrent neural network (VRNN) because of its ability to capture complex temporal dependencies and handling missing values in time-series data. We evaluate our framework for early diagnosis of an extremely challenging condition, septic shock, using two real-world EHRs from distinct medical systems in the U.S. The results show that by separating globally-shared from domain-specific representations, our framework significantly improves septic shock early prediction performance in both EHRs and outperforms the current state-of-the-art DA models. Comment: to be published in 2020 IEEE International Conference on Big Data |
Databáze: | OpenAIRE |
Externí odkaz: |