Exploiting Causal Structure for Robust Model Selection in Unsupervised Domain Adaptation

Autor:	Mihaela van der Schaar, Trent Kyono
Rok vydání:	2021
Předmět:	Computer science business.industry Model selection Causal structure Machine learning computer.software_genre Synthetic data Oracle Domain (software engineering) Factor (programming language) Covariate Feature (machine learning) Artificial intelligence business computer computer.programming_language
Zdroj:	IEEE Transactions on Artificial Intelligence. 2:494-507
ISSN:	2691-4581
DOI:	10.1109/tai.2021.3101185
Popis:	In many real-world settings, such as healthcare, machine learning models are trained and validated on one labeled domain and tested or deployed on another where feature distributions differ, i.e., there is covariate shift. When annotations are costly or prohibitive, an unsupervised domain adaptation (UDA) regime can be leveraged requiring only unlabeled samples in the target domain. Existing UDA methods are unable to factor in a model's predictive loss based on predictions in the target domain and therefore suboptimally leverage density ratios of only the input covariates in each domain. In this work we propose a model selection method for leveraging model predictions on a target domain without labels by exploiting the domain invariance of causal structure. We assume or learn a causal graph from the source domain, and select models that produce predicted distributions in the target domain that have the highest likelihood of fitting our causal graph. We thoroughly analyze our method under oracle knowledge using synthetic data. We then show on several real-world datasets, including several COVID-19 examples, that our method is able to improve on the state-of-the-art UDA algorithms for model selection.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::7cf20ce3949e89d4d8b05918c0a81b53 https://doi.org/10.1109/tai.2021.3101185 Zobrazit plný text záznamu