Synthetic data as an enabler for machine learning applications in medicine

Autor: Jean-Francois Rajotte, Robert Bergen, David L. Buckeridge, Khaled El Emam, Raymond Ng, Elissa Strome
Jazyk: angličtina
Rok vydání: 2022
Předmět:
Zdroj: iScience, Vol 25, Iss 11, Pp 105331- (2022)
Druh dokumentu: article
ISSN: 2589-0042
DOI: 10.1016/j.isci.2022.105331
Popis: Summary: Synthetic data generation is the process of using machine learning methods to train a model that captures the patterns in a real dataset. Then new or synthetic data can be generated from that trained model. The synthetic data does not have a one-to-one mapping to the original data or to real patients, and therefore has the potential of privacy preserving properties. There is a growing interest in the application of synthetic data across health and life sciences, but to fully realize the benefits, further education, research, and policy innovation is required. This article summarizes the opportunities and challenges of SDG for health data, and provides directions for how this technology can be leveraged to accelerate data access for secondary purposes.
Databáze: Directory of Open Access Journals