Text Extraction and Standardization System Development for Pathological Records in the Korea Biobank Network

Authors: SooJeong Ko, Sunghyeon Park, SeolWhan Oh, YunSeon Im, Surin Jung, BoYeon Choi, Jaeyoon Kim, Wona Choi, InYoung Choi
Publication year: 2023
Source: Caring is Sharing – Exploiting the Value in Data for Health and Innovation ISBN: 9781643683881
DOI: 10.3233/shti230156
Description: In Korea, the Korea Centers for Disease Control and Prevention operates the Korea BioBank Network (KBN). KBN holds pathological records collected in Korea, which form a useful dataset for research. In this study, we established a system that saves time and reduces errors through a step-by-step data extraction process for KBN pathological records. We tested the extraction process on 769 lung cancer cohorts and 1292 breast cancer cohorts, and the accuracy was 91%. We expect that this system can be used to efficiently process data from multiple institutions, including the Korea BioBank Network.
Database: OpenAIRE