Biomedical Literature Mining for Repurposing Laboratory Tests.
Autor: | Kuusisto F; Morgridge Institute for Research, Madison, WI, USA. finn@cs.wisc.edu., Kleiman R; University of Wisconsin, Madison, WI, USA., Weiss J; Carnegie Mellon University, Pittsburgh, PA, USA. |
---|---|
Jazyk: | angličtina |
Zdroj: | Methods in molecular biology (Clifton, N.J.) [Methods Mol Biol] 2022; Vol. 2496, pp. 91-109. |
DOI: | 10.1007/978-1-0716-2305-3_5 |
Abstrakt: | Epidemiological studies identifying biological markers of disease state are valuable, but can be time-consuming, expensive, and require extensive intuition and expertise. Furthermore, not all hypothesized markers will be borne out in a study, suggesting that high-quality initial hypotheses are crucial. In this chapter, we describe a high-throughput pipeline to produce a ranked list of high-quality hypothesized biomarkers for diseases. We review an example use of this approach to generate a large number of candidate disease biomarker hypotheses derived from machine learning models, filter and rank them according to their potential novelty using text mining, and corroborate the most promising hypotheses with further statistical modeling. The example use of the pipeline uses a large electronic health record dataset and the PubMed corpus, to find several promising hypothesized laboratory tests with previously undocumented correlations to particular diseases. (© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.) |
Databáze: | MEDLINE |
Externí odkaz: |