Systematic tissue annotations of genomics samples by modeling unstructured metadata

Autor: Nathaniel T. Hawkins, Marc Maldaver, Anna Yannakopoulos, Lindsay A. Guare, Arjun Krishnan
Jazyk: angličtina
Rok vydání: 2022
Předmět:
Zdroj: Nature Communications, Vol 13, Iss 1, Pp 1-13 (2022)
Druh dokumentu: article
ISSN: 2041-1723
DOI: 10.1038/s41467-022-34435-x
Popis: The 1+ million publicly-available human –omics samples currently remain acutely underused. Here the authors present an approach combining natural language processing and machine learning to infer the source tissue of public genomics samples based on their plain text descriptions, making these samples easy to discover and reuse.
Databáze: Directory of Open Access Journals