Phenotate: crowdsourcing phenotype annotations as exercises in undergraduate classes
Autor: | Tugce B. Balci, Michael Brudno, Willie H. Chang, Annie Olry, Peter N. Robinson, Sylvie Maiella, Brittney Johnstone, Mia Husić, Sarah L. Sawyer, Pouria Mashouri, Alexander X. Lozano, Ana Rath |
---|---|
Rok vydání: | 2020 |
Předmět: |
0301 basic medicine
Information retrieval business.industry Computer science Gold standard (test) 030105 genetics & heredity Crowdsourcing 03 medical and health sciences Annotation Identification (information) 030104 developmental biology Documentation Knowledge base Similarity (psychology) Web application business Genetics (clinical) |
Zdroj: | Genetics in Medicine. 22:1391-1400 |
ISSN: | 1098-3600 |
DOI: | 10.1038/s41436-020-0812-7 |
Popis: | Purpose Computational documentation of genetic disorders is highly relianton structured data for differential diagnosis, pathogenic variant identification, and patient matchmaking. However, most information on rare diseases (RDs) exists in free form text, such as academic literature. To increase availability of structured RD data, we developed a crowdsourcing approach for collecting phenotype information using student assignments. Methods We developed Phenotate, a web application for crowdsourcing disease phenotype annotations through assignments for undergraduate genetics students. Using student-collected data, we generated composite annotations for each disease through a machine learning approach. These annotations were compared with those from clinical practitioners and gold standard curated data. Results Deploying Phenotate in five undergraduate genetics courses, we collected annotations for 22 diseases. Student-sourced annotations showed strong similarity to gold standards, with F-measures ranging from 0.584 to 0.868. Furthermore, clinicians used Phenotate annotations to identify diseases with comparable accuracy to other annotation sources and gold standards. For six disorders, no gold standards were available, allowing us to create some of the first structured annotations for them, while students demonstrated ability to research RDs. Conclusion Phenotate enables crowdsourcing RD phenotypic annotations through educational assignments. Presented as an intuitive web-based tool, it offer spedagogical benefits and augments the computable RD knowledge base. |
Databáze: | OpenAIRE |
Externí odkaz: |