Species-level microbial sequence classification is improved by source-environment information

Autor: Kaehler, Benjamin D., Bokulich, Nicholas, McDonald, Daniel, Knight, Rob, Caporaso, J. Gregory, Huttley, Gavin A.
Jazyk: angličtina
Rok vydání: 2019
Zdroj: bioRxiv
DOI: 10.3929/ethz-b-000431207
Popis: Popular naive Bayes taxonomic classifiers for amplicon sequences assume that all species in the reference database are equally likely to be observed. We demonstrate that classification accuracy degrades linearly with the degree to which that assumption is violated, and in practice it is always violated. By incorporating environment-specific taxonomic abundance information, we demonstrate that species-level resolution is attainable.
bioRxiv
Databáze: OpenAIRE