DoubletFinder: Doublet Detection in Single-Cell RNA Sequencing Data Using Artificial Nearest Neighbors
Autor: | Zev J. Gartner, Lyndsay M. Murrow, Christopher S. McGinnis |
---|---|
Rok vydání: | 2019 |
Předmět: |
animal structures
Histology Cell Sequencing data macromolecular substances Computational biology Deep sequencing Article single-cell RNA sequencing Pathology and Forensic Medicine Transcriptome quality-control 03 medical and health sciences Cell throughput 0302 clinical medicine Putative gene Gene expression Genetics medicine Humans Droplet microfluidics RNA-Seq Limit (mathematics) 030304 developmental biology Physics 0303 health sciences urogenital system RNA Cell Biology Expression (mathematics) Kidney cell machine learning medicine.anatomical_structure doublet detection Biochemistry and Cell Biology sense organs Single-Cell Analysis Software 030217 neurology & neurosurgery Biotechnology |
Zdroj: | Cell systems, vol 8, iss 4 |
ISSN: | 2405-4712 |
DOI: | 10.1016/j.cels.2019.03.003 |
Popis: | SUMMARYSingle-cell RNA sequencing (scRNA-seq) using droplet microfluidics occasionally produces transcriptome data representing more than one cell. These technical artifacts are caused by cell doublets formed during cell capture and occur at a frequency proportional to the total number of sequenced cells. The presence of doublets can lead to spurious biological conclusions, which justifies the practice of sequencing fewer cells to limit doublet formation rates. Here, we present a computational doublet detection tool – DoubletFinder – that identifies doublets based solely on gene expression features. DoubletFinder infers the putative gene expression profile of real doublets by generating artificial doublets from existing scRNA-seq data. Neighborhood detection in gene expression space then identifies sequenced cells with increased probability of being doublets based on their proximity to artificial doublets. DoubletFinder robustly identifies doublets across scRNA-seq datasets with variable numbers of cells and sequencing depth, and predicts false-negative and false-positive doublets defined using conventional barcoding approaches. We anticipate that DoubletFinder will aid in scRNA-seq data analysis and will increase the throughput and accuracy of scRNA-seq experiments. |
Databáze: | OpenAIRE |
Externí odkaz: |