Normalizing and denoising protein expression data from droplet-based single cell profiling.

Autor: Mulè MP; Multiscale Systems Biology Section, Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Bethesda, MD, USA.; NIH-Oxford-Cambridge Scholars Program, Department of Medicine, University of Cambridge, Cambridge, UK., Martins AJ; Multiscale Systems Biology Section, Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Bethesda, MD, USA., Tsang JS; Multiscale Systems Biology Section, Laboratory of Immune System Biology, National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), Bethesda, MD, USA. john.tsang@nih.gov.; NIH Center for Human Immunology (CHI), National Institutes of Health (NIH), Bethesda, MD, USA. john.tsang@nih.gov.
Jazyk: angličtina
Zdroj: Nature communications [Nat Commun] 2022 Apr 19; Vol. 13 (1), pp. 2099. Date of Electronic Publication: 2022 Apr 19.
DOI: 10.1038/s41467-022-29356-8
Abstrakt: Multimodal single-cell profiling methods that measure protein expression with oligo-conjugated antibodies hold promise for comprehensive dissection of cellular heterogeneity, yet the resulting protein counts have substantial technical noise that can mask biological variations. Here we integrate experiments and computational analyses to reveal two major noise sources and develop a method called "dsb" (denoised and scaled by background) to normalize and denoise droplet-based protein expression data. We discover that protein-specific noise originates from unbound antibodies encapsulated during droplet generation; this noise can thus be accurately estimated and corrected by utilizing protein levels in empty droplets. We also find that isotype control antibodies and the background protein population average in each cell exhibit significant correlations across single cells, we thus use their shared variance to correct for cell-to-cell technical noise in each cell. We validate these findings by analyzing the performance of dsb in eight independent datasets spanning multiple technologies, including CITE-seq, ASAP-seq, and TEA-seq. Compared to existing normalization methods, our approach improves downstream analyses by better unmasking biologically meaningful cell populations. Our method is available as an open-source R package that interfaces easily with existing single cell software platforms such as Seurat, Bioconductor, and Scanpy and can be accessed at "dsb [ https://cran.r-project.org/package=dsb ]".
(© 2022. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.)
Databáze: MEDLINE