nf-core/airrflow: An adaptive immune receptor repertoire analysis workflow employing the Immcantation framework.
Author: | Gabernet G; Department of Pathology, Yale School of Medicine, New Haven, Connecticut, United States of America.; Quantitative Biology Center, Eberhard-Karls University of Tübingen, Tübingen, Germany., Marquez S; Department of Pathology, Yale School of Medicine, New Haven, Connecticut, United States of America., Bjornson R; Yale Center for Research Computing, New Haven, Connecticut, United States of America., Peltzer A; Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach, Germany., Meng H; Department of Pathology, Yale School of Medicine, New Haven, Connecticut, United States of America., Aron E; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America., Lee NY; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America., Jensen CG; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America., Ladd D; oNKo-Innate Pty Ltd, Melbourne, Victoria, Australia., Polster M; Quantitative Biology Center, Eberhard-Karls University of Tübingen, Tübingen, Germany.; Department of Computer Science, Eberhard-Karls University of Tübingen, Tübingen, Germany.; M3 Research Center, University Hospital, Tübingen, Germany., Hanssen F; Quantitative Biology Center, Eberhard-Karls University of Tübingen, Tübingen, Germany.; Department of Computer Science, Eberhard-Karls University of Tübingen, Tübingen, Germany.; M3 Research Center, University Hospital, Tübingen, Germany., Heumos S; Quantitative Biology Center, Eberhard-Karls University of Tübingen, Tübingen, Germany.; Department of Computer Science, Eberhard-Karls University of Tübingen, Tübingen, Germany.; M3 Research Center, University Hospital, Tübingen, Germany., Yaari G; Faculty of Engineering, Bar Ilan University, Ramat Gan, Israel., Kowarik MC; Department of Neurology and Stroke, Center for Neurology, Eberhard-Karls University of Tübingen, Tübingen, Germany.; Hertie Institute for Clinical Brain Research, Eberhard-Karls University of Tübingen, Tübingen, Germany., Nahnsen S; Quantitative Biology Center, Eberhard-Karls University of Tübingen, Tübingen, Germany.; Department of Computer Science, Eberhard-Karls University of Tübingen, Tübingen, Germany.; M3 Research Center, University Hospital, Tübingen, Germany.; Institute for Bioinformatics and Medical Informatics (IBMI), Eberhard-Karls University of Tübingen, Tübingen, Germany., Kleinstein SH; Department of Pathology, Yale School of Medicine, New Haven, Connecticut, United States of America.; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America.; Department of Immunobiology, Yale School of Medicine, New Haven, Connecticut, United States of America. |
---|---|
Language: | English |
Source: | PLoS computational biology [PLoS Comput Biol] 2024 Jul 26; Vol. 20 (7), pp. e1012265. Date of Electronic Publication: 2024 Jul 26 (Print Publication: 2024). |
DOI: | 10.1371/journal.pcbi.1012265 |
Abstract: | Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) is a valuable experimental tool to study the immune state in health and following immune challenges such as infectious diseases, (auto)immune diseases, and cancer. Several tools have been developed to reconstruct B cell and T cell receptor sequences from AIRR-seq data and infer B and T cell clonal relationships. However, currently available tools offer limited parallelization across samples, scalability or portability to high-performance computing infrastructures. To address this need, we developed nf-core/airrflow, an end-to-end bulk and single-cell AIRR-seq processing workflow which integrates the Immcantation Framework following BCR and TCR sequencing data analysis best practices. The Immcantation Framework is a comprehensive toolset, which allows the processing of bulk and single-cell AIRR-seq data from raw read processing to clonal inference. nf-core/airrflow is written in Nextflow and is part of the nf-core project, which collects community contributed and curated Nextflow workflows for a wide variety of analysis tasks. We assessed the performance of nf-core/airrflow on simulated sequencing data with sequencing errors and show example results with real datasets. To demonstrate the applicability of nf-core/airrflow to the high-throughput processing of large AIRR-seq datasets, we validated and extended previously reported findings of convergent antibody responses to SARS-CoV-2 by analyzing 97 COVID-19 infected individuals and 99 healthy controls, including a mixture of bulk and single-cell sequencing datasets. Using this dataset, we extended the convergence findings to 20 additional subjects, highlighting the applicability of nf-core/airrflow to validate findings in small in-house cohorts with reanalysis of large publicly available AIRR datasets.
Competing Interests: I have read the journal’s policy and the authors of this manuscript have the following competing interests: SHK receives consulting fees from Peraton. AP is an employee of Boehringer Ingelheim Pharma GmbH & Co KG and declares no conflict of interest. DL is an employee of oNKo-innate Pty Ltd and declares no conflict of interest. MCK has served on advisory boards and received speaker fees / travel grants from Merck, Sanofi-Genzyme, Novartis, Biogen, Janssen, Alexion, Celgene / Bristol-Myers Squibb and Roche. He has received research grants from Merck, Roche, Novartis, Sanofi-Genzyme and Celgene / Bristol-Myers Squibb. All other authors declare no conflicts of interest. (Copyright: © 2024 Gabernet et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.) |
Database: | MEDLINE |