Identification of Protein Secretion Systems in Bacterial Genomes Using MacSyFinder Version 2.
Autor: | Abby SS; Univ. Grenoble Alpes, CNRS, UMR 5525, VetAgro Sup, Grenoble INP, TIMC, Grenoble, France. sophie.abby@univ-grenoble-alpes.fr., Denise R; APC Microbiome Ireland & School of Microbiology, University College Cork, Cork, Ireland., Rocha EPC; Institut Pasteur, Université Paris Cité, CNRS UMR3525, Microbial Evolutionary Genomics, Paris, France. |
---|---|
Jazyk: | angličtina |
Zdroj: | Methods in molecular biology (Clifton, N.J.) [Methods Mol Biol] 2024; Vol. 2715, pp. 1-25. |
DOI: | 10.1007/978-1-0716-3445-5_1 |
Abstrakt: | Protein secretion systems are complex molecular machineries that translocate proteins through the outer membrane and sometimes through multiple other barriers. They have evolved by co-option of components from other envelope-associated cellular machineries, making them sometimes difficult to identify and discriminate. Here, we describe how to identify protein secretion systems in bacterial genomes using the MacSyFinder program. This flexible computational tool uses the knowledge gathered from experimental studies to identify homologous systems in genome data. It can be used with a set of predefined MacSyFinder models, "TXSScan," to identify all major secretion systems of diderm bacteria (i.e., with inner and LPS-containing outer membranes) as well as evolutionarily related cell appendages (pili and flagella). For this, it identifies and clusters co-localized genes encoding proteins of secretion systems using sequence similarity search with Hidden Markov Model (HMM) protein profiles. Finally, it checks if the clusters' genetic content and genomic organization satisfy the constraints of the model. TXSScan models can be altered in the command line or customized to search for variants of known secretion systems. Models can also be built from scratch to identify novel systems. In this chapter, we describe a complete pipeline of analysis, starting from (i) the integration of information from a reference set of experimentally studied systems, (ii) the identification of conserved proteins and the construction of their HMM protein profiles, (iii) the definition and optimization of "macsy-models," and (iv) their use and online distribution as tools to search genomic data for secretion systems of interest. MacSyFinder is available here: https://github.com/gem-pasteur/macsyfinder, and MacSyFinder models here: https://github.com/macsy-models . (© 2024. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.) |
Databáze: | MEDLINE |
Externí odkaz: |