Comparative Genomic Analysis of 60 Mycobacteriophage Genomes: Genome Clustering, Gene Acquisition, and Gene Size
Author: | Molly S. Grace, Roger W. Hendrix, Jessica L. Wynalek, Robert H. Edgar, Marlana S. Myers, Craig L. Peebles, Matthew B. O'Brien, Andrew J. Hryckowian, Welkin H. Pope, Rebecca J. Weber, Alexis L. Smith, Natasha N. Hoyte, Helen Donis-Keller, Katherine L. Germane, Matt W. Bogel, Steven G. Cresawn, Daniel A. Russell, Manisha C. Patel, Ching-Chung Ko, Amy M. Vogelsberger, Deborah Jacobs-Sera, Anthony T. Tantoco, Charles A. Bowman, Graham F. Hatfull, Thuy T. Pham, Jeffrey G. Lawrence, Elizabeth C. Paladin |
---|---|
Year of publication: | 2010 |
Subject: | Genes, Viral; Mycobacteriophage; Molecular Sequence Data; Genomics; Sequence alignment; Bacterial genome size; Genome; Article; Bacteriophage; Open Reading Frames; Structural Biology; Cluster Analysis; Molecular Biology; Phylogeny; Synteny; Genetics; Base Sequence; biology; Nucleotides; Virion; Nucleic acid sequence; Genetic Variation; Mycobacteriophages; Sequence Analysis, DNA; biology.organism_classification; Multigene Family; Sequence Alignment |
Source: | Journal of Molecular Biology. 397:119-143 |
ISSN: | 0022-2836 |
DOI: | 10.1016/j.jmb.2010.01.011 |
Description: | Mycobacteriophages are viruses that infect mycobacterial hosts. Expansion of a collection of sequenced phage genomes to a total of 60, all infecting a common bacterial host, provides further insight into their diversity and evolution. Of the 60 phage genomes, 55 can be grouped into nine clusters according to their nucleotide sequence similarities, and five of these clusters can be further divided into subclusters; the remaining five genomes do not cluster with other phages. Sequence diversity between genomes within a cluster varies greatly: for example, the six genomes in Cluster D share more than 97.5% average nucleotide similarity with each other, whereas similarity between the two genomes in Cluster I is barely detectable by diagonal plot analysis. A total of 6,858 predicted ORFs have been grouped into 1,523 phamilies (phams) of related sequences, 46% of which possess only a single member. Only 18.8% of the phams have sequence similarity to non-mycobacteriophage database entries, and fewer than 10% of all phams can be assigned functions based on database searching or synteny. Genome clustering facilitates the identification of genes that are in greatest genetic flux and are more likely to have been exchanged horizontally in relatively recent evolutionary time. Although mycobacteriophage genes exhibit a smaller average size than genes of their host (205 residues compared to 315), phage genes in higher flux average only ∼100 amino acids, suggesting that the primary units of genetic exchange correspond to single protein domains. |
Database: | OpenAIRE |