Clusters of nucleotide substitutions and insertion/deletion mutations are associated with repeat sequences

Autor:	Michael J. McDonald, Hsien Da Huang, Jun-Yi Leu, Wei-Chi Wang
Jazyk:	angličtina
Rok vydání:	2011
Předmět:	Mutation rate DNA-Directed DNA Polymerase Haploidy Molecular Biology/Bioinformatics INDEL Mutation Biology (General) Evolutionary Biology/Genomics Genetics Genome Microbiology/Microbial Evolution and Genomics General Neuroscience Eukaryota food and beverages Genetics and Genomics/Bioinformatics Genetics and Genomics/Microbial Evolution and Genomics Replication fork arrest Evolutionary Biology/Human Evolution Evolutionary Biology/Microbial Evolution and Genomics Synopsis Drosophila Genetics and Genomics/Comparative Genomics General Agricultural and Biological Sciences Research Article Genome evolution Saccharomyces cerevisiae Proteins Sequence analysis QH301-705.5 Sequence alignment Genomics Molecular Biology/Molecular Evolution Biology General Biochemistry Genetics and Molecular Biology Evolution Molecular Saccharomyces Genetics and Genomics/Population Genetics Escherichia coli Animals Humans Genetics and Genomics/Genomics Repetitive Sequences Nucleic Acid Comparative genomics Molecular Biology/DNA Repair General Immunology and Microbiology Bacteria Evolutionary Biology/Evolutionary and Comparative Genetics Models Genetic Point mutation Genetic Variation DNA Repair Enzymes Haplotypes human activities
Zdroj:	PLoS Biology PLoS Biology, Vol 9, Iss 6, p e1000622 (2011)
ISSN:	1545-7885 1544-9173
Popis:	The authors propose that short repeat sequences may play an important role in causing the pervasive clustering of mutations across diverse genomes from prokaryotes to humans. The genome-sequencing gold rush has facilitated the use of comparative genomics to uncover patterns of genome evolution, although their causal mechanisms remain elusive. One such trend, ubiquitous to prokarya and eukarya, is the association of insertion/deletion mutations (indels) with increases in the nucleotide substitution rate extending over hundreds of base pairs. The prevailing hypothesis is that indels are themselves mutagenic agents. Here, we employ population genomics data from Escherichia coli, Saccharomyces paradoxus, and Drosophila to provide evidence suggesting that it is not the indels per se but the sequence in which indels occur that causes the accumulation of nucleotide substitutions. We found that about two-thirds of indels are closely associated with repeat sequences and that repeat sequence abundance could be used to identify regions of elevated sequence diversity, independently of indels. Moreover, the mutational signature of indel-proximal nucleotide substitutions matches that of error-prone DNA polymerases. We propose that repeat sequences promote an increased probability of replication fork arrest, causing the persistent recruitment of error-prone DNA polymerases to specific sequence regions over evolutionary time scales. Experimental measures of the mutation rates of engineered DNA sequences and analyses of experimentally obtained collections of spontaneous mutations provide molecular evidence supporting our hypothesis. This study uncovers a new role for repeat sequences in genome evolution and provides an explanation of how fine-scale sequence contextual effects influence mutation rates and thereby evolution. Author Summary An intriguing observation made during the comparison of genomes is that insertion and deletion mutations (indels) cluster together with nucleotide substitutions. Two (not mutually exclusive) hypotheses have been proposed to explain this phenomenon. The first postulates that an indel mutation causes an increase in the likelihood of the surrounding sequence incurring nucleotide substitutions, while the second claims that the region of DNA in which such a cluster is located is more likely to sustain both indels and substitutions. Here, we present evidence suggesting that the region of DNA, and not the indel, is associated with the accumulation of clusters of mutations over evolutionary time scales. We find that repeat sequences are closely associated with a large proportion of indels and that the abundance of repeat sequences is linked with regions of increased nucleotide diversity. By analysing molecular data and measuring the mutation rates of genes engineered to contain repeats, we find that the mutation rate can be manipulated by the insertion of long repeat sequences. On the basis of these results, we propose a model in which repeat sequences are prone to cause stalling of the high-fidelity DNA polymerase, leading to the recruitment of error-prone repair polymerases which then replicate the surrounding sequence with a higher-than-average error rate.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::03e1c515d45ed64cadb4348060adccdb http://europepmc.org/articles/PMC3114760 Zobrazit plný text záznamu Plný text ve formátu PDF