Efficient Algorithms for Probing the RNA Mutation Landscape
Autor: | Bonnie Berger, Srinivas Devadas, Jérôme Waldispühl, Peter Clote |
---|---|
Rok vydání: | 2008 |
Předmět: |
0206 medical engineering
Sequence alignment Hepacivirus 02 engineering and technology Response Elements Evolution Molecular 03 medical and health sciences Cellular and Molecular Neuroscience Genetics Cluster Analysis Humans Nucleic acid structure lcsh:QH301-705.5 Molecular Biology Ecology Evolution Behavior and Systematics 030304 developmental biology Comparative genomics 0303 health sciences Partition function (statistical mechanics) Ecology biology Computational Biology HIV RNA RNA virus Evolutionary pressure biology.organism_classification Biophysics/RNA Structure lcsh:Biology (General) Computational Theory and Mathematics Mutagenesis Modeling and Simulation Mutation Mutation (genetic algorithm) Nucleic Acid Conformation Thermodynamics Replicon Algorithms Software 020602 bioinformatics Research Article |
Zdroj: | PLoS Computational Biology PLoS Computational Biology, Vol 4, Iss 8, p e1000124 (2008) |
ISSN: | 1553-7358 |
DOI: | 10.1371/journal.pcbi.1000124 |
Popis: | The diversity and importance of the role played by RNAs in the regulation and development of the cell are now well-known and well-documented. This broad range of functions is achieved through specific structures that have been (presumably) optimized through evolution. State-of-the-art methods, such as McCaskill's algorithm, use a statistical mechanics framework based on the computation of the partition function over the canonical ensemble of all possible secondary structures on a given sequence. Although secondary structure predictions from thermodynamics-based algorithms are not as accurate as methods employing comparative genomics, the former methods are the only available tools to investigate novel RNAs, such as the many RNAs of unknown function recently reported by the ENCODE consortium. In this paper, we generalize the McCaskill partition function algorithm to sum over the grand canonical ensemble of all secondary structures of all mutants of the given sequence. Specifically, our new program, RNAmutants, simultaneously computes for each integer k the minimum free energy structure MFE(k) and the partition function Z(k) over all secondary structures of all k-point mutants, even allowing the user to specify certain positions required not to mutate and certain positions required to base-pair or remain unpaired. This technically important extension allows us to study the resilience of an RNA molecule to pointwise mutations. By computing the mutation profile of a sequence, a novel graphical representation of the mutational tendency of nucleotide positions, we analyze the deleterious nature of mutating specific nucleotide positions or groups of positions. We have successfully applied RNAmutants to investigate deleterious mutations (mutations that radically modify the secondary structure) in the Hepatitis C virus cis-acting replication element and to evaluate the evolutionary pressure applied on different regions of the HIV trans-activation response element. In particular, we show qualitative agreement between published Hepatitis C and HIV experimental mutagenesis studies and our analysis of deleterious mutations using RNAmutants. Our work also predicts other deleterious mutations, which could be verified experimentally. Finally, we provide evidence that the 3′ UTR of the GB RNA virus C has been optimized to preserve evolutionarily conserved stem regions from a deleterious effect of pointwise mutations. We hope that there will be long-term potential applications of RNAmutants in de novo RNA design and drug design against RNA viruses. This work also suggests potential applications for large-scale exploration of the RNA sequence-structure network. Binary distributions are available at http://RNAmutants.csail.mit.edu/. Author Summary Evolution is a central concept in biology. This phenomenon can be observed at all levels of the organization of life—from single molecules to multicellular organisms. Here, we focus our attention on the implication of evolution at the level of nucleic acid sequences. In this context, RNA sequences presumably have been optimized by evolution to achieve specific functions. These functions are supported by a structure that can be determined using thermodynamics-based models and energy minimization techniques. In this work, we develop efficient algorithms for predicting energetically favorable mutations and study their impact on the stability of the structure. We use these techniques to reveal sequences under evolutionary pressure and design new methods to predict lethal mutations. Applications of our tool lead to a better understanding of the mutational process of some key regulatory elements of two important pathogenic RNA viruses—human immunodeficiency virus and hepatitis C virus. |
Databáze: | OpenAIRE |
Externí odkaz: |