Modeling RNA secondary structure folding ensembles using SHAPE mapping data.

Authors: Spasic A; Department of Biochemistry & Biophysics, University of Rochester Medical Center, Rochester, NY 14642, USA.; Center for RNA Biology, University of Rochester Medical Center, Rochester, NY 14642, USA., Assmann SM; Department of Biology, Pennsylvania State University, University Park, PA 16802, USA., Bevilacqua PC; Department of Chemistry, Department of Biochemistry & Molecular Biology, Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA., Mathews DH; Department of Biochemistry & Biophysics, University of Rochester Medical Center, Rochester, NY 14642, USA.; Center for RNA Biology, University of Rochester Medical Center, Rochester, NY 14642, USA.; Department of Biostatistics & Computational Biology, University of Rochester Medical Center, Rochester, NY 14642, USA.
Language: English
Source: Nucleic acids research [Nucleic Acids Res] 2018 Jan 09; Vol. 46 (1), pp. 314-323.
DOI: 10.1093/nar/gkx1057
Abstract: RNA secondary structure prediction is widely used for developing hypotheses about the structures of RNA sequences, and structure can provide insight about RNA function. The accuracy of structure prediction is known to be improved using experimental mapping data that provide information about the pairing status of single nucleotides, and these data can now be acquired for whole transcriptomes using high-throughput sequencing. Prior methods for using these experimental data focused on predicting structures for sequences assuming that they populate a single structure. Most RNAs populate multiple structures, however, where the ensemble of strands populates structures with different sets of canonical base pairs. The focus on modeling single structures has been a bottleneck for accurately modeling RNA structure. In this work, we introduce Rsample, an algorithm for using experimental data to predict more than one RNA structure for sequences that populate multiple structures at equilibrium. We demonstrate, using SHAPE mapping data, that we can accurately model RNA sequences that populate multiple structures, including the relative probabilities of those structures. This program is freely available as part of the RNAstructure software package.
(© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.)
Database: MEDLINE