Identification of protein-coding sequences using the hybridization of 18S rRNA and mRNA during translation
Autor: | Chuanhua Xing, Mladen A. Vouk, Donald L. Bitzer, Winser E. Alexander, Anne-Marie Stomp |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2008 |
Předmět: |
Genetics
Saccharomyces cerevisiae Proteins Sequence analysis Sequence Analysis RNA Pseudogene Peptide Chain Elongation Translational Computational Biology Ribosomal RNA Biology biology.organism_classification Open reading frame Open Reading Frames GenBank RNA Ribosomal 18S Human genome RNA Messenger Schizosaccharomyces pombe Proteins Gene Base Pairing Schizosaccharomyces |
Zdroj: | Nucleic Acids Research |
ISSN: | 1362-4962 0305-1048 |
Popis: | We introduce a new approach in this article to distinguish protein-coding sequences from non-coding sequences utilizing a period-3, free energy signal that arises from the interactions of the 3'-terminal nucleotides of the 18S rRNA with mRNA. We extracted the special features of the amplitude and the phase of the period-3 signal in protein-coding regions, which is not found in non-coding regions, and used them to distinguish protein-coding sequences from non-coding sequences. We tested on all the experimental genes from Saccharomyces cerevisiae and Schizosaccharomyces pombe. The identification was consistent with the corresponding information from GenBank, and produced better performance compared to existing methods that use a period-3 signal. The primary tests on some fly, mouse and human genes suggests that our method is applicable to higher eukaryotic genes. The tests on pseudogenes indicated that most pseudogenes have no period-3 signal. Some exploration of the 3'-tail of 18S rRNA and pattern analysis of protein-coding sequences supported further our assumption that the 3'-tail of 18S rRNA has a role of synchronization throughout translation elongation process. This, in turn, can be utilized for the identification of protein-coding sequences. |
Databáze: | OpenAIRE |
Externí odkaz: |