Completion of the Sequence of a Cetacean Morbillivirus and Comparative Analysis of the Complete Genome Sequences of Four Morbilliviruses
Authors: | A. M. J. Collin, J. A. P. Earle, Bertus K. Rima |
---|---|
Year of publication: | 2005 |
Subject: |
Genes
Viral Transcription Genetic Sequence analysis animal diseases Molecular Sequence Data Reading frame Sequence Homology Genome Viral Genome Open Reading Frames Viral Proteins Morbillivirus Virology Genetics Animals RNA Messenger Codon 3' Untranslated Regions Molecular Biology Gene Phylogeny Sequence Deletion Recombination Genetic Base Composition Base Sequence biology General Medicine biology.organism_classification Cetacean morbillivirus Open reading frame Codon usage bias RNA Viral Cetacea Transcription Initiation Site 5' Untranslated Regions Sequence Analysis |
Source: | Virus Genes. 30:113-119 |
ISSN: | 1572-994X 0920-8569 |
DOI: | 10.1007/s11262-004-4588-7 |
Description: | The gene encoding the large (L) protein and the genome termini of the dolphin strain of cetacean morbillivirus (CeMV) were sequenced. The CeMV genome is 15702 nucleotides long and has been compared with other available morbillivirus genome sequences with regard to the "rule of six" and the "phase" of any particular nucleotide, defined as its position within a given hexamer, which here is defined as a group of six nucleotides starting from the 3' end of the genomic RNA. With the exception of the position of the start of the F gene, the phase of the transcription start sites of each gene is strictly conserved between the morbilliviruses, but each gene is in a different phase. The lengths of gene transcripts differ between viruses by multiples of six nucleotides, with the exception of the M and F transcripts. The differences between the various morbilliviruses result from deletions or insertions of multiples of six nucleotides in the 3' and 5' UTRs of the different viral genes. The four bases were distributed non-randomly over the six positions in the hexamer boxes. However, the distribution patterns of each of the four bases indicated that multiples of three were more prevalent than those of six nucleotides. This reflected the positions of nucleotides in codons and codon usage in the reading frames. The L protein of CeMV was found to be 2183 amino acids in length and similar to that of MV and RPV. The CeMV L protein sequence was found to be equidistant between those of the CDV/PDV and MV/RPV subgroups of the morbilliviruses. This concurs with the analyses carried out on the other structural proteins. |
Database: | OpenAIRE |
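The "rule of six" and the hexamer "phase" described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' analysis code. The function names, the 1-to-6 phase numbering, and the convention that the input string is written starting from the 3' end of the genomic RNA are all assumptions made for illustration; only the genome length of 15702 nucleotides comes from the record.

```python
from collections import Counter

HEXAMER = 6
CEMV_GENOME_LENGTH = 15702  # nucleotides, as stated in the abstract


def obeys_rule_of_six(genome_length):
    """Return True if the genome length is an exact multiple of six."""
    return genome_length % HEXAMER == 0


def phase(position):
    """Phase (1..6) of a nucleotide within its hexamer.

    `position` is 1-based and counted from the 3' end of the genomic RNA.
    Numbering phases 1..6 is an assumption of this sketch.
    """
    if position < 1:
        raise ValueError("position must be 1 or greater")
    return (position - 1) % HEXAMER + 1


def base_counts_by_phase(genomic_rna_3_to_5):
    """Tally how often each base occurs at each of the six hexamer phases.

    The sequence is assumed to be written starting at the 3' end.
    """
    counts = {p: Counter() for p in range(1, HEXAMER + 1)}
    for position, base in enumerate(genomic_rna_3_to_5.upper(), start=1):
        counts[phase(position)][base] += 1
    return counts


if __name__ == "__main__":
    print(obeys_rule_of_six(CEMV_GENOME_LENGTH))
    # Toy sequence, not real CeMV data.
    for p, tally in base_counts_by_phase("ACGUACGUACGU").items():
        print(p, dict(tally))
```

Under these assumptions, two transcription start sites share a phase exactly when their positions differ by a multiple of six, which is why insertions or deletions of six-nucleotide multiples in the UTRs leave the phase unchanged.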