Completion of the Sequence of a Cetacean Morbillivirus and Comparative Analysis of the Complete Genome Sequences of Four Morbilliviruses

Autor: A. M. J. Collin, J. A. P. Earle, Bertus K. Rima
Rok vydání: 2005
Předmět:
Zdroj: Virus Genes. 30:113-119
ISSN: 1572-994X
0920-8569
DOI: 10.1007/s11262-004-4588-7
Popis: The gene encoding the large (L) protein and the genome termini of the dolphin strain of cetacean morbillivirus (CeMV) were sequenced. The CeMV genome is 15702 nucleotides long and has been compared with other available morbillivirus genome sequences in regards to the "rule of six" and the "phase" of any particular nucleotide, defined as its position within a given hexamer, which here is defined as a group of six nucleotides starting from the 3' end of the genomic RNA. With exception of the position of the start of the F gene, the phase of the transcription start sites of each gene is strictly conserved between the morbilliviruses, but each gene is in a different phase. The lengths of gene transcripts differ between viruses by multiples of six nucleotides with exception of the M and F transcripts. The differences between the various morbilliviruses result from deletions or insertions of multiples of six nucleotides in the 3' and 5' UTRs of the different viral genes. The four bases were distributed non-randomly over the six positions in the hexamer boxes. However, the distribution patterns of each of the four bases indicated that multiples of three were more prevalent than those of six nucleotides. This reflected the positions of nucleotides in codons and codon usage in the reading frames. The L protein of CeMV was found to be 2183 amino acids in length and similar to that of MV and RPV. The CeMV L protein sequence was found to be equidistant between those of the CDV/PDV and MV/RPV subgroups of the morbilliviruses. This concurs with the analyses carried out on the other structural proteins.
Databáze: OpenAIRE