From first base: the sequence of the tip of the X chromosome of Drosophila melanogaster, a comparison of two sequencing strategies
Autor: | Juan Modolell, Michael Ashburner, Robert D. C. Saunders, David A. Harris, Paul J. McMillan, Lorna Campbell, Belén Miñana, Christos Louis, Herbert Jäckle, Dana Borkova, Lior Pachter, Evelyn Tait, Stéphanie Gloux, Melanie K. Gatt, Debbie Callister, Inga Siden-Kiamos, Meike Werner, Gordon Dowe, George Papagiannakis, Fotini Mourkioti, Encarnación Madueño, David M. Glover, Nicole Beinert, Cathy Salles, Alain Bucheton, Christine Brun, Edouard Cadieu, Lee Murphy, Valerie Lelaure, Stéphane Dréano, Sophie Vidal, Beatriz de Pablos, Lefteris Spanos, Concepcion Ferraz, Phillipe Valenti, Francis Galibert, Jacques Demaille, Annette Peter, Slava Bolshakov, Fotis C. Kafatos, Alain Billaud, Ulrich Schäfer, Petra Schöttler, Stéphanie Mottier, Nadine S. Henderson, Bart Barrell, Panayiotis V. Benos |
---|---|
Rok vydání: | 2001 |
Předmět: |
Transposable element
Male X Chromosome Letter Molecular Sequence Data Genes Insect Biology DNA sequencing Sequence-tagged site Gene Order Genetics Consensus sequence Basic Helix-Loop-Helix Transcription Factors Animals Drosophila Proteins Insertion sequence Genetics (clinical) Sequence (medicine) Polytene chromosome Physical Chromosome Mapping Computational Biology Sequence Analysis DNA DNA-Binding Proteins Drosophila melanogaster DNA Transposable Elements Female Transcription Factors |
Zdroj: | Genome research. 11(5) |
ISSN: | 1088-9051 |
Popis: | We present the sequence of a contiguous 2.63 Mb of DNA extending from the tip of the X chromosome ofDrosophila melanogaster. Within this sequence, we predict 277 protein coding genes, of which 94 had been sequenced already in the course of studying the biology of their gene products, and examples of 12 different transposable elements. We show that an interval between bands 3A2 and 3C2, believed in the 1970s to show a correlation between the number of bands on the polytene chromosomes and the 20 genes identified by conventional genetics, is predicted to contain 45 genes from its DNA sequence. We have determined the insertion sites ofP-elements from 111 mutant lines, about half of which are in a position likely to affect the expression of novel predicted genes, thus representing a resource for subsequent functional genomic analysis. We compare the European Drosophila Genome Project sequence with the corresponding part of the independently assembled and annotated Joint Sequence determined through “shotgun” sequencing. Discounting differences in the distribution of known transposable elements between the strains sequenced in the two projects, we detected three major sequence differences, two of which are probably explained by errors in assembly; the origin of the third major difference is unclear. In addition there are eight sequence gaps within the Joint Sequence. At least six of these eight gaps are likely to be sites of transposable elements; the other two are complex. Of the 275 genes in common to both projects, 60% are identical within 1% of their predicted amino-acid sequence and 31% show minor differences such as in choice of translation initiation or termination codons; the remaining 9% show major differences in interpretation.[All of the sequences analyzed in this paper have been deposited in the EMBL-Bank database under the following accession nos.: AL009146,AL009147, AL009171, AL009188–AL009196, AL021067, AL021086,AL021106–AL021108, AL021726, AL021728, AL022017, AL022018, AL022139,AL023873, AL023874, AL023893, AL024453, AL024455–AL024457, AL024485,AL030993, AL030994, AL031024–AL031028, AL031128, AL031173, AL031366,AL031367, AL031581–AL031583, AL031640, AL031765, AL031883, AL031884,AL034388, AL034544, AL035104, AL035105, AL035207, AL035245, AL035331,AL035632, AL049535, AL050231, AL050232, AL109630, AL121804, AL121806,AL132651, AL132792, AL132797, AL133503–AL133506, AL138678, AL138971,AL138972, and Z98269. A single file (FASTA format) of the 2.6-Mb contig is available fromftp://ftp.ebi.ac.uk/pub/databases/edgp/contigs/contig_1.fa.] |
Databáze: | OpenAIRE |
Externí odkaz: |