Generation and analysis of 280,000 human expressed sequence tags
Author: | DuBuque T, Bonaldo Mf, Tracy Rohlfing, Gregory G. Lennon, Stephanie L. Chissoe, Morris M, Le N, Chiapelli B, Trevaskis E, LaDeana W. Hillier, Underwood K, Bento Soares M, Elaine R. Mardis, Nicole Dietrich, Marco A. Marra, Tamara A. Kucaba, Hultman M, Wohldman P, Lacy M, Anthony Favello, J. Parsons, Warren Gish, Schellenberg K, Hawkins M, Prange C, Robert H. Waterston, Thierry-Meg J, Rifkin L, Richard K. Wilson, Michael C. Becker, Le M, Moore B, Tan F |
---|---|
Year of publication: | 1996 |
Subject: | Adult; DNA Complementary; Databases Factual; Protein family; Molecular Sequence Data; Computational biology; Biology; Genome; Sequence-tagged site; Pregnancy; Complementary DNA; Genetics; Humans; Gene family; Genomic library; RNA Messenger; Cloning Molecular; Gene; Genetics (clinical); Gene Library; Sequence Tagged Sites; Expressed sequence tag; Genome Human; Infant; Proteins; Introns; Markov Chains; Female |
Source: | Genome Research. 6:807-828 |
ISSN: | 1088-9051 |
DOI: | 10.1101/gr.6.9.807 |
Description: | We report the generation of 319,311 single-pass sequencing reactions (known as expressed sequence tags, or ESTs) obtained from the 5' and 3' ends of 194,031 human cDNA clones. Our goal has been to obtain tag sequences from many different genes and to deposit these in the publicly accessible Data Base for Expressed Sequence Tags. Highly efficient automatic screening of the data allows deposition of the annotated sequences without delay. Sequences have been generated from 26 oligo(dT) primed directionally cloned libraries, of which 18 were normalized. The libraries were constructed using mRNA isolated from 17 different tissues representing three developmental states. Comparisons of a subset of our data with nonredundant human mRNA and protein data bases show that the ESTs represent many known sequences and contain many that are novel. Analysis of protein families using Hidden Markov Models confirms this observation and supports the contention that although normalization reduces significantly the relative abundance of redundant cDNA clones, it does not result in the complete removal of members of gene families. |
Database: | OpenAIRE |