Sequence logos: a new way to display consensus sequences
Autor: | Thomas D. Schneider, R M Stephens |
---|---|
Rok vydání: | 1990 |
Předmět: |
Genes
Viral Molecular Sequence Data Biology Frequency Set (abstract data type) Position (vector) Genetics Consensus sequence Escherichia coli Humans Amino Acid Sequence Sequence (medicine) Binding Sites Base Sequence business.industry Nucleic acid sequence Pattern recognition DNA-Directed RNA Polymerases Position weight matrix Bacteriophage lambda Globins Sequence logo Genetic Techniques DNA Transposable Elements T-Phages Artificial intelligence Chromosome Deletion business |
Zdroj: | Nucleic acids research. 18(20) |
ISSN: | 0305-1048 |
Popis: | A graphical method is presented for displaying the patterns in a set of aligned sequences. The characters representing the sequence are stacked on top of each other for each position in the aligned sequences. The height of each letter is made proportional to its frequency, and the letters are sorted so the most common one is on top. The height of the entire stack is then adjusted to signify the information content of the sequences at that position. From these 'sequence logos', one can determine not only the consensus sequence but also the relative frequency of bases and the information content (measured in bits) at every position in a site or sequence. The logo displays both significant residues and subtle sequence patterns. |
Databáze: | OpenAIRE |
Externí odkaz: |