A novel approach to compress dna repetative sequences in bio-informatics
Autor: | Samparthi V. S. Kumar, Vmnssvkr Gupta, Deepak Nedunuri, S. M. B. Chowdary |
---|---|
Rok vydání: | 2019 |
Předmět: | |
Zdroj: | Journal of Physics: Conference Series. 1228:012026 |
ISSN: | 1742-6596 1742-6588 |
Popis: | In recent days numbers of gigabyte sequences of nucleotides are stored in a common database Genbank. All the victimization Deoxyribonucleic acid sequences for biological functions are to store the large number of Genomes in a compressed type in economically. Despite the fact that Deoxyribonucleic corrosive arrangements are put away in a packed kind, the information on Deoxyribonucleic corrosive groupings square measure hang on in science databases. For a four-letter alphabet in DNA (Adenine(A), Cytosine(C), Guanine(G) and Thymine(T)), an average description length of 2 bits per base is that the max length required to encode DNA. To reexamine the previous art of compression techniques and its merits and de merits, a novel attempt is initiated. Based on the comparative study of existing algorithms a new method proposed for DNA compression without depending on statistics of sequence set. |
Databáze: | OpenAIRE |
Externí odkaz: |