Data storage in DNA with fewer synthesis cycles using composite DNA letters
Autor: | Orna Atar, Leon Anavy, Roee Amit, Zohar Yakhini, Inbal Vaknin |
---|---|
Rok vydání: | 2019 |
Předmět: |
Sequence analysis
Computer science DNA digital data storage Composite number Biomedical Engineering Information Storage and Retrieval Bioengineering ENCODE Applied Microbiology and Biotechnology 03 medical and health sciences chemistry.chemical_compound 0302 clinical medicine Redundancy (information theory) Humans 030304 developmental biology 0303 health sciences Base Sequence business.industry High-Throughput Nucleotide Sequencing DNA Sequence Analysis DNA chemistry Computer data storage Molecular Medicine business Algorithm Algorithms 030217 neurology & neurosurgery Decoding methods Biotechnology |
Zdroj: | Nature Biotechnology. 37:1229-1236 |
ISSN: | 1546-1696 1087-0156 |
Popis: | The density and long-term stability of DNA make it an appealing storage medium, particularly for long-term data archiving. Existing DNA storage technologies involve the synthesis and sequencing of multiple nominally identical molecules in parallel, resulting in information redundancy. We report the development of encoding and decoding methods that exploit this redundancy using composite DNA letters. A composite DNA letter is a representation of a position in a sequence that consists of a mixture of all four DNA nucleotides in a predetermined ratio. Our methods encode data using fewer synthesis cycles. We encode 6.4 MB into composite DNA, with distinguishable composition medians, using 20% fewer synthesis cycles per unit of data, as compared to previous reports. We also simulate encoding with larger composite alphabets, with distinguishable composition deciles, to show that 75% fewer synthesis cycles are potentially sufficient. We describe applicable error-correcting codes and inference methods, and investigate error patterns in the context of composite DNA letters. Toward more storage for less synthesis using a six-letter composite DNA alphabet. |
Databáze: | OpenAIRE |
Externí odkaz: |