A DNA-Based Archival Storage System

Autor: Georg Seelig, Karin Strauss, Douglas Carmean, James Bornholt, Luis Ceze, Randolph Lopez
Rok vydání: 2017
Předmět:
0301 basic medicine
Computer science
DNA digital data storage
Distributed computing
Reliability (computer networking)
Real-time computing
02 engineering and technology
01 natural sciences
03 medical and health sciences
Encoding (memory)
0103 physical sciences
0202 electrical engineering
electronic engineering
information engineering

Redundancy (engineering)
Electrical and Electronic Engineering
General Environmental Science
010302 applied physics
Exabyte
business.industry
010401 analytical chemistry
05 social sciences
Archival storage
020206 networking & telecommunications
General Medicine
Computer Graphics and Computer-Aided Design
0104 chemical sciences
030104 developmental biology
Hardware and Architecture
Computer data storage
General Earth and Planetary Sciences
0509 other social sciences
050904 information & library sciences
business
Software
Random access
Zdroj: ASPLOS
ISSN: 0272-1732
DOI: 10.1109/mm.2017.264163456
Popis: Demand for data storage is growing exponentially, but the capacity of existing storage media is not keeping up. Using DNA to archive data is an attractive possibility because it is extremely dense, with a raw limit of 1 exabyte/mm 3 (109 GB/mm 3 ), and long-lasting, with observed half-life of over 500 years. This paper presents an architecture for a DNA-based archival storage system. It is structured as a key-value store, and leverages common biochemical techniques to provide random access. We also propose a new encoding scheme that offers controllable redundancy, trading off reliability for density. We demonstrate feasibility, random access, and robustness of the proposed encoding with wet lab experiments involving 151 kB of synthesized DNA and a 42 kB random-access subset, and simulation experiments of larger sets calibrated to the wet lab experiments. Finally, we highlight trends in biotechnology that indicate the impending practicality of DNA storage for much larger datasets.
Databáze: OpenAIRE