Article | REF: RE288 V1

Archiving of big data in DNA

Author: François KÉPÈS

Publication date: August 10, 2021

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


Overview

Français

ABSTRACT

Datacenters store digital big data on hard disks and magnetic tapes that are copied every 5-7 years; they devour resources such as land, electricity and scarce materials. After analyzing this state of the art, this article presents a breakthrough solution for big data archiving at the molecular scale, on DNA. It shows its advantages and limitations, then describes the underlying technologies, and extends the discussion to the case of non-DNA polymers. It concludes by outlining the techno-scientific and economic perspectives of this approach.

Read this article from a comprehensive knowledge base, updated and supplemented with articles reviewed by scientific committees.

Read the article

AUTHOR

  • François KÉPÈS: Leader of the "ADN: lire, écrire, stocker l'information" working group of the Académie des technologies. - Académie des technologies, Académie d'Agriculture de France (Paris, France)

 INTRODUCTION

Information has been the driving force behind the socio-economic growth of civilization since its very beginnings. Today, however, the storage, archiving and processing of data by dedicated centers no longer offer sufficient margins for optimization to cope with the deluge of data, and its problematic environmental impact. A recent report by the French Academy of Technologies explores a promising alternative to the conventional model: data archiving on a molecular scale, a 20-year project.

Indeed, the storage and archiving of digital megadata ("big data") using the current approach to data centers will not be sustainable beyond 2040. The global sphere of data (SGD) created by humanity was estimated in 2018 at 33 zettabytes (33 trillion billion bytes), on the same order as the estimated number of stars in the observable universe. SGD increases by a factor of two every two or three years, or about one hundred to one thousand every twenty years. This article does not deal with efforts that might be made to slow the growth of SGD, even though this aspect is part of the wider problem.

Most of this data is then stored in several million data centers (including corporate data centers and the cloud), which operate within transmission networks. Together, these already consume around 2-4% of electricity in advanced countries. Their overall construction and operating costs are in the region of a trillion euros. These centers cover one millionth of the earth's surface (i.e. around 150 km 2 ); at the current rate, they will cover one ten-thousandth to one thousandth by 2040 (i.e. around 150,000 km 2 , 1/3 of France). The storage technologies used by these centers are rapidly becoming obsolete in terms of format, read/write devices and media, which need to be copied every five to seven years to guarantee data integrity. They also pose growing problems of supply for scarce resources such as electronic-grade silicon.

Bearing in mind that by 2040, there will be a hundred to a thousand times more data to preserve, these figures show that the current conservation model will by then have become insufficient, as well as being environmentally unsustainable.

The promising alternative discussed in this article is offered by molecular carriers of information, such as DNA, used here as a chemical agent outside the living world, or other highly promising non-DNA heteropolymers. Potentially, DNA allows information densities ten million times greater than traditional memories: all current SGD would fit in a minivan. DNA is stable at ordinary temperatures for several millennia, with no energy consumption. It can be easily multiplied or destroyed at will....

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource.
Click here to request your free trial access!

Already subscribed? Log in!


The Ultimate Scientific and Technical Reference

A Comprehensive Knowledge Base, with over 1,200 authors and 100 scientific advisors
+ More than 10,000 articles and 1,000 how-to sheets, over 800 new or updated articles every year
From design to prototyping, right through to industrialization, the reference for securing the development of your industrial projects

This article is included in

Bioprocesses and bioproductions

This offer includes:

Knowledge Base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

Practical Path

Operational and didactic, to guarantee the acquisition of transversal skills

Doc & Quiz

Interactive articles with quizzes, for constructive reading

Subscribe now!

Ongoing reading
Archiving megadata in DNA