filmov
tv
DNA Data Storage Explained in 90 Seconds | Breakthrough Junior Challenge 2022

Показать описание
#breakthroughjuniorchallenge This video is my submission for the 2022 Breakthrough Junior Challenge. I explain the motivation for and basics behind DNA-based data storage in 90 seconds.
Sound effects and music:
Sources and calculations for claims made in the video:
At current rates of data generation, the entire surface of the Earth will need to be covered in data storage centers by the year 2060:
Data generated in 2020 was 64.2 zettabytes, and in 2015 it was 15.5 zettabytes. (64.2/15.5)^(1/5) = 1.329
Integrating 64.2*1.329^x from 0 to 40, you get 1.97 * 10^7 zettabytes, or 1.97*10^10 exabytes
This data center has an estimated capacity of 3 exabytes, and it has a surface area of around 1,500,000 square feet. Since this was completed in 2014, I will assume a density of 5 exabytes per 1.5 million square feet for my calculations.
1.97*10^10 exabytes / (5 exabytes / 1,500,000 square feet) = 5.91 * 10^15 square feet = 211,992,080 square miles
This is definitely not an exact calculation, but it is indicative of the direction we are heading based on current trends.
Visualization of 44 bits in DNA compared to a bit in magnetic tape reel:
An 18 terabyte magnetic tape reel has a tape width of 12.65 mm, a tape thickness of 5.2 μm, and a tape length of 1035 m. 18 * 8 * 10^12 bits / (68082.3 mm^3) = 2.12 * 10^9 bits/mm^3
According to Microsoft Research, DNA has a density of up to 1 exabyte per cubic millimeter = 8 * 10^18 bits/mm^3
(8*10^18 bits/mm^3) / (2.12 * 10^9 bits/mm^3) = 3.77 * 10^9 (DNA is this many times denser than magnetic tape reels)
Each base in a single strand of DNA can store up to 2 bits (in reality encoding strategies make this figure slightly lower), so a 22 nucleotide sequence of DNA can theoretically hold up to 44 bits.
The amount of data that could fit in a pile of magnetic tape reels stacked to the moon could fit in a shoebox filled with DNA:
A shoe box is about 9,000,000 mm^3
The distance from the surface of the Earth is 382,500 km
An 18-terabyte magnetic tape reel has a height of 21.5 mm
According to Microsoft Research, DNA has a density of up to 1 exabyte per cubic millimeter = 1*10^6 terabytes / mm^3 (although with redundancy and in solution, it would be lower)
Magnetic tape reels stacked to the moon:
18 TB * 382,500,000,000 mm / 21.5 mm = 3.2 * 10^11 terabytes
A shoebox full of DNA:
9*10^6 mm^3 * 10^6 TB/mm^3 = 9*10^12 terabytes
Magnetic tapes are today’s standard for archival data storage:
(This source also states that data being recorded is increasing at 30 to 40 percent per year, but I have another source for that.)
Data stored in DNA is potentially safe for hundreds of thousands of years:
It’s easy and inexpensive to make many copies of DNA:
Polymerase chain reaction (PCR) can make over a billion copies of a piece of DNA.
Sound effects and music:
Sources and calculations for claims made in the video:
At current rates of data generation, the entire surface of the Earth will need to be covered in data storage centers by the year 2060:
Data generated in 2020 was 64.2 zettabytes, and in 2015 it was 15.5 zettabytes. (64.2/15.5)^(1/5) = 1.329
Integrating 64.2*1.329^x from 0 to 40, you get 1.97 * 10^7 zettabytes, or 1.97*10^10 exabytes
This data center has an estimated capacity of 3 exabytes, and it has a surface area of around 1,500,000 square feet. Since this was completed in 2014, I will assume a density of 5 exabytes per 1.5 million square feet for my calculations.
1.97*10^10 exabytes / (5 exabytes / 1,500,000 square feet) = 5.91 * 10^15 square feet = 211,992,080 square miles
This is definitely not an exact calculation, but it is indicative of the direction we are heading based on current trends.
Visualization of 44 bits in DNA compared to a bit in magnetic tape reel:
An 18 terabyte magnetic tape reel has a tape width of 12.65 mm, a tape thickness of 5.2 μm, and a tape length of 1035 m. 18 * 8 * 10^12 bits / (68082.3 mm^3) = 2.12 * 10^9 bits/mm^3
According to Microsoft Research, DNA has a density of up to 1 exabyte per cubic millimeter = 8 * 10^18 bits/mm^3
(8*10^18 bits/mm^3) / (2.12 * 10^9 bits/mm^3) = 3.77 * 10^9 (DNA is this many times denser than magnetic tape reels)
Each base in a single strand of DNA can store up to 2 bits (in reality encoding strategies make this figure slightly lower), so a 22 nucleotide sequence of DNA can theoretically hold up to 44 bits.
The amount of data that could fit in a pile of magnetic tape reels stacked to the moon could fit in a shoebox filled with DNA:
A shoe box is about 9,000,000 mm^3
The distance from the surface of the Earth is 382,500 km
An 18-terabyte magnetic tape reel has a height of 21.5 mm
According to Microsoft Research, DNA has a density of up to 1 exabyte per cubic millimeter = 1*10^6 terabytes / mm^3 (although with redundancy and in solution, it would be lower)
Magnetic tape reels stacked to the moon:
18 TB * 382,500,000,000 mm / 21.5 mm = 3.2 * 10^11 terabytes
A shoebox full of DNA:
9*10^6 mm^3 * 10^6 TB/mm^3 = 9*10^12 terabytes
Magnetic tapes are today’s standard for archival data storage:
(This source also states that data being recorded is increasing at 30 to 40 percent per year, but I have another source for that.)
Data stored in DNA is potentially safe for hundreds of thousands of years:
It’s easy and inexpensive to make many copies of DNA:
Polymerase chain reaction (PCR) can make over a billion copies of a piece of DNA.
Комментарии