What could go wrong?
Could your data center be used to produce real viruses now? Who do we trust with these things?
Researchers at the University of Washington (UW) and Microsoft have managed to write data directly onto DNA, a format with dramatic storage densities and a very long life. The team wrote 200MB onto strands of synthetic DNA, including video footage of the band OK Go, the Universal Declaration of Human Rights in more than 100 …
Not sure if serious.
You'd need the code for a real virus to start with, and some clue as to how to make it more dangerous.
If you think that a real, dangerous virus can be created by accident as a by-product of data storage, then you are not the person with that clue.
Dave actually wrote without realizing, "If you think that a real, dangerous virus can be created by accident ..."
The world, if not the entire Universe, is chock-a-block full of viruses. Apparently all created 'by accident'.
If the conditions are right, it's inevitable.
>The world, if not the entire Universe, is chock-a-block full of viruses. Apparently all created 'by accident'.
No, not 'by accident', but by a series of accidents interspersed by selection. Or rather, the natural selection of randomly occurring mutations.
If one were to merely transcribe a cat video into DNA, there would be no process of selection, no realisation of biological traits that could be tested against a selection pressure.
Of course, it is possible that transcribing an encrypted file into DNA would result in a virus, in the same way that it is possible for a monkey at a typewriter to tap out the works of Vonnegut. Possible, yes, but very unlikely. With 'very unlikely' being an extreme understatement.
Hehe, I made a comment which is text book Darwin, and I'm a creationist!
@Jeffy, ah yes, the old abiogenesis is well known, well understood and demonstrated. NOT. Aside from Sol3 there is still only wishful thinking about life elsewhere. And no, Drake Equations and the appalling self-contradictory nonsense in New Scientist sometime in 2015 about earthlike planets and life elsewhere simply demonstrate that another post-western cultural myth is thriving in an age of superstition and wishful thinking.
As far as accidents go, you get to try this at home for fun. Just paste everything from '#!' to 'done' into a text file called random_virus_machine, make it executable (chmod 755 random_virus_machine), and run it (./random_virus_machine).
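#!/bin/bash
# Fill /tmp/virus with 4 KiB of random bytes, try to run it, stop on the first clean exit.
while true; do
  dd if=/dev/urandom of=/tmp/virus bs=1024 count=4 2>/dev/null
  chmod 755 /tmp/virus
  /tmp/virus 2>/dev/null && exit 0
done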
I expected the chances of success to be small. Although it is possible to squash a binary executable into 45 bytes, the chances of those bytes being a valid ELF file are tiny. It is also possible to create a valid executable by starting with '#!' followed by the full path of an interpreter, followed by code valid for that interpreter. There are a few interpreters in /bin, so the required prefix of '#!/bin/' reduces your chances to 1 in 7x10¹⁶. It turns out that if the file does not match any other pattern, the Linux kernel gives the file to one of the shells to chew on.
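For anyone who wants to check the 1 in 7x10¹⁶ figure, here is a minimal sketch. It assumes every byte of the file is uniformly random and independent, so a fixed prefix of n bytes turns up once in 256^n tries. The function name prefix_odds is made up for this example.

```python
def prefix_odds(prefix: bytes) -> int:
    """Return N such that a uniformly random byte string starts with
    `prefix` with probability 1 in N (each byte has 256 possible values)."""
    return 256 ** len(prefix)


if __name__ == "__main__":
    # '#!/bin/' is 7 bytes, i.e. 56 bits that all have to come out right.
    odds = prefix_odds(b"#!/bin/")
    print(f"1 in {odds:.1e}")
```

Seven bytes is 56 bits, so the count is 2^56, which is where the 7x10¹⁶ comes from.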
Shells have an insane default feature. If a line of shell script is complete gibberish, the interpreter outputs "syntax error near unexpected token '%c'" and tries to interpret the next line. There is a real chance that random_virus_machine will actually do something (probably harmless, but don't blame me for rm -r ~).
The DNA decoding machinery inside cells has similar default features. IIRC, it chugs along until it finds a start code, then takes three base pairs (6 bits) at a time as an opcode. 21 of the 64 possible opcodes have a useful meaning (I think the other 43 are 'unexpected symbol error, look for the next start sequence'). Microsoft's error correction code could easily insert invalid opcodes at regular intervals to prevent the creation of anything dangerous. If you fool the software into thinking that your raw file has already been through the error correction filter, then you can have the file->DNA machine create the DNA sequence for a virus (the small ones are only a few K). Normally viral DNA needs to be packed into a phage to be infectious (there are exceptions, ask a biologist). The chances of random data happening to be a valid sequence for an infectious virus are tiny. random_virus_machine is just as likely to output the code for ninvaders.
Just a small correction: all 64 codons are useful. Yes, there are only 20 amino acids that can be derived from these 64 (three-base) combinations, which means multiple combinations result in the same amino acid. This is nature's way of error correction or fault tolerance, as a mutation of only one base in a three-base codon often results in the same overall translation.
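The redundancy is easy to see if you lay the codon table out in code. This is only a sketch: it uses the standard genetic code in the usual T, C, A, G ordering, with '*' for a stop codon, and the names CODON_TABLE and third_base_synonymous_fraction are invented for this example. It counts how often changing just the third base leaves the translation unchanged.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order
# (third base varies fastest); '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Map each of the 4 * 4 * 4 = 64 codons to its amino acid (or stop).
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def third_base_synonymous_fraction() -> float:
    """Fraction of third-base substitutions that leave the translation unchanged."""
    same = total = 0
    for codon, aa in CODON_TABLE.items():
        for base in BASES:
            if base == codon[2]:
                continue  # not a mutation
            total += 1
            same += CODON_TABLE[codon[:2] + base] == aa
    return same / total


if __name__ == "__main__":
    print(len(CODON_TABLE), "codons")
    print(len(set(AMINO_ACIDS) - {"*"}), "amino acids")
    print(f"{third_base_synonymous_fraction():.0%} of third-base changes are silent")
```

Only the third position is checked here; changes in the first or second base are far more likely to alter the amino acid, so "often" rather than "always" is the right word.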
The thermodynamic stability of DNA follows an Arrhenius equation. From the statements in the article, the DNA's data retention at room temperature can be estimated to be about a year. That's from thermodynamic stability. It's shorter if any organism has a chance to eat it.
So "What could go wrong?". Well, this long term storage needs long term refrigeration.
Of course, read & write speed must be horrible.
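For anyone who wants to play with the Arrhenius estimate, here is a rough sketch. It assumes the lifetime scales as exp(Ea/RT), and fits Ea/R to the two figures quoted from the article (around 2,000 years at 10 degrees Celsius, "millions" at -18 degrees). The value of 2 million years used for "millions" is purely an assumption for illustration, and the function names are made up. The room-temperature answer moves a lot depending on what you plug in for "millions", so treat the output as an order-of-magnitude toy, not a measurement.

```python
import math

KELVIN_OFFSET = 273.15


def fit_ea_over_r(temp1_c, life1_years, temp2_c, life2_years):
    """Solve ln(life2 / life1) = (Ea/R) * (1/T2 - 1/T1) for Ea/R (in kelvin)."""
    t1 = temp1_c + KELVIN_OFFSET
    t2 = temp2_c + KELVIN_OFFSET
    return math.log(life2_years / life1_years) / (1 / t2 - 1 / t1)


def lifetime_at(temp_c, ref_temp_c, ref_life_years, ea_over_r):
    """Scale a reference lifetime to another temperature using exp(Ea/RT)."""
    t = temp_c + KELVIN_OFFSET
    t_ref = ref_temp_c + KELVIN_OFFSET
    return ref_life_years * math.exp(ea_over_r * (1 / t - 1 / t_ref))


if __name__ == "__main__":
    # ASSUMPTION: "millions" of years at -18 C is taken as 2e6 for illustration.
    ea_over_r = fit_ea_over_r(10.0, 2000.0, -18.0, 2.0e6)
    room = lifetime_at(25.0, 10.0, 2000.0, ea_over_r)
    print(f"Ea/R = {ea_over_r:.0f} K, lifetime at 25 C = {room:.0f} years")
```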
So when is this "great work" going to make it through peer review?
DNA foundries won't produce viruses, as they BLAST your sequences to prevent you from doing these things.
And it is a bit sad that yesterday saw SynBioBeta Activate in Edinburgh, the opening of a DNA foundry in Edinburgh and the Synthetic Yeast 2.0 annual meeting, also in Edinburgh, and there was no news about any of it.
Also, I do not want to diminish the excellent work at UW, but you could name the DNA manufacturer; they are quite proud of being able to fulfil the DNA order from Microsoft.
Will go mad if they have to sift through zettabytes of twits and posts to find anything even remotely useful.
On the other hand the humanities may have some interesting* PhD theses in the future:
"On the interactions between rival beeotches in New Jersey in the XXI century, as measured by a particularly nasty sequence of tweets and comments from their friends at their community college and tanning salon, and what that reveals about their feeding and breeding habits" - Dr LaShaWanna Frito Rodriguez McNakamoto III.
I've been doing some work for the Drive Trust Alliance (http://drivetrust.com), so I'm tuned to the Full Disk Encryption / Self Encrypting Drive technology. By the end of 2017 nearly all storage will be using it.
So now I foresee a distant future when, after the collapse of human civilization, our successors, having risen to sentience and culture and having a robust archaeological science, discover this trove of human data in the Lunar Long Term Data Repository that we kindly left for future generations.
Unfortunately, all the data is encrypted, and the key is lost. Or there's a typo in the docs.
This speaks to a fundamental problem: such a data trove undoubtedly must contain secrets that should not be available to just anyone. But how to assure that the data is truly available in the distant future?
People have worked out various mechanisms for Time Release Encryption, but the issue is trusting the required 3rd party server.
Such a 3rd party server would have to be honest, and also in existence x years into the future.
If anyone knows of work done to sidestep those issues, I'd love to hear about it.
(The idea came to my attention when I thought of the uncaptured oral history available from people of a generation that doesn't blog. Whimsically, I thought of placing microphones in a pub, with everybody knowing that recordings couldn't be listened to for 100 years.)
"at 10 degrees Celsius the DNA won't degrade for around 2,000 years, and at -18 degrees it could last for millions"
Can we have some baseline comparisons so we can tell if that's good or bad? How long would a DVD-ROM last if you stored it at -18 degrees? What about magnetic tape, or punched cards and paper tape? QR codes etched into titanium sheets?
The latter I think would be far easier to decode by future civilisations, and could be read in a non-destructive way.
Also, there is no risk that you'd accidentally drop the entire world's knowledge into your tea thinking it was a sugar lump.
I sent in my spittle sample to 23&Me. They analyzed it and discovered that my DNA carried sections of object code from 'Grand Theft Auto V'. Now I've received a DMCA Take Down notice demanding that I destroy myself and all copies, and compensate them by $3500.
The data is not written onto DNA. It is written in DNA. That is, the DNA itself is the writing, assembled from four types of nucleotide bases.
(Compare this with a computer writing data onto words. No! It writes data in words by assembling them from two kinds of bits. That's not a perfect analogy, because we can create bits in many ways: discrete electric charges, discrete magnetic domains, holes in paper tape, etc.)
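To make the "written in DNA" point concrete, here is a toy sketch that maps every two bits to one base, so each byte becomes four bases. The mapping (A=00, C=01, G=10, T=11) and the function names are assumptions picked for illustration. This is not the UW/Microsoft encoding, which is described as heavily error corrected; it only shows that the bases themselves are the symbols.

```python
BASES = "ACGT"  # ASSUMED mapping for illustration: A=00, C=01, G=10, T=11


def encode(data: bytes) -> str:
    """Turn each byte into four bases, most significant bit pair first."""
    return "".join(
        BASES[(byte >> shift) & 0b11]
        for byte in data
        for shift in (6, 4, 2, 0)
    )


def decode(strand: str) -> bytes:
    """Inverse of encode(): read four bases at a time back into one byte."""
    if len(strand) % 4:
        raise ValueError("strand length must be a multiple of 4")
    out = bytearray()
    for i in range(0, len(strand), 4):
        byte = 0
        for base in strand[i:i + 4]:
            byte = (byte << 2) | BASES.index(base)
        out.append(byte)
    return bytes(out)


if __name__ == "__main__":
    strand = encode(b"OK Go")
    print(strand)
    print(decode(strand))
```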
The concept has been around for decades, so sci-fi authors have had plenty of time to use the idea! :) Indeed, Dawkins talks about the information half-life in living bacteria in his book The Blind Watchmaker. Obviously this is a different situation, because he was taking into account the bacteria's error-correction mechanisms over millions of generations.
As for inactive DNA, studies conducted on bird bones at ambient temperatures (i.e. not in a frozen vault) in New Zealand suggest an information half-life of around 500 years.
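What a 500-year half-life means in practice, as a quick sketch. It assumes simple exponential decay, and the function name intact_fraction is made up for this example.

```python
def intact_fraction(years: float, half_life_years: float = 500.0) -> float:
    """Fraction of the information still intact after `years`,
    assuming plain exponential decay with the given half-life."""
    return 0.5 ** (years / half_life_years)


if __name__ == "__main__":
    for years in (100, 500, 1000, 5000):
        print(f"{years:>5} years: {intact_fraction(years):.3%} intact")
```

After ten half-lives (5,000 years) less than a thousandth is left, which is why the storage temperature matters so much.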
Philip K Dick's "The Preserving Machine" springs to mind, but (having not read it since, probably, the late 1970s, when I may have been too young to fully appreciate/understand it all) ISTR it was more about turning music into animals as a way of preserving it.
Animals which then began to evolve.
Philip K Dick's "The Preserving Machine"
No, not P K Dick, it was some space-opera genre series, just cannot recall who wrote it.
It struck me at the time that the idea is quite smart. While writing and reading may be a bit on the slow side, the information density is ridiculous and all the error-correction and repair mechanics already exist in nature.
In that case, it may be something I haven't read (or something I've completely forgotten about).
Mind you, had the subject come up a few days ago I might not have remembered The Preserving Machine, either. I just happened to be browsing some old titles a couple of days back and remembering which ones I'd read as a child and what they were about.
I mean we all have destroyed DNA, for example by cooking food. Ultraviolet radiation is also a problem. Essentially you'd get a storage medium which might hold its data for millions of years, but will quickly degrade when left out in the sun, or at temperatures higher than 40 degrees Celsius. It's also very susceptible to chemicals. Just about the only advantage is that you can make copies very easily (even without a living being).
So they're writing data via DNA using a heavily error corrected format. If preserved for a million years it would be readable, but unless someone knew the error correction format what you read would be gibberish.
Sort of like our so-called "junk DNA" - that almost certainly isn't "junk", but that fact likely won't stop alien conspiracy theorists from before long claiming our very DNA contains instructions to build a FTL communication device to call home to our "parents"...
Biting the hand that feeds IT © 1998–2020