Re: 200GB to store a genome? Surely not!
No, this is just plain DNA sequencing data, which is strings of A, C, G and T.
Maybe they're storing them as actual strings. Not in ASCII. Not in UTF-8. But in UTF-32 (aka UCS-4). Four bytes per character.
You're about to retort that such an encoding would be incompetent. We're talking about the NHS here...
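
For anyone who wants to check the "four bytes per character" arithmetic, here is a rough Python sketch. The genome length of about 3.1 billion bases is my own assumption for illustration, not a figure from the article, and the 2-bit row is just the theoretical minimum for a four-letter alphabet, not a claim about what anyone actually uses.

```python
# Rough sketch: storage cost per base under a few text encodings.
# ASSUMPTION: ~3.1 billion bases for one human genome (illustrative figure only).
GENOME_BASES = 3_100_000_000

sample = "ACGT" * 4  # 16 bases, enough to measure bytes per character


def bytes_per_base(encoding: str) -> float:
    """Encode the sample and divide the byte count by the number of bases."""
    return len(sample.encode(encoding)) / len(sample)


sizes = {
    "ascii": bytes_per_base("ascii"),
    "utf-8": bytes_per_base("utf-8"),
    # The "-le" variant is used so Python does not prepend a 4-byte BOM.
    "utf-32-le": bytes_per_base("utf-32-le"),
    # Four possible letters fit in 2 bits, i.e. a quarter of a byte.
    "2-bit packed": 2 / 8,
}

for name, per_base in sizes.items():
    total_gb = per_base * GENOME_BASES / 1e9
    print(f"{name:>12}: {per_base:.2f} bytes/base, about {total_gb:.2f} GB")
```

Under that assumption, UTF-32 works out to four times the ASCII figure, as claimed, but still only around 12 GB for a single copy of the sequence.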