back to article Where has all the bad storage gone?

Spare a thought for the humble storage columnist looking for targets to skewer. It's a tricky job if you want to sound off - everything is just so basically good these days, and with no stand-out dismal products, what is there to knock? It was different a few years ago. Ever tried backing up your PC to Travan tape? I made that …

COMMENTS

This topic is closed for new posts.

Page:

  1. Anonymous Coward
    Anonymous Coward

    re:

    Top level recap: RAID for online continuity, tape backup for offline save your arse, and off-systems disk for nearline with perhaps a side order of deduplication if you want to be a bit flash (hint: armour plate your dedup index).

    Main thrust of the article: no bad storage? Maybe your test budget is a bit small... there's plenty out there.

    BTW, RAID5 is a disaster wating to happen* (RAID10 if you must), and fair point re:RAID controller** going tits-up: you don't get redundant RAID controllers... thats when you mandate clustered systems with independent disk controllers, or failover infrastructure e.g. VMware etc.

    * RAID5 does not EVER check parity on read: when you write the garbage sector back garbage

    parity will be calculated and your RAID5 integrity is lost

    ** SCSI reserves blocks for remapping when disk sectors get flaky; they don't report these

    back to the OS; when they're used up, it start writing garbage (now refer to the * section)

  2. Gaius

    rsync

    Problem with rsync is: if a file becomes corrupt, you overwrite your good backup copy. Time Machine avoids this; you just go grab the version from one hour before the corruption occurred, or the last known good time.

  3. Anonymous Coward
    Boffin

    @Paul Bruneau

    Yes, there were once amos32 users. I quite liked it, actually. Except for vcrsav & vcrres, of course. And then there was dskpak, which always started yet another pass just when you thought (or hoped) the damn thing had finished. The rdbms (IMS) was pretty good, though.

  4. Nigel
    Boffin

    @Gaius - re rsync

    Rsync does not by default overwrite. It writes new, then deletes (unlinks) old.

    Pedantry, one might think. But not so. In fact, Linux offers the means to roll your own time machine. The key to this is cp -al to create a hardlink farm duplicating the current state of your backup, then rsync to update the hardlink farm. Files that have changed will be updated, but the original will still be accessible through the original link, and won't get deleted until its link count reaches zero.

    See http://www.mikerubel.org/computers/rsync_snapshots/ for a good explanation.

    As to why no Windows time machine? I'd guess that NTFS is not a linux-like FS with inodes and hardlinks, which therefore can't easily support this sort of magic. Not that I'd regard "backup" to the same hard disk, or even the same system, as what I'm backing up from, as secure. The backup disk needs to be far enough away that it won't be stolen by the same thief, nor destroyed by the same flood or fire.

  5. VoiceOfRaisin
    Go

    I'd bet my lunch...

    ...that Apple uses the techniques described here:

    http://www.mikerubel.org/computers/rsync_snapshots/

    I've used a variation of this on Linux for years, but it should be OK on Windows (there is a section in the FAQ about some cygwin rsync issues, but there are workarounds). This has saved my bacon several times, as I have suffered from more than my fair share of hard-drive failures (actually the reason I read this misleadingly titled article).

    How it works is fully described if you follow the link, but in summary, each backup stores a complete snapshot of the filesystem, but unchanged files are simply linked. If you're canny, you can have daily backups, with hourly snapshots since the last daily, and 5-minute backups since the last hourly. You therefore need never lose more than 5 minute's work.

    For typical usage patterns, where most of a disk is unchanged, a backup drive about twice the size of the live drive can store backups going back several years, even if snapshots are taken every 5 minutes. All snapshots can be accessed just like a live filesystem. Comparing two versions of a file is simplicity itself, and file recovery is just a simple file copy.

    Of course, there's no slick GUI, but once it's up and running, there's no need for any UI at all - it just works, and because the backups are live filesystems, you can access files in the snapshots using whatever software you want.

  6. Alan Brown
    Coat

    Replication is not backup either!

    Ditto on the comments about raid, but simply using rsync or other techniques to copy your files to a remote drive isn't a backup strategy either.

    What happens when the file you just copied turns out to be corrupt? You need an older version.

    How many older versions do you need? In one case where a telco killed their system, they had to go back 14 MONTHS to find data images for a phone exchange that weren't corrupted, then wind in all the incremental changes over a 4 week period. Meantime 90,000 people had dodgy phone service (I was one of them)

    Home backups are a pain in the a***, primarily because of media fragility and where to keep it safely (backing up 1Tb+ onto DVDs is a no-go, big tapes are pricey and hard drives are touchy)

    For $orkplace I use Bacula, but I'm backing up several tens of Terabytes with it and have all the right gear (tape robot, fibre, giant data safe, blah blah etc ) to go with it.

    Bacula's fairly simple to setup for home use. It doesn't have a whizzy gui and you need a propellor beanie to set it up, BUT it works really well.

    The online services seem like a good idea, modulo the question of "can you trust them?" - encypting your data BEFORE it goes upstream might solve that though - but you still have the issue of blowing your data cap every time you make a full backup. :(

    Mine's the one with the tinfoil beanie in the shoulder pocket.

  7. Anonymous Coward
    Coat

    Tar cvf ... ...

    mine's the one with the unix manual in the pocket.

    Err... pockets.

  8. Paul Bruneau

    @Bert Ragnarok, AMOS user

    I programmed AMOS from 1995 until about 1999. We had the VCR tapes lying around from the past, but I didn't have to use them. We had a DAT drive by the time I got there :)

  9. BlueGreen

    @Replication is not backup either!

    re. corrupt backups discovered rather too late, I had a similar experience about ten years ago. Fortunately it was a prototype systems and we only had to go back six days, so it was a very cheap lesson: always (where size permits) do a consistency check of your DB before backup.

    I've pushed that policy on management wherever I've worked since.

    It does leave a question though; never mind disks failing, most people make the unwarranted assumption that what's written to a disk is what you told it to write, and I don't believe it. Disks are probabilistic devices (like any physical device) so you can't assume that. A consistency check as mentioned above will catch many of those write failures but not all, and consistency means only consistency - it doesn't mean the data is what you gave it. To be fair, it usually does and when it doesn't it will be most likely be single bit flips in the major bulk of the data such as a phone number, not the relatively tiny and very much more fragile DB metadata which would be caught by a consistency check.

    Short of more complex error detection & correction everywhere from the mobo to the disk platter (which I'd gladly pay for, in money and performance), I don't know how to compensate for this. I don't think RAID does; it seems only there to compensate for disk crashes, not bit/sector/whatever errors.

    Any thoughts?

  10. Anonymous Coward
    Anonymous Coward

    I can't believe nobody has linked to this yet

    Here.

    http://www.jwz.org/doc/backups.html

    This counts as another vote for rsync.

  11. F Seiler

    re: Synctoy? By Rob Daglish

    I used SyncToy scheduled with the normal windows scheduler at work to copy the work on my local disk to a share on a server which had some kind of "go back to time X mode" (not sure what, but not windows based). Yes the SyncToy is nice, but it has its limitations too (apart from it alone not being a real back up, but when copying to ZFS or similar it can be part of one).

    My work folders were not particularly lage, maybe 20GB max, but contained a lot of files (source code, test file sets).

    The bloody thing spent usually some 20minutes figuring out what it had to copy over, thereby pushing the CPU really hard, more or less regardless of how much changed. Usually it then did the actual "sync" in considerably less time than it before spent on figuring out what to do.

    I suspect that the destiation was on a remote machine (LAN) didn't help, but still i think there must be better algorithms out there than what SyncToy uses to "analyze". Anyway, MS presented it a simple tool to sync "one or multiple folders", but i can't rember they ever said it was designed to sync a whole jungle of folders and text files over LAN :)

  12. Anonymous Coward
    Paris Hilton

    Why Time Machine works

    Time Machine takes away the constant battle of effort trying to decide what needs backing up and how to back it up - it turns your disk from being a hard-drive with files and directories into more like the idea non-techies have of what that disk is, a place that stores everything you do and allows you to go back to old versions simply if you need to. That it also helps when the hardware is upgraded, lost or broken is even better.

    I'm sure all the readers are aware that the weakest link in any backup strategy is the fleshy button-prodder. Apple have looked at the problem in terms of people who are used to the browser recording every site they've visited and the word-processor remembering every change they made to a document (more or less, depending on things like whether the program was quit out of or not) (anybody remember the apocryphal "some users" tale of the man with one huge word doc of every document he'd ever written?) but somehow when things are written to disk (or "memory" depending on the level of understanding) they can get corrupted, lost etc.

    I like Apple's design strategy for these things - I've always used various unixes (including inevitably now some godawful linux) at work, when I go home I want to use a computer that's taken care of all the things I have to worry about in the office. I don't want another job at home, I want a computer that has tried to save me from worrying about them. Adding backups and transfers to the list of these things that have been thought out and essentially just work is as good to me as when the iPod worked out that spinning a wheel was a great way to browse through my albums along with the breakthrough that software I used to play mp3s on the computer would also be the software that would copy them to the iPod.

    Backing up isn't hard when it's a job or a hobby ... but when you just want it to work while living in blissful ignorance (I can't make the car engine metaphor here because it's overused and truthfully I do have more understanding of how they work than is helpful in most situations, but normally this is where that cliche creeps in) it's easier to ignore it and hope for the best than try to weigh up the options, formulate a plan (do I need to backup weekly, daily, monthly ?!) and remember how to get stuff back from backup, how to search and find the version / file I want to restore and many other things that Time Machine has essentially solved.

    To backup, plug in backup disk. To backup continually leave disk attached to computer. To restore (while backup disk attached - or capsule is switched on, for the wireless version), find/search document then hit Time Machine button to go back in time to before you broke it. Would you rather explain this to your parents or how to use rsync ? (Assuming you could get them over the shock of saying "arse-ing" so very many times.)

    wwphd?

  13. Mike

    Backups today are a joke

    Honestly, backups anymore are a joke. You don't like Vista's shadow copy? How can you honestly make it easier than opening the entire drive or folder up in the exact same format as the drive itself. Oh wait, it isn't pretty and doesn't have rainbow flags on it? I'm sorry.

    1996: Tape backups. Ugh. Slow, and you couldn't use your computer for several hours. Set it overnight and leave it running, assuming you could actually sleep through the racket.

    1998: The glory of CD burning came to computers. Sure it was 2x and took 36 minutes to burn a single disc, and if you so much as surfed the Internet it would break, but it was faster than tape backup and didn't require software for other computers to read the files.

    2000 or so: Buffering technology which prevented coastering of CDs, and fast enough computers to handle the load. Also DMA Mode for optical drives. Drives that were fast and could burn a whole CD in 3 or 4 minutes.

    2002: Jumpdrives which let you quickly copy data and take it to another computer. No software required.

    2004: DVD burners start becoming affordable and mainstream. Now instead of 700 megabytes, you can burn 4.7 gigabytes for about the same price.

    2009: DVD burners and jumpdrives are the primary backup media. For truly large things, external hard drives are available.

  14. Goran Garevski

    Where has all the usability gone?

    Home data protection should be all about usability (assuming it works). I can understand that El Reg readers are a bunch of techies, but after years of (perl)scripting during your life – you get sick of building a new plane each time you want to fly (especially because you are the only pilot). While Time Machine addresses pretty much the backup usability aspect, I would expect that in the future there will be the ultimate file recovery interface – an advanced content based search.

    Implementing similar solution for Windows does not introduce major technical challenges. The scary parts are the patents. What is the situation here?

  15. Lottie
    Thumb Down

    Storage?

    I was assuming this was going to be an article about what disks to avoid/ the history of how they have become more reliable. We got a bitch about tape storage and then an advert for Apple.

    Whatever.

  16. Anonymous Coward
    Anonymous Coward

    Another one

    I was looking for this one, but nobody has picked it up: How about Lazy Mirror?

    it's here:

    http://www.xs4all.nl/~wstudios/LazyMirror/index.html

    Maybe there is something horribly wrong with it - (like, gee, RAID5's lack parity on read).

    If Lazy Mirror is offensive to the cognoscenti, I will stop using it (as soon as I have shut down all my RAID5 arrays).

    Why I don't use Time Machine: I hate single button mice and don't need a lecture on the efficiency of using control keys. Sure, I remember ^K^K^B and ^C/^V but why do I have to use them?

    Single button mice are just one example of the Orwellian Mac universe. There are plenty of others. Remember the PowerPC chip? RISC good/CISC bad? Oops. Forget I said that. Repeat after me: CISC good/RISC bad. PC vs mac ads? A company that needs to promote bigoty against users of competing products is just plain evil.

    And in the end it's just an effing tool.

    Get a life. It just works better than arguing about computers.

    No, I'm not arguing about computers with you. Or am i? Is black white?

    sixscrews

Page:

This topic is closed for new posts.

Other stories you might like