Online Backup and Redundancy

Do you use an online backup product/service? Ever wonder where your data are actually being stored? Ever wonder how safe and reliable that storage is?

It comes down to 1 question:

 

How much redundancy do you have?

 

Let’s look at the types of redundancy. But first a word about tape:

Disk vs. Tape Backup

In the past most backup systems used tape for storage. Tape was slow but it had much higher capacity than disk drives. Another killer feature was redundancy. Best practices for tape-based backup include keeping multiple historical tapes containing backups of your files at various points in history. Perhaps you needed to keep historical data for compliance reasons, but you also kept multiple tapes for redundancy.

This redundancy also helps protect you from data loss. If your most recent backup tape isn’t readable, you can always use the prior backup tape. You will lose the most recent items but that’s better than complete data loss.

RAID Is Not Backup

Most online backup offerings don’t use tape. They use disk. It’s cheaper now (and getting cheaper all the time), faster, and easier for the provider to use. Also, it’s “random access” — you don’t have to wind through the tape to get the file you want. But unlike tape there’s no extra disk with last week’s data.

Many providers use RAID arrays to protect against failure of an individual disk drive. This RAID can be effective in mitigating that risk, but it can fail.

How does your provider mitigate against disk failure within their data center?

Multi-Site Redundancy

In addition to risk of disk failure, there’s the risk that a data center experiences some catastrophe. Does your provider replicate your data across multiple data centers? They may store your files in an underground former bank vault with armed guards, but what if the vault takes on water or suffers a lightning strike? Can they withstand the loss of one data center, or even more than one, without losing your data?

Ongoing Integrity Monitoring

Unlike paper or film which degrade gracefully (yellowing and fading but still readable), magnetic media (disks and tapes) often fail catastrophically — one minute they’re readable and the next they’re not. Corruption happens. If you’re going to keep your data on disk, you should periodically verify the data’s integrity. Does your provider verify your backups on your behalf?

Provider’s Recovery Strategy

If an online backup provider loses a customer’s data, the only option is to start uploading the current files from the customer’s computer and hope the upload finishes before the customer suffers a disk failure or other form of data loss (e.g. customer inadvertently deleting an important file). Historical data are gone forever; the history of changes to your files can’t be recreated.

You Get What You Pay For

Most consumer-oriented online backup offerings are focused on price. Consumers would rather pay $5/month for “unlimited” backup. (Many providers limit things in one way or another by excluding certain file types or deleting old backups of external drives, but that’s another blog post). Customers get some sort of data protection, but it often comes with one or more of the risks described above.

Amazon S3 (“Simple Storage Service”) takes a different approach. It focuses on durability. S3 is:

  • Designed to provide 99.999999999% durability and 99.99% availability of objects over a given year.
  • Designed to sustain the concurrent loss of data in two facilities.

S3 is just a cloud storage system. It doesn’t come with software. That’s why I wrote Arq. Because it uses your S3 account for storage it’s a very reliable online backup solution.

Questions For Your Provider

Ask your online backup provider the following questions:

  • Where are my data stored?
  • How many data centers are my data redundantly stored at?
  • If you lose my data in one of your data centers, can you repair by retrieving it from another data center?
  • How many data centers can simultaneously lose some of my data without you permanently losing my data?
  • Do you regularly verify the integrity of my data and repair corruption using your redundant copies of my data?
  • What’s your durability design goal?

Then decide what price vs. redundancy trade-off is right for you.

 

8 Comments

  1. I agree with your opinions on this point, but I’d be more inclined to believe claims you make about your competitors if you cited sources. You cited a source for the claim that Carbonite relied on non-redundant RAID storage, but where’s your source for the claim about Crashplan?

    Comment by Ryan Grove — June 21, 2011 @ 7:12 pm

  2. This page mentions RAID 6: http://www.crashplan.com/consumer/features-datacenter.html
    It also talks about a single data center, an “underground facility within a vault of a former bank in Minneapolis, Minnesota.”

    If you could get answers from Crashplan for some or all of those questions I listed in the blog and post the answers here, that would be awesome!

    Comment by Stefan Reitshamer — June 21, 2011 @ 7:46 pm

  3. Thanks. Not to pick nits, but there’s a difference between relying on a single copy of data in a RAID array and relying on a single datacenter (in which you may be storing many redundant copies of data). Granted, more datacenters means better potential redundancy, but single-datacenter redundancy is still vastly more reliable than just a RAID array.

    CrashPlan’s website doesn’t specify whether they store redundant copies of data (I’m curious about this myself), but it also doesn’t say they don’t. Your post implies that they don’t, especially since you lump them in with Carbonite, which made that mistake.

    I’m curious though: did you ask CrashPlan about this and they refused to answer? Or have you just not asked?

    Comment by Ryan Grove — June 21, 2011 @ 8:05 pm

  4. No, I didn’t ask Crashplan about that. I removed the term “Crashplan” from the blog post since it’s really not relevant to the point, and unfairly implies something about them. Thanks for pointing it out!

    Comment by Stefan Reitshamer — June 21, 2011 @ 8:40 pm

  5. Classy move. Kudos. I’ve asked @crashplan on Twitter about their storage redundancy. Will post here if I hear anything back.

    Comment by Ryan Grove — June 21, 2011 @ 8:44 pm

  6. I asked @crashplan, “I’m curious: does CrashPlan Central store redundant copies of data in case of disk failure, or do you rely entirely on RAID?”

    They replied, “CP offers multi-destination, local drives, and other computers– sets too! CP does not rely exclusively on raid.” – http://twitter.com/crashplan/status/83376885949669376

    Not exactly the answer to the question I asked, unfortunately.

    Comment by Ryan Grove — June 21, 2011 @ 11:42 pm

  7. I agree these questions are very valid and I use Arq + s3 for these reasons and more

    but from the article quoted it’s sounding like people are expecting $5p/m internet based backup to be all they need.

    Arq + s3 is just one layer of backup, to protect me from just one type of instance, namely fire or theft.

    Even though I backup the newest stuff to the cloud daily I still have local redundancy to a second set of HD’s etc etc

    One day, with super fast internet we may be able to get away with only cloud based backups, but at the moment, with TB’s, it’s too expensive and too slow.

    Comment by mcg — August 5, 2011 @ 2:50 am

  8. I can vouch that Crashplan’s online backup is not redundant. They just lost my entire online backup (180 GB) due to a “hardware issue”. Their customer service is very responsive. They are sending me a drive to reseed my backup and gave me a complete refund for the previous seeding service I had ordered from them. It’s still a pain in the ass, though.

    You can make your Crashplan backup redundant by using their software to backup to online, and other sources such as a external drive or your friends computer. I think this is more than many of the other online backup services can say.

    Comment by Scott — December 21, 2011 @ 9:46 pm

RSS feed for comments on this post.

Sorry, the comment form is closed at this time.