How do you handle data archiving? [closed]

How much data are we talking about? Our "archive" data is small enough that we just keep it in live storage (on a NAS unit) which gets backed up with the usual live data, so it exists just as our usual data does and is subject to the same recovery techniques without having to worry about keeping decade-old technology around. If our live data moves to a new storage platform, the archive moves right along with it. We also set the permissions for archive data such that only a member of the archive security group (of which there are very few member users) have access to remove anything from those folders.

If you're dealing with more data than can reasonably kept in live storage, tape may be your best bet for long-term storage, coupled with keeping at least two copies in different controlled storage locations. Bring them out every couple of years to make sure the media is still readable, and migrate from older media to new media if you're switching vendors to ensure you can continue to access the data.


In my case, we do archives to tape, and I will tell you why it makes sense for us.

First of all, the software we use, Tivoli Storage Manager, is hands down, the software we will be using for the foreseeable future. In my opinion TSM is the top of the food chain in terms of enterprise backup software, so the problem of changing to a different software platform is relatively non-existent for us.

That being said, we do maintain quite a few long term archives, and we have a few different ways of doing it. Fortunately for us, we use some of the latest tape technologies which can store upwards of 1TB per cartridge, so having a large quantity of archives live inside the library is not a problem.

  • Archive to local tapes, and copy those archives to our alternate data center, with the archive tapes sitting inside the tape library.

We do this for certain types of data because the data is restored from on a regular basis, and is important enough to require two copies.

  • Archive to local tapes, but not remote tapes, and leave the tapes inside the library.

We do this for slightly less important data, which needs to be restored from on a somewhat regular basis.

  • Archive to local tape, and send them offsite for storage.

The above practices have worked out for us pretty well in the past. We did migrate over to a totally different tape technology a couple years ago, and it was a bit of work to move the data across, but it really wasn't a big deal. In TSM it was literally one command per server worth of archives, and it would run. Might take a couple days to run, but it was not an active work effort for anyone.

The wave of the future is obviously disk based storage though.

As soon as the opportunity comes along, I will be collocating a disk array in a secure storage facility and copying long term audit type archives to a device like that.


Two EMC Centera installations (mirrored from different locations). It costs big dollars, but it works.

http://www.emc.com/products/family/emc-centera-family.htm