Data Protection Hub


All things Data Protection

Unmanaged Data Hoarding is Deadly to your Business
November 15, 2021 Preston de Guise

I’ve seen a lot of talk lately of data hoarding. Data can bring us new insights, new
business opportunities, and all those other good things — so it stands to reason that
even if you can’t think of a use for data today, you might think of a use later. Therefore,
once you capture the data into the gravity-well of your infrastructure, you don’t let it go,
because it might be a missed opportunity.

In short, deleting data you might need later could be deadly to the business.

But here’s the rub: hoarding data you don’t need yet without managing it is just as
deadly.

Leveraging data can bring in new income streams, or amplify existing ones. But data
also has a cost — the cost of storing it, protecting it and managing it. When you’re
hoarding data, you’re accepting there is an up-front cost associated with that hoarding
which only might someday pay off for the business.

Here’s a slight modification to a data lifecycle management diagram I included in the
second edition of Data Protection:

Figure: Data Lifecycle Management Process. Adapted from Data Protection: Ensuring Data
Availability (2nd Edition)

My take is this: if you’re hoarding data but not managing it through a lifecycle like the
one above, you’re wasting money.

For every 1TB of data you have sitting in primary storage, you’re probably consuming
30-50TB or more in protection/logical storage: RAID, snapshots, replicas, operational
backup retention, replicated operational backup retention, long-term backup retention,
and replicated long-term backup retention. Then, assuming you’re not wall-to-wall with
tape, all those backups will have RAID, and some of them will have snapshots, too.
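
To make that arithmetic concrete, here is a rough sketch in Python. The layer names follow the list above, but every multiplier is a hypothetical figure chosen purely for illustration (for example, 12 assumes something like monthly fulls kept for a year); substitute numbers from your own retention policies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CopyLayer:
    name: str
    logical_tb_per_primary_tb: float  # logical TB held per 1 TB of primary data


# ASSUMED multipliers, for illustration only. They are not measurements.
ASSUMED_LAYERS = [
    CopyLayer("primary, including RAID overhead", 1.25),
    CopyLayer("snapshots", 1.0),
    CopyLayer("replica, including RAID overhead", 1.25),
    CopyLayer("operational backup retention", 8.0),
    CopyLayer("replicated operational backup retention", 8.0),
    CopyLayer("long-term backup retention", 12.0),
    CopyLayer("replicated long-term backup retention", 12.0),
]


def logical_footprint_tb(primary_tb, layers=ASSUMED_LAYERS):
    """Total logical TB consumed across all layers for the given primary TB."""
    return primary_tb * sum(layer.logical_tb_per_primary_tb for layer in layers)


if __name__ == "__main__":
    for layer in ASSUMED_LAYERS:
        print(f"{layer.name:45s} {layer.logical_tb_per_primary_tb:6.2f} TB")
    print(f"{'total per 1 TB of primary data':45s} {logical_footprint_tb(1.0):6.2f} TB")
```

With these assumed figures, 1TB of primary data lands inside the 30-50TB range; longer retention periods or more replicas push it higher.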

That 1TB of data you decide to store on your primary systems is the proverbial pebble
kicking off a landslide down the side of the mountain. Yes, to be sure, deduplication
can help reduce your storage footprint along the way, for primary storage and data
protection, but it’s not magic. Deduplication is not a singularity — it doesn’t crush data
away to nothing. (Sure, you could deduplicate everything down to a single 0 and a
single 1, but the metadata will then crush you.)
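
The metadata point can be shown with a toy example. This is a minimal sketch of fixed-size chunk deduplication, not how any particular product does it; the function name `dedup_stats` and the 8 bytes charged per chunk reference are assumptions made for illustration.

```python
import hashlib


def dedup_stats(data: bytes, chunk_size: int, ref_bytes: int = 8):
    """Toy fixed-size chunk dedup.

    Returns (unique_bytes, metadata_bytes), where metadata_bytes charges an
    ASSUMED ref_bytes for every chunk reference needed to rebuild the data.
    """
    unique = {}  # chunk hash -> chunk length, stored once per distinct chunk
    refs = 0
    for offset in range(0, len(data), chunk_size):
        chunk = data[offset:offset + chunk_size]
        unique.setdefault(hashlib.sha256(chunk).digest(), len(chunk))
        refs += 1
    return sum(unique.values()), refs * ref_bytes


if __name__ == "__main__":
    data = bytes(range(256)) * 64  # 16 KiB of highly repetitive data
    for size in (4096, 256, 1):
        stored, metadata = dedup_stats(data, size)
        print(f"chunk={size:5d}B unique={stored:6d}B metadata={metadata:7d}B")
```

On this sample, shrinking the chunk size to a single byte leaves only 256 bytes of unique data, but the references needed to reassemble it add up to several times the size of the original.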

Ask yourself this: how much of your annual data growth comes from true data growth,
and how much comes from hoarded data? You know, the data that no-one wants to
delete, just in case.
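
One rough way to start answering that question is to split a file tree by age. The sketch below assumes that "not modified in the last N days" is a usable proxy for data kept just in case, which will not hold for every workload; the function name `split_by_age` and the 365-day cutoff in the example are illustrative choices, not a recommendation.

```python
import os
import stat
import time


def split_by_age(root, cutoff_days, now=None):
    """Walk root and return (active_bytes, stale_bytes).

    A regular file counts as stale when its last-modified time is older than
    cutoff_days. ASSUMPTION: staleness approximates hoarded data.
    """
    now = time.time() if now is None else now
    cutoff = now - cutoff_days * 86400
    active = stale = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                info = os.lstat(path)  # do not follow symlinks
            except OSError:
                continue  # file vanished or is unreadable; skip it
            if not stat.S_ISREG(info.st_mode):
                continue  # ignore symlinks, sockets, devices
            if info.st_mtime < cutoff:
                stale += info.st_size
            else:
                active += info.st_size
    return active, stale


if __name__ == "__main__":
    active, stale = split_by_age(".", cutoff_days=365)
    total = active + stale
    if total:
        print(f"stale: {stale / total:.1%} of {total} bytes")
```

Modification time is used rather than access time because many filesystems are mounted with access-time updates relaxed or disabled, which makes access time a less reliable signal.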

If your business is going to do data-hoarding, it needs to archive. The only way to
stop every TB of data you hoard just in case from driving crippling storage, infrastructure
and data protection costs is to remove it from those equations. Take it off primary storage.
Limit the infrastructure for storage. Remove it from data protection. Properly archived, it
can sit in your environment for as long as you want, still be accessible, still be found,
still be indexed, ready and waiting for its purpose to be discovered.

If you’re looking for a cost-effective way to hoard data, you’d do well to look into ECS.

