Professional Documents
Culture Documents
01 - Storage Concepts
01 - Storage Concepts
Storage Concepts
In order to understand CommVault® concepts regarding storage management we need to understand how and
why we protect data, traditional backup methods, and the CommVault philosophy for data management.
Modern management of protected data is much more complicated than in previous times. Everything from
documents, e-mails, databases, and even phone conversations and video surveillance is being digitized and stored.
Though the capacity to store and transmit data has greatly increased, the philosophy of managing this data in the
backup environment has for the most part remained the same. Most data protection software simply backs up
server data and manages it based on physical locations. Using CommVault® software, intelligent, logical
management of data and information is possible to meet a variety of business needs.
The term ―Business Needs‖ will be used extensively in this book. It is important to understand that any technical
decisions regarding data storage should always be based on business needs and not technical biases or legacy
methodology. CommVault Simpana® software adds greater flexibility to the management of business
information.
As backups became decentralized, so did backup software. Software was installed individually on each computer
system requiring protection. This added great complexity to data management. With higher bandwidth and larger
tape backup systems, data could again be consolidated, however most backup software was simply modified to
meet changing needs.
CommVault Simpana® software was designed from the bottom up to be a true enterprise data management
software suite. It allows data to be managed in a centralized or decentralized environment with a common
interface for managing all protected information.
An evolutionary design in backup software emerged with the concept of policy driven data management. The idea
of managing data based on business needs rather than physical placement of data on a server allowed backup
engineers to design robust storage solutions. CommVault software implements this design strategy through the
use of storage policies A storage policy allows data to be logically centralized and managed based on storage and
retention needs regardless of whether the data itself is physically centralized or decentralized. This provides the
freedom to manage data in an infinite number of possibilities. Though this may seem like a complex method for
managing backup data, the using a singular and intuitive interface for data management simplifies the complex
tasks of managing protected data. CommVault software provides flexibility to meet all of today‘s and tomorrow‘s
business needs providing data backup, data archiving, snapshot technology, and data replication solutions through
a common management interface.
Disaster Recovery
Data Recovery
Data Preservation & Compliance
Disaster Recovery
From a server crashing, to site loss, to major disasters affecting entire cities, it is critical to protect data based on
perceived risk. Someone living in Seattle may be worried about an earthquake or a volcano eruption, while
someone in the Midwest may feel tornados are the biggest problem. Identifying potential risks allows you to
properly plan for a disaster. With all critical business systems, two numbers must be generated to properly
prepare for a disaster:
Example: If a file server crashes and users lose access to home folders, a 48 hour recovery window may suffice.
However, a critical database which is responsible for orders processing which, if down may cause your company
to go out of business, may have a four hour recovery window.
Of course everyone in your company may think their system is the most critical. Your CFO may need Accounts
Receivable up first to ensure revenue continues. Your CEO may want e-mail up first so he can monitor the
recovery. Your CIO may want the DNS server up first because without it, none of the systems can communicate
with each other. So when determining your RTO, be sure to factor in both technical dependencies and business
requirements.
Data Recovery
Disaster recovery has traditionally been the main reason we backup data but the ability to recover user data at a
more granular level is extremely important as well. Let‘s say your sales manager is working on a huge proposal
which can double your company‘s revenue almost overnight. The proposal documents are lost or become corrupt
and they are asking you to recover them. Unfortunately the most recent backup has been sent off-site and it will
take 24 hours to bring it back. By that time your company loses the deal to a competitor. For this reason, Service
Level Agreements or an SLA should be set defining the acceptable length of time to recover user data.
It is recommended to always have at least two additional copies of your critical production data available at all
times. One copy should be stored offsite for use during disaster recovery. The other copy should be kept local for
accessibility of data restoration.
Financial data usually has requirements defined to take a point-in-time backup every fiscal boundary (month,
quarter, and year) and archived for later recovery if necessary. Should problems arise, these records can be
recovered and the company‘s fiscal responsibilities and reporting validated.
E-mail correspondence should also be archived. This may be easier with some e-mail applications then others.
There are various methods to protecting e-mail from database backups to journaling. Looking beyond the backup
method, companies need to consider what e-mail needs to be retained, how the content of those e-mails can be
searched, and how easy it is to restore those e-mails. Money and time saved performing backups in a certain
manner may well be insignificant when compared to the cost of restoring the data.
Deduplication
Deduplication is the concept of writing unique data blocks only once to disk media. If the same block needs to be
written to disk again, from a duplicate file or subsequent backup of the same data, the original block is simply
referenced by the new data being written. This results in substantial space savings when protecting data.
Simpana v9 deduplication is a content aware deduplication solution which will not only provide considerable
space savings on disk, but will also greatly reduce network bandwidth consumption during data protection
operations.
Virtualization
Many people know the benefits in cost reduction and administrative overhead in virtualizing data centers. A
major issue with virtualization is by consolidating so much data in such close proximity, protecting the data
becomes more of a challenge. Another major concern is the abuse of the technology in that everyone wants a
server for something. The result is over time, hundreds or even thousands of virtual machines which makes it
more difficult to keep track of and protect the environment.
The Simpana Virtual Server Agent (VSA) simplifies protecting virtual machines by integrating with the hosting
virtual infrastructure to coordinate snapshots of the machines and then backing up the snapshots. During the
backup operations the virtual disks will be indexed providing full machine recovery, volume recovery, or granular
recovery of objects within the virtual disks.
Integrating the VSA Agent with Simpana Deduplication, hundreds of virtual machines can be protected within
short backup windows. Integrating these two components will provide for considerably faster backups and much
less disk media required to store the protected machines.
Snapshot Technologies
Using hardware based snapshot technologies provides for extremely fast protection and recovery/failover for
short term protection. Simpana software enhances these capabilities through the SnapProtect™ Agent. For
supported hardware, the SnapProtect™ agent can manage snapshots on the hardware through the CommCell
console providing easier administration. The SnapProtect agent can also backup snapshots, indexing the data
providing full snap recovery as well as granular browse and recovery capabilities.
By integrating the SnapProtect feature with Simpana application iDataAgents, application aware snapshots and
backups can be conducted. This intelligence provides a major advantage over normal hardware snapshots which
snap volumes without awareness of what is being snapped. The coordination of the SnapProtect feature and
application agents can also be extended into virtual environments to provide complete protection of virtualized
applications.
Replication
With disk storage, deduplication, and WAN bandwidth becoming very affordable, replicating data to off-site
storage has become a very effective and efficient high availability and disaster recovery solution. Simpana
software works with replicated disks and also has its own replication, Continuous Data Replicator (CDR) to
replicate production data for high availability. Using Simpana Deduplication and DASH Copy, the intelligent
replication of CommVault protected data can be used as an effective disaster recovery solution.
Cloud Storage
Cloud storage makes use of on-demand hosted storage solutions and Internet connections to protect data. It is
affordable for many companies that do not want to invest in a high end and expensive disk storage solution.
Though not ideal for backing up large amounts of data, it is a good solution for small offices, data archiving,
remote user backups, and Simpana SILO storage.
The capabilities discussed in this section will undoubtedly be expanded upon as the Simpana product suite
continues to evolve and revolutionize the way data and information is managed. Providing a comprehensive
approach to data management provides enormous power not only for Simpana administrators but application,
storage, and network administrators as well as the end user.