Professional Documents
Culture Documents
and Support
Student guide
(1 of 2)
V2.0
Confidential – For Training Purposes Only
Use of this material to deliver training without prior written permission from HPE is prohibited.
Copyright 2019 Hewlett Packard Enterprise Development LP
The information contained herein is subject to change without notice. The only warranties for
HPE products and services are set forth in the express warranty statements accompanying
such products and services. Nothing herein should be construed as constituting an additional
warranty. Hewlett Packard Enterprise shall not be liable for technical or editorial errors or
omissions contained herein.
This is an HPE copyrighted work that may not be reproduced without the written permission of
Hewlett Packard Enterprise. You may not use these materials to deliver training to any person
outside of your organization without the written permission of HPE.
Printed in the US
Data deduplication
Data deduplication is the ability of a software application or an appliance to compare blocks of data being written to
disk with data blocks that currently reside on that disk. Deduplication technology analyzes backup data and stores
each unique block of data only once, using pointers for additional instances of the same data. When duplicate data is
found, a pointer is established to the original set of data, as opposed to actually storing the duplicate blocks, removing
or “deduplicating” the redundant blocks from the volume.
A very important point is that the deduplication is being done at the block level and not at the file level. In fact, beware
of products that only dedupe at the file level (sometimes called “single instancing”). For example, assume you are
backing up a very large database that changes throughout the day. With a typical backup application, you must back
up, and more importantly, you must store the entire database with each backup. An incremental backup does not help
you in this case.
With block-level deduplication, you can back up the same database to the device on two successive nights and,
because of the ability to identify redundant blocks, only the blocks that have changed will be stored. Any redundant
data will be stored as pointers instead.
Often, deduplication is used in conjunction with other forms of data reduction, such as conventional data compression,
to further reduce the data volume stored.
Key benefits of data deduplication
– Keep more data on disk:
– Allows longer retention of backup data on disk before archiving off, or purchasing more backup storage.
– Improves service levels and business performance—data is available on disk for longer for fast recovery of lost
or corrupt files from multiple backup points
– More efficient storage utilization—Effectively reduces cost per gigabyte for backup data
– Enables more efficient replications network:
– Cuts the costs of data transmission through lower bandwidth lines
– Makes remote backup and disaster recovery more affordable for a wider range of businesses
– Reduces the overhead required at remote sites required for backup, media management, and off-site vaulting
Chunking example
You can think of this image backup example as a part of a sequence of sentences. This example shows only 1
sentence.
Every file is composed of one or more words. In the example of fixed chunking, every file is a fixed size of 10
characters.
Housekeeping
If data is deleted from the StoreOnce system (e.g. a virtual cartridge is overwritten or erased), any unique chunks will
be marked for removal, any non-unique chunks are de-referenced and their reference count decremented. The
process of removing chunks of data is not an inline operation because it would significantly impact performance. The
process of cleaning up after deletions, termed “housekeeping,” runs on the appliance as a background operation.
Housekeeping runs on a per-cartridge and per-NAS file basis. By default it will run as soon as the cartridge is
unloaded and returned to its storage slot, or as soon a NAS file completes writing and is closed by the appliance
unless it is prevented by a configured housekeeping blackout configuration. The housekeeping blackout windows are
configured across the whole appliance.
Housekeeping places a significant load on the StoreOnce device. The housekeeping process checks for available
resources before running and, if other operations are in progress, the housekeeping dynamically reduces its activity to
prevent impacting the performance of other operations. However, the hold-off is not binary (i.e. on or off) so, even if
backup jobs are in process, some low level of housekeeping can still take place which may have a slight impact on
backup performance unless a blackout window is configured.
Before we could set housekeeping windows, housekeeping ran in the background as soon as the device was closed. If
backups and replication were running, this could lead to unpredictable performance. Now we can control precisely
when Housekeeping runs. It should be scheduled to occur during “quiet times” when no other major activities are
running.
Housekeeping times
Best practices for housekeeping blackout windows:
– Only use housekeeping blackout if backup or replication is not completing in the required window, after a rotation
scheme starts to overwrite backups.
– Blackout windows for housekeeping (and replication) make overall performance more predictable, but might not
deliver the overall quickest completion time.
– Housekeeping is an important process and must be allowed enough time to run.
– Apply a blackout window to cover the backup and replication/tape offload times, to have the greatest impact.
– Some trial-and-error might be required to get the windows in the right place because placing a window will change
backup and replication performance (and therefore the end time).
What is replication?
HPE StoreOnce Replication allows cartridges in slots on a library device that have been configured on one HPE StoreOnce
Backup System (the Source Appliance) to be replicated to corresponding mapped slots on a library device on another HPE
StoreOnce Backup System (the Target Appliance) at another location. Data is always mapped at slot level.
NOTE: If the replication Source Appliance fails and goes offline while it was performing a NAS replication job, the replication
target share appliance may be left in an inconsistent state.
This state may result in failure to recover from some files in the share or perform a list of files in the share because relationsh
between files and folders have been lost. This will automatically be repaired by the target appliance as soon as it detects the
of the source appliance, however it may take up to 10 minutes to complete this repair during which time the share will not be
available for access.
NOTE: Replication-Target libraries are not visible to the host, unless you configure devices to make them visible. See the HP
StoreOnce Backup System user guide for more information.
Once you have set up a replication configuration between Source and Target HPE StoreOnce Backup Systems, it is initiated
automatically whenever a cartridge is returned from a drive to a slot that has been configured for replication or when a file is
created or modified or deleted in a NAS share.
Replication Target libraries are not visible to the host, unless you configure devices to make them visible. See the HPE
StoreOnce Backup System User Guide for more information.
There is no replication for Catalyst Data Stores since objects in Catalyst data stores are copied between Catalyst data stores
an ISV backup application that supports Catalyst so the copy can be scheduled and is known by the backup application.
Catalyst Copy is the equivalent of Virtual library and NAS share replication. The same principles apply in that only the new da
created at the source site needs to be copied (replicated) to the target site. The fundamental difference is that the copy jobs a
created by the backup application and can, therefore, be tracked and monitored within the backup application catalog as well
from the StoreOnce Management GUI.
.
Bidirectional replication
In this example there are two offices, each with an HPE StoreOnce Backup System. The HPE StoreOnce Backup
System in Office A is being used by the host server for backup and restore, and all Replication-Source libraries and
shares are being replicated to matching Replication-Target libraries and shares on the HPE StoreOnce Backup
System in Office B. However, there is also a single host backing up to six cartridges on another library on the HPE
StoreOnce Backup System in Office B. Two of those cartridges are configured to replicate to a Replication-Target
library on the HPE StoreOnce Backup System in Office A. A replication license is required for both HPE StoreOnce
Backup systems.
These are the four main usage models for replication using StoreOnce VTL and NAS devices.
– Active/Passive: A StoreOnce system at an alternate site is dedicated solely as a target for replication from a
StoreOnce system at a primary location.
– Active/Active: Both StoreOnce systems are backing up local data as well as receiving replicated data from each
other.
– Many-to-One: A target StoreOnce system at a data center is receiving replicated data from many other StoreOnce
systems at other locations.
– N-Way: A collection of StoreOnce systems on several sites are acting as replication targets for other sites.
The ability to connect StoreOnce Catalyst to internal and external cloud service providers
Supported with major Cloud Service Providers (MS Azure, Amazon S3) as well as internal Scality ring
Only available as Catalyst copy target NOT DIRECT BACKUP TARGET
Ideal for off site data storage such as data archiving
Cost effective disaster recovery option
Catalyst copy available with ISV’s, Catalyst Plugin and HPE RMC
Cloud Bank Storage is a StoreOnce feature to allow a customer to store the deduplicated data in public clou
storage (AWS/Azure) or private cloud storage (Scality) as opposed to within the StoreOnce System.
The functionality addresses three potential use cases:
-Customer wants long term offsite retention of backups
-Customer wants offsite disaster recovery copies of backups
-Customer wants to reduce the cost of running a StoreOnce VSA in the cloud
-Objects written to cloud storage contain all the information necessary to reconstruct all the backup data within the Cloud
Bank Catalyst Store.
-Locally the Cloud Bank Catalyst Store will also maintain a Catalyst database containing details of each Catalyst Item for th
purposes of providing quick listing responses to data protection software and dedupe will maintain a Sparse Index and
copies of all the Container Indexes for the purpose of providing quick deduplication matching responses
Bonding modes
See http://linux.cloudibee.com/2008/02/network-bonding-part-ii-modes-of-bonding/
SAN technologies
– In order to share the resources, storage area networks (SAN) are quite common (such as a backup server and
media agent sharing the same D2D appliance).
– Depending on the model, the StoreOnce D2D supports two types of SAN: FC-SAN and iSCSI-SANs.
– In order to consolidate all three types of networks (LAN, iSCSI, and FC), a new technology has been established.
– The so-called “Converged Enhanced Ethernet (CEE)” also uses the Ethernet network technology of the LAN/iSCSI
networking for the FC protocol.
– Interfaces that can handle all three types protocol (LAN, iSCSI, and FC) are the converged network adapters
(CNAs) used in HPE BladeSystem, for example.
– A SAN is built up of different components: HBA–Fabric–Target
– HBAs used in a SAN:
– iSCSI: Network interface card (NIC)
– FC: Host bus adapter (HBA)
– In a SAN, iSCSI devices are connected using the ISCSI Qualified Name (IQN). A host must have an iSCSI initiator
installed, in order to connect to an iSCSI SAN when using normal Ethernet NICS.
– In an FC SAN, devices are recognized using the World-Wide-Name (WWN). This information is stored, for
example, on an FC HBA, or emulated by software (such as StoreOnce) for target devices.
– Fabric components in a SAN:
– Local SANs use switches for connection between server and storage devices:
– FC: FC switch ( 8 Gb or 16 Gb)
– iSCSI: iSCSI switch (1 Gb or 10 Gb)
– These switches need some characteristics in order to work with a HPE StoreOnce D2D Backup System. An FC
SAN switch must support NPIV to virtualize N-Ports to use more than one device on a physical port.
– An iSCSI switch needs flow control and Jumbo Frame support to work in an iSCSI SAN correctly.
FC SAN settings (1 of 2)
– As compared to iSCSI-SANs, FC-SANs have a fixed bandwidth. The main focus is the traffic separation.
– It is important because Windows based servers attempt to write volume labels to all available LUNs. This can
render the LUNs unusable by other operating systems, and can result in data loss.
HK766S H.00 92
HPE StoreOnce Gen4 vILT
The HPE StoreOnce appliance is based on a Linux operating system with a heavily tuned and
customised configuration. Customer have limited access to the operating system but use a web-based
management interface and a command line interface. The backup (media) server communicate via fibre
channel for virtual tape or IP network connection for iSCSI virtual tape, NAS or StoreOnce Catalyst. The
disk storage is arranged in RAID configuration. Backup servers, sometimes know as media servers,
running data protection software read/write data via virtual tape drives, NAS shares, or catalyst stores.
512 emulation(512e) 4K physical sectors on the drive media with 512 byte logical configuration
512n – 512 byte sector size
Skylake SP (Skylake Scalable Performance) is the code name for Intel's series of
server multiprocessors based on the Skylake microarchitecture as part of the Purley platform serving as a
successor to both Broadwell EX and Broadwell EP. These chips support up to 8-way multiprocessing, up to 28
cores, and incorporate a new AVX-512 x86 extension. Skylake SP-based chips are manufactured on an
enhanced 14nm+ process and utilize the Lewisburg chipset. Skylake SP-based models are branded as
the processor families: Xeon Bronze, Xeon Silver, Xeon Gold, and Xeon Platinum.
Catalyst and Replication deployment SKU’s stay the same as with Gen3.
HA124A1#5V0 and HA124A1#5V1 (add’l hours) will also be available for Gen4
Installation checklist
DHCP network addressing is enabled by default. For a Quick Install, connect the HPE StoreOnce appliance to LAN
port 1. If your network uses static IP addressing, you will need IP addresses, domain names, and gateway details for
all ports that are to be connected. If you intend to use network port bonding, you will need to know which bonding
mode is required for your network.
NOTE: An installation, planning, and preparation guide and checklist is available at hpe.com to support the customer
and installer through this process.
Flex is triggered by installation of a flex config license or a system start/reboot. You can re-trigger the
flex config validation be re-installing a flex license – the installation will fail if the license already exists
in the store, but this will still trigger a re-evaluation of the hardware and licenses.
For example if the customer had installed cards in the following configuration:
Flex would determine that the two NIC cards in slots 1 & 2 were correctly installed and would activate
them if suitable licenses were installed. However, slot 4 contains an HBA card so the NIC card in slot 5
is incorrect as there is a gap in the installed load order (the HBA card in slot 4). This card would not be
licensed.
Flex would then attempt to validate the installed HBA cards. The HBA card in slot 4 is invalid as there is
a non HBA card in slot 5.