Professional Documents
Culture Documents
dp55 File Library
dp55 File Library
hp OpenView
storage
data protector 5.5
Version: A.05.50
Hewlett-Packard Co.
Network Storage Solutions
-1-
DP File Library
Table of Contents
Advanced backup to disk – File Library....................................................... 3
Contents .............................................................................................. 4
Overview – Advanced backup to disk ....................................................... 5
Positioning of backup to disk technologies.................................................. 7
Backup to disk-based devices - benefits...................................................... 8
What is a File Library ........................................................................... 10
Concept (1)........................................................................................ 11
Concept (2)........................................................................................ 12
Use case – example for “Disk Staging” .................................................... 14
New file library license: B7038AA – Example: ......................................... 15
Implementation in GUI: create a file library............................................... 16
File library – repository: define directories / number of writers .................. 17
File library – define media type ........................................................... 19
File library – summary pane ................................................................ 20
File library – repository: directory properties ........................................... 21
File library: drive / writer properties – general tab .................................. 23
File library: drive / writer properties – settings tab................................... 24
File library: file depot properties – general tab........................................ 25
File library: file depot properties – info tab............................................. 26
File library: file depot properties – objects tab ........................................ 27
File library: file depot properties – usage tab.......................................... 28
File library – media view .................................................................... 29
File library – device view.................................................................... 30
File library: new notification ................................................................ 32
Changes in CLI / mmdb and configuration .............................................. 33
Compatibility / what is supported .......................................................... 38
Limitations / performance / supportability ................................................ 39
Lab for file library: ............................................................................... 43
-2-
DP File Library
-3-
DP File Library
Contents
• Overview – Advanced backup to disk - benefits
• Positioning of backup to disk technologies
• What is a file library
• Concept
• Implementation in DP GUI:
− how to create/configure a file library
• Changes in CLI
• Configuration files
• Compatibility / what is supported
• Limitations / performance / supportability
Contents
This is a content list of items which will be presented and discussed in this training
chapter.
-4-
DP File Library
August 5, 2004 3
Many applications and databases frequently make small changes to existing files or
produce many new files containing business-critical data throughout the working
day. These files need to be backed up immediately to guarantee the data in them
will not be lost. This requirement means that a fast medium which can store large
amounts of data that works without interruption is necessary for storing data.
Disk-based storage media has become increasingly cheaper in recent years. At the
same time, the storage capacity of disks has risen. This has led to the availability of
low-cost, high-performance single disks and disk arrays for storing data.
Disk backup (also known as disk-to-disk backup) is becoming ever more important.
In the past, tape storage was the favored medium for backup and restore because
of its price and effectiveness in meeting disaster recovery requirements. Today,
more and more businesses are augmenting their tape storage backup solutions with
faster disk-based backup solutions. This ensures faster data backup and recovery.
-5-
DP File Library
On the slide above you see the advantages of a backup to disk solution.
Of course there are some disadvantages compared to tape to consider as well:
− Bad price performance ratio for sequential read/write
− Remote replication to a safe place might be quite expensive
− Not virus resistant, e.g. Windows to Windows backup
− Most disks are not designed to be moved
− High power consumption
− Low MTBF (Mean Time Between Failure)
− ATA: 3 years with 80% idle time
− SCSI: 3 years, no idle time
− No hardware compression build in
-6-
DP File Library
P
t0 t1 t2
How? mirror, snap… backup to disk backup to disk
-7-
DP File Library
-8-
DP File Library
24 hours to enable fast, convenient restore without the need to stream data
from tape first.
Mechanically, a disk-based device is quicker to use than a tape. When using
a file device there is no need to mount and unmount a tape. When backing
up or restoring a small amount of data, a disk-based device is quicker
because it does not need the initialization time that a tape drive requires.
With a file device there is no need to move a robotic to load or unload
media, which consumes more time in a small backup or restore. This is
especially true when restoring from an incremental backup.
The risk of media problems such as faulty tapes and tape mounting failures
are reduced to a minimum. The availability of RAID disk configurations
provides protection of data in cases where a disk fails.
Overhead costs are reduced because there is no need for tape handling, for
instance when performing incremental backups to disk.
Overall, disk-based storage space is becoming increasingly cheaper than
tape-based storage.
Jukebox
The jukebox device is a logical equivalent of a tape stacker. It contains slots whose
size is defined by the user during initial device configuration. This device is
configured manually. The jukebox properties can be altered while it is being used.
Each slot in the file jukebox device has a maximum capacity of 2 TB. The device’s
maximum capacity is equal to:
Number of slots x 2 TB
-9-
DP File Library
Out of the three disk-based devices, the file library device is recommended for use
as an unattended backup device.
You specify the capacity of a file device when you first configure the medium. It is
possible to re-set the sizing properties of the file library at any time during use of the
device in the Data Protector GUI.
The file library device can be located on a local hard drive, or even on a network
share, as long as Data Protector knows its path. The directory path is defined at
configuration of the file library device.
However, it is recommended to use a local disk or a disk in SAN. Disks connected
via NFS/CIFS links provide only a slow connection which also is not reliable.
- 10 -
DP File Library
Concept (1)
• File library is a disk-based device
• Backed up data is written into files on disk in DP
tape format
• Similar concept as for File-device and File-Jukebox
• Consist of configured directories used as container
for files
• These files are called “file depot”
• By default each backup/copy session creates a
new file depot
• Each file depot has a unique file name
August 2, 2004 6
Concept (1)
A file library consists of configured directories which include files where the data is
stored into. The directories are configured at the initial configuration of the file
library device. The files inside are called “file depot” and they are created each
time a backup or copy session is made to the file library. If the amount of data
being backed up is larger than the maximum file depot size, Data Protector creates
more than a single file depot for a backup session. The backed up object will span
over two file depot’s. A file depot is equivalent to a tape media in a slot, whereas
the directories represent the repository (slots) part of a library. As a consequence,
many of the known media operations can be applied, e.g. scan, format, recycle,
export, etc. However, some operations are not available, e.g. eject.
The name of each file depot is a unique identifier which is automatically generated
by the system. It looks similar to the Data Protector media ID, but actually it is not a
media ID, instead just a unique file name. Example (including path):
“C:\data\backup\0100007f54106d9295058c50008.fd”
Since each file depot contains backed up or copied data, a corresponding DCBF
file keeps the detail catalog information for it in the IDB. Thus for each file depot a
corresponding DCBF file exists.
- 11 -
DP File Library
Concept (2)
Concept (2)
The size of file depots is defined when you initially create the file library device.
During this process you specify all sizing properties for the device, including the
maximum size of the file depots. The sizing properties of the file depots, although
only entered once, are globally applied to each file depot within its directory. If the
size of data to be backed up within one session is larger than the originally
specified file depot size, Data Protector automatically creates more file depots until
the allocated disk space for the file library device has been consumed.
On Windows the maximum recommended file depot/slot size is 50 GB, although
the standalone file device has been tested on Windows with file depots of up to
600 GB. On Unix the maximum recommended file depot size is 2 TB.
Analog to the file jukebox file drives can be created as well; these are called
“writers”. The naming convention for the writers is:
<file library name>_Writer<number>
For each newly created file library, by default there will be a new media pool
created with the naming convention <library name>_MediaPool. Of course the
user can change this setting to any other existing mediapool of type ‘file’.
- 12 -
DP File Library
- 13 -
DP File Library
Disk Agent
Stream to tape
Backup
Disk Agent
Tape
NEW:
direct restore from tape
Restore:
Fast restore from disk if Backup 1: Store on disk
data still available there and copy to tape on
scheduled basis; expire
backup on disk as disk fills
September 3, 2004 8
There are some use cases where a file library as part of a disk staging concept is
very useful, e.g.:
- continuous backup of transaction log files (no overhead through media
load/unload and for tape drives there is no issue with start/stop mode)
- backup of slow clients without multiplexing
- tape-less backup of branch offices
- works similar as a virtual tape library
- 14 -
DP File Library
Disk Agent
Media Agent Media Agent
Disk Agent
Stream to tape
2.5 TB
This new product number B7038AA includes the license-to-use (LTU) for 1 TB of
backup disk storage. It is required once per terabyte (TB) used backup disk storage
in one DP cell and is platform independent (e.g. same license for UNIX and
Windows systems). Used backup disk storage is defined as the space occupied by
protected backups and could be distributed over multiple disk arrays and systems.
Any RAID overhead is excluded and thus does not need to be considered.
The new file library requires only the B7038AA license(s), it does not require any
drive and library LTU’s, regardless how many ‘writers/drives’ and ‘filedepots/slots’
are configured.
- 15 -
DP File Library
In previous DP version (DP5.1 and older) backup to disk was implemented using
the standalone file device and the file jukebox, which contains file devices. The
creation and usage of the standalone file device and the jukebox with file devices
will be kept as it is. This includes the known licensing modes as well.
On the above slide you see the device creation wizard. Its first window allows
defining the file library name and its description. As the device type you select from
the pulldown menu ‘File Library’. And you have to select the associated client
system from which the file library will be controlled and used.
- 16 -
DP File Library
In this page specify a directory or a set of directories where you would like the file
library to reside. You can use the browser (click on browse button to open the
dialog) to select the directory to be used as container for the file depots. Multi
selection is not possible in this browsing dialog.
Please note: the directories has to be on different filesystems(!) (this is enforced) and
they must exist on the disk (DP will NOT create them).
Also important: the disk on which the file library device will reside must be visible in
the filesystem. Create the directory(ies) on either an internal or external disk, e.g.
C:\backup\filelib_dir1\
D:\backup\filelib_dir2\
IMPORTANT:
It is critical that the directory created for the file library is not deleted from the disk.
If it is deleted, any data within the file library device will be lost.
At the time the file library is created it does NOT contain any file depots. File
depots will be created as needed when a backup is made to the device.
- 17 -
DP File Library
You can also define the number of writers to be configured for this file library. The
default value is the number of configured directories. The naming convention for the
writers (drives) is:
<FileLibraryName>_Writer<Number>.
This name as well as the other “Writer” properties can NOT be changed at the
point of creation of the file library, but any time later you can modify the writer
(driver) specific properties by selecting the writer and opening its properties
window. A writer is the equivalent to a drive! For each writer which is used to write
data during backup a separate BMA will be running. Increasing the number of
writers might improve the performance, but will also consume more system
resources, e.g. memory.
For each newly created file library there will be a new media pool created. The
naming convention for this media pool is <FileLibraryName>_MediaPool.
- 18 -
DP File Library
File library –
define media type
In this page please select the type of media. The only available selection is the type
“File”.
- 19 -
DP File Library
File library –
summary pane: click finish to create
August 2, 2004 13
This page is the summary pane and it shows you details of the current file library
device settings. It includes the following information:
- Directory name: the full path to where the file library device is saved.
- Total size: the total amount of space allocated to the entire file library device.
- Used: the amount of space allocated to the file library device which has already
been used for backups.
- Maximum available disk space for backup: the total amount of space available to
save data in the file library device.
- Maximum File Depot Size: the maximum size of a single file depot.
- 20 -
DP File Library
For each configured directory you can specify the sizing details in the properties
dialog window. This can be accessed in the repository tab by selecting one of the
available directories.
Any changes made will be applied to each file depot created in this directory. The
properties of any file depot created before this change will not be affected.
- 21 -
DP File Library
This table shows the default values as well as minimum and maximum values for
each setting:
Disk Properties
Total directory size = 100 GB
Used directory space = 20 GB
File Library Device Properties
Free disk space setting = 30 GB
Available Writable Space Calculation
Available space minus Free disk space 80 GB - 30 GB = 50 GB
Available writable space = 50 GB
Data Protector performs the calculation above before using a file depot. If the
amount of space is insufficient, Data Protector sends a message to notify you. This
check is made at the start of each back up session.
If this is below the minimum of free disk space required to create a new file depot a
special mount request (space request) will be issued to ask for free disk space.
- 22 -
DP File Library
If you open the Devices & Media context and expand the devices item, you will
see the list of configured devices.
After you expand the file library item you can see two more sub items: directories
and drives. If you expand these both as well you can see the directories structure
with the created file depots and the created drives.
If you open the properties for a writer (drive) there will be two tabs available:
1. general and
2. settings
In the general tab pane you can change the default device (writer) name and its
description.
- 23 -
DP File Library
August 1, 2004 16
In the Settings tab pane you can access and change the media pool to be used as
well as the advanced options (e.g. concurrency, block size, segment size, lock
names, CRC, etc.) and in case needed you can also disable the device (writer).
This is done the same way as for regular tape devices.
- 24 -
DP File Library
Similar as opening the properties for writers/drives you expand the file library
directory structure and get access to the file depot properties.
There are four tabs available:
1. General
2. Info
3. Objects
4. Usage
Under general tab you can modify the description and location. Here you can also
see the associated media label, location, and media pool.
- 25 -
DP File Library
In the info tab pane you can view the file depot specific information such as:
- medium ID
- Quality
- Protection
- Create date/time
- Last access
- etc.
- 26 -
DP File Library
- 27 -
DP File Library
In the Usage tab pane you see the estimated used and free space on the medium.
This pie chart displays the usage information relative to the defined maximum of this
file depot. This is visible from the device view perspective.
Similar properties exist for the specific media from the media view perspective.
However, if the media usage policy is set to non-appendable (default) the usage
there shows always 100% used (red pie chart, no free space on media) for
protected media or 100% free for unprotected media (green pie chart).
This is due to the file library concept to allocate space as needed during the
backup/copy session. There is no pre-allocation of disk space.
Simply only this amount of disk space will be allocated as needed to write the
data, not more and not less.
- 28 -
DP File Library
All file depots have a corresponding media ID associated. And of course this
media is grouped in a media pool. This media pool is automatically created at the
creation time of the file library. You can view all media belonging to a media pool
and perform some operations on the media. This includes
- Export
- Change location
- Recycle
- Move to pool
- Copy
- Verify
- Import catalog
- Select media and
- Properties
- 29 -
DP File Library
Similar as for the media view applies to the device view. By expanding devices
item, expanding file library and its directories you can view and access the listed
file depots. Similar operations as before for media can be performed here as well
on every file depot:
- Scan
- Format
- Import
- Export
- Recycle
- Import catalog
- Properties
- Import
The import function can only be executed on file depots which have been
previously exported from this file library. You can NOT import any other file depots,
e.g. from a different system or device files created by a standalone file device or a
file jukebox. It is rather the other way around: you can import file depots from a file
library using a standalone file device or a file jukebox.
- 30 -
DP File Library
- Export
In addition the export function provides a different result as well:
If you export a file depot from a file library, this needs first to be recycled to remove
any protection then it can be exported. Any reference will be removed from the
IDB. However, export does not mean the file depot will be totally removed from the
file library (as one would expect). Instead you still can see the file depot icon, but
the name and its details disappeared and the item is inactive (gray). Its also not
visible in the media pool.
Now you can delete these inactive file depots.
- Recycle
The recycle operation of a file library also works slightly different:
You can recycle either an individual file depot or all of the file depots in a file
library. This means that the protection will be set to none and that the disk space
which has been allocated for the recycled items, will be re-used in the next backup.
This is done by deleting the unprotected file depot(s) and creating new ones.
- Delete
To delete a file library you first need to remove the protection from all data stored in
file depot(s) of this file library. This is done by the recycle process. Then all file
depot(s) must be exported from IDB. Then you can delete the entire file library.
- 31 -
DP File Library
August 5, 2004 23
- 32 -
DP File Library
Changes of configuration:
• Media allocation procedure
• Media retention policy
• FileMark table extention
• ‘Disk Full’ handling
Example: A file library is with only one directory configured. The directory has a
size of 100 GB. Out of this 20 GB have been already used. The free disk space
has been set to 2 GB. This still leaves 78 GB to write data into this directory.
NAME "Filelib_4"
DESCRIPTION "This is a FileLibrary"
HOST testsystem.bbn.hp.com
POLICY Filelibrary
TYPE File
REPOSITORY
"/var/datastore/FL_4/dir1/0f8b594f53f7bfbe850b055038b.fd"
"/var/datastore/FL_4/dir1/0f8b594f53f7bfbe850b055038e.fd"
- 33 -
DP File Library
"/var/datastore/FL_4/dir1/0f8b594f53f7bfbfa50b05504bd.fd"
"/var/datastore/FL_4/dir1/0f8b594f53f819e2452c535021e.fd"
DIRECTORY
"25.00 100 2000 10 /var/datastore/FL_4/dir1"
- 34 -
DP File Library
depot. All directories that are already used by any drive will be put at the
bottom of the list.
MMD sends the list of directories to BSM.
BSM forwards it to BMA.
BMA goes through the list and checks each directory for empty space. As
soon as it finds one it sends the name of this directory back to BSM.
BSM sends this directory name to MMD.
MMD receives the name of directory. It creates a new file depot in this
directory.
MMD sends the name of new file depot to BSM.
BSM forwards file depot’s name to BMA.
BMA formats file depot and starts writing data into it.
Example:
Media with expired data will be deleted as soon as possible. With this minimum
retention the usage of disk space is minimized which is appropriate for disk space.
The task of deleting expired media will be executed by the BMA when it requires a
new media (MSG_SLOT), which means at the start of a backup or copy session
and when a new media is requested. The deletion of expired media will be done
in the following steps:
- 35 -
DP File Library
one CS are always tightly related together. One FM will handle exact one of these
segment-offsets. This results in 470 FM-Pairs ((946 – 2(EOD) - 4(FMBuffer)) : 2) for
the DS-CS-Pairs.
The currently supported maximum DS-Size is 2GB and CS-Size is 12MB. This
results in 940GB (470 DS’s * 2GB) maximum backup data and about 6GB (470
CS’s * 12MB) catalog data.
The theoretical maximum reachable file device size is 950GB.
Note: This is a theoretical value because DS and CS will not be filled up both to
the maximum. Typically only one of them (DS or CS) reaches the maximum size.
Whereas the maximum OS-Supported file size is 2TB (e.g. supported on Windows,
HP-UX, Solaris, Linux).
2. 2.
… DS CS 8KB DS CS … DS CS 8KB … DS CS
}
MAFileMarkBlock
CatalogSegment
DataSegment EndOfData
- 36 -
DP File Library
At the beginning of a backup the FM-Table inside the MablkHeader will be used
until this FM-Table is filled up (see point 1. in above picture). The current
implementation does stop to write the current file-depot and request another one.
The extended FM-Table feature writes a new FileMark-Block behind the current CS
and progress with the backup session. All new FM’s after the FileMark-Block will be
written to that FM-Block, until this block is filled up too. Then the next FileMark-Block
will be written (see point 2. in above picture). The FM-Table contains a new offset
value, which points to the offset of the next FM-Table.
FM1 FM 2 FM 3 … FM n …
- 37 -
DP File Library
• compatibility:
− NO manual import of device files possible / supported
− DP5.0/5.1 DA compatible with DP5.5 MA (file library)
• supported platforms:
− HP-UX (including 11.23, 11.31)
− Windows 2000/XP/2003
− Sun Solaris
− Linux (RedHat, SuSe)
− AIX
− True64
− OpenVMS
− Netware
August 3, 2004 24
Since the file library has been introduced with Data Protector 5.5 for the first time
there are no ‘old’ file library configurations or ‘old’ file depots where backwards
compatibility needs to be considered. In any case to configure and use a file
library, a DP5.5 MA in a DP5.5 cell is required.
It should also be possible to use ‘older’ DP disk agents to send backup data to a
DP5.5 MA to write this into a file library. However, officially it is only supported to
have all DP agents in a cell of one version (5.5).
- 38 -
DP File Library
Limitations
- 39 -
DP File Library
Performance
Each customer configuration and disk layout is somehow different. Therefore it is not
possible to offer a setup and configuration that would provide best performance in
all these customer cases. If the customer complains about bad performance it is
essential first to look where really the performance bottleneck is and as next tries to
identify the root cause. If the result of such an analysis would be that it is because
of a poor file library configuration, you might try to improve the performance by
changing some of the configuration parameters, e.g.
- change location of the directories in order to use different, maybe faster disks
- change the concurrency of a writer or increase the number of writers
- check if there is a conflict with other applications using intensively the same
disks
- consider to use faster disks
- etc.
Supportability
The free disk space is configured per directory. DP will check for free disk space at
the creation of new file depot and at the start of a backup. The size of the new file
depot will be calculated in that way that even if device becomes full, there will be
still enough disk space available to complete the write of the catalog segment. The
problem arises if there are other applications writing to the same disk. The other
applications might use up the disk space so that there is not enough space
available on the disk after backup is done. Its recommended that DP should be the
exclusive user of the assigned disk space.
- 40 -
DP File Library
- Only one file library directory per mountpoint (on UX) or drive (on Windows)
You can configure several directories as the repository for file libraries, but only one
directory for each mountpoint or drive partition.
- 41 -
DP File Library
Default block size for all file devices (incl. file library) has changed:
Default block size has changed for standalone file devices, file jukebox and file
library from 16k to 64k. The default is now also visible in the advanced options
window for a device:
- 42 -
DP File Library
Objective: the user should become familiar with the new file library as an important
part of the “advanced backup to disk” approach.
2. Run two backup sessions using the newly created file library. Please ensure the
data protection has been set other then none. Check which file depot’s exist
and what the content is.
Conclusion: _____________________________________________________
Please note the file depot ID’s. Now remove the protection of these file depot’s
(recycle) and run a new backup session to the same file library.
Check again what file depot’s exist (compare against noted ID’s) and what their
content is.
Conclusion: _____________________________________________________
3. Export a file depot from a file library. Check in the GUI if this file depot is still
present/visible?
What options do you have to get this file depot back? __________________
Can you delete the remaining reference? ______________________________
Will the physical file be also deleted? ________________________________
How can you “clean up” the file depot list (e.g. in order not to list the inactive
references anymore)? ______________________________________________
- 43 -