You are on page 1of 14

Data Domain Overview

Jason Schaaf Senior Account Executive


Troy Schuler Systems Engineer

Copyright 2009 EMC Corporation. All rights reserved.

Data Domain: Leadership and Innovation


 Deduplication storage systems
> 10,000 systems installed
> 3,700 customers
> 1,600 Peta-Bytes under Data Domain protection worldwide

 A history of industry firsts


2003

2004

First
First Dedupe
Dedupe NAS
NAS

2005

2006

2007

First
First Dedupe
Dedupe Gateway
Gateway

First
First Dedupe
Dedupe
Volume
Volume Replication
Replication

Largest
Largest Dedupe
Dedupe
Array
Array

First
First Dedupe
Dedupe
Directory
Directory Replication
Replication

First
First Dedupe
Dedupe VTL
VTL

Copyright 2009 EMC Corporation. All rights reserved.

2008

2009

Fastest
Fastest Backup
Backup
Controller
Controller
Cascaded
Cascaded
Replication
Replication

First
First Dedupe
Dedupe
Nearline
Nearline Storage
Storage
2

Dedupe: A Storage Fundamental


Storage 1.0

Storage 1.0

Tape

Primary Disk

Storage 2.0

Storage 2.0

Primary Disk

Storage 3.0

Storage 3.0
Primary

Dedupe SATA

Before

Tape

After

Storage 4.0

Storage 4.0
Flash
Copyright 2009 EMC Corporation. All rights reserved.

Tape

SATA

Dedupe SATA

Before

After

It Is Not All Dedupe Out There

Regular Storage Array


1:1
LZ Compression
~ 2:1

Whitespace
Reduction
File Level

Single Instance Storage


~ 3:1

Fixed Blocks,
Snapshots

Fixed Block
~ 3:1

Backup Target,
Variable Segment

Copyright 2009 EMC Corporation. All rights reserved.

Variable
Segment
~ 20:1

Deduplication
Significantly Reduces
- Replication WAN Bandwidth
- Power
- Heat
- Cooling
- Management

Key Attributes of Data Domain Technology


 Easily Integrates with Existing Infrastructure
 Retention: Deduplication
 Recovery: Data Invulnerability Architecture
 Replication: WAN Efficient

Data Domain Deduplication Storage


for Nearline Applications

Copyright 2009 EMC Corporation. All rights reserved.

Deduplication Fundamentals

Copyright 2009 EMC Corporation. All rights reserved.

Data Domain Basics


Easy integration with existing environment
Control Tier
Backup & Archive
Applications

Target Tier

DR Tier

CIFS, NFS,
NDMP, OpenStorage

Ethernet

Replication
VTL over FC

DD880 Appliance










Copyright 2009 EMC Corporation. All rights reserved.

DD880 Appliance

4U
2 - 6 ports
10 and 1 Gb Ethernet; 4 Gb Fibre Channel
RAID-6
5.4 to 71 TB usable capacity with shelves
1 TB or 500 GB 7.2k rpm SATA HDD in shelf
File system
NVRAM
N+1 fans and redundant, hot-plug power supplies

Data Deduplication: Under the Hood


Store more backups in a smaller footprint
Friday Full Backup

A B C D A E F G
Mon Incr
Tues Incr

Weds Incr

Thurs Incr

Backup
Data

Logical

Estimated Physical
Reduction

FRIDAY FULL

1 TB

2-4x

250 GB

Monday Incr

100 GB

7-10x

10 GB

Tuesday Incr

100 GB

7-10x

10 GB

Wednesday Incr

100 GB

7-10x

10 GB

Thursday Incr

100 GB

7-10x

10 GB

50-60x

18 GB

7.8x

308 GB

A B H
C B

E G J
A C K

Second Friday Full Backup

B C D E F

L G H

A BCDE FGH I J K L

Copyright 2009 EMC Corporation. All rights reserved.

Second FRIDAY FULL 1 TB


TOTAL

2.4 TB

Retain: Store More for Longer with Less


Over 1 year of retention in 3U of Data Domain deduplication storage

Backup
Data

Cumulative
Logical

Estimated
Reduction

Physical

First Full

1 TB

4x

250 GB

Week 1

April 7

2.4 TB

8x

308 GB

Week 2

April 14

3.8 TB

10x

366 GB

Week 3

April 21

5.2 TB

12x

424 GB

Month 1

April 28

6.6 TB

14x

482 GB

Month 2

May 31

12.2 TB

17x

714 GB

Month 3

June 30

17.8 TB

19x

946 GB

Month 4

July 31

23.4 TB

20x

1178 GB

TOTAL

23.4 TB

20x

1178 GB

Copyright 2009 EMC Corporation. All rights reserved.

Data Integrity: Data Invulnerability Architecture


Trust but verifyhope is not a strategy

Data verification
CheckSum
Dedupe, write to disk
Verify

Generate
Checksum

Verify
Data

File System
Global Compression

Self-healing file system


Cleaning
Expired data
Defrag
Verify

Local Compression
RAID

Verify the file system


metadata integrity

Verify user data


integrity

Verify stripe integrity

Other
RAID-6
NVRAM
Snapshots

Copyright 2009 EMC Corporation. All rights reserved.

10

Network-efficient Replication for True DR


True DR; lowers WAN costs; improves SLAs
Flexible replication
1-5%
DB

Data Domain System

 Many-to-one
 Bi-directional
 System-to-system
 Cascaded

DIR A

Home

Archive Data
WAN
Backup Data

Data Domain System

1-5%

1-5%
Home

Data Domain System

Source:
Remote Sites

DDX with DD880s

95-99% cross-site bandwidth reduction

Destination:
Data Center Hub
Supports hundreds
of remote sites

Copyright 2009 EMC Corporation. All rights reserved.

11

Multi-site Protection for Remote Sites

Cascaded
Replication

Remote Sites
London

Tokyo
Collection
WAN

WAN

Directory

Directory

Protection
Site # 1

Copyright 2009 EMC Corporation. All rights reserved.

Protection
Site # 2

12

Industrys Most Scalable Inline


Deduplication Systems
DD880

DD600
Appliance Series

Software Options:
OpenStorage, VTL, Replicator
and Retention Lock

DD140 Remote Office


Appliance

DD140

DD610

DD630

DD660

DD690/g

DD880

2 TB/hr

2.7 TB/hr

5.4 TB/hr

Speed

450 GB/hr

675 GB/hr

1.1 TB/hr

Logical Capacity

17-43 TB

75-195 TB

165-420 TB

.520-1.31 PB

.710-1.7 PB

1.4-3.5 PB

Usable Capacity

.86 TB

Up to 3.98 TB

Up to 8.4 TB

Up to 26.1 TB

Up to 35.3 TB

Up to 71 TB

Copyright 2009 EMC Corporation. All rights reserved.

13

Thank You

Copyright 2009 EMC Corporation. All rights reserved.

14

You might also like