Professional Documents
Culture Documents
for Business-Critical
Data
MetroCluster Technical
Overview
Agenda
Causes of Downtime
MetroCluster Overview
MetroCluster Deployment
Scenarios
MetroCluster Failure
Scenarios
New Features
Summary
Customer Challenges
Internal
Data Center
Failures
External
Data Center
Failures
FAS HA
(Active-Active)
MetroCluster
storage
hardware
Addresses the
major
causes
failures
within
of
downtime
in the
the data
data center
center
Source: Forester / Disaster Recovery Journal Global Disaster Recovery Preparedness Online Survey, Oct, 2007
2010 NetApp. All rights reserved.
Data
Protection
Continuous
Operations
LAN/WAN
Clustering
Synchronous
Replication
Synchronous
Clusters
MetroCluster
Asynchronous
Replication
Synchronous
SnapMirror and Clusters
Block-Level
Incremental
Backups
SnapVault
Application
Recovery
SnapRestore
Daily
Backup
Cost
Availability
Snapshot
Copies
Local Restore
Remote Restore
Capability
MetroCluster Overview
MetroCluster
MetroCluster
Stretch
up to 500
MetroCluster
Stretch
upm
to 100 km
MetroCluster
provides continuous
availability within a
single data center
and across data
centers in adjacent
floors, buildings and
metro areas
Support for
FAS3000, FAS3100,
FAS6000 and VSeries
Automatic Protection
Primary Data Center
Exchange
Exchange
Oracle
Others
Oracle
Sharepoint
Sharepoint
Home Dirs
Create
Create volume
volume
Oracle
Home Dirs
New Volume
New Volume
Set
Set up
up
Home Dirs replication
replication
Create
Create Destination
Destination
Volume
Home Dirs
Volume
Oracle
MetroCluster
Sharepoint
Exchange
New Volume
Create
Create volume
volume
2010 NetApp. All rights reserved.
Aggregate
Mirroring
Exchange
Automatic
Automatic Mirroring
Mirroring
NetApp Confidential Limited Use
Sharepoint
New Volume
Others
Sharepoint
New Volume
3. Break
Break Mirror
Mirror
3.
4. Bring
Bring Online
Online
4.
5. Break
Break Mirror
Mirror
5.
6. Bring
Bring Online
Online
6.
7. Break
Break Mirror
Mirror
7.
8. Bring
Bring Online
Online
8.
Home Dirs
Oracle
1. Break
Break Mirror
Mirror
1.
2. Bring
Bring Online
Online
2.
Exchange
Oracle
ONLINE
ONLINE
Sharepoint
Home Dirs
ONLINE
9. Break
Break Mirror
Mirror
9.
10. Bring
Bring Online
Online
10.
New Volume
ONLINE
Oracle
Home Dirs
ONLINE
Home Dirs
MetroCluster
Exchange
ONLINE
Exchange
Sharepoint
New Volume
New Volume
ONLINE
CF
CF Forcetakeover
Forcetakeover -d
-d
2010 NetApp. All rights reserved.
ONLINE
Sharepoint
Node 1
Stretch MetroCluster
500m
Volume X
Volume Y Mirror
Volume Y
Data Center
Volume X Mirror
MetroCluster Deployment
Across City or Metro Areas
Building
A
Fabric MetroCluster
Building
B
<=100 km
DWDM
Volume X
Volume Y Mirror
Dark Fiber
No SPOFs
redundancy in all
components and
connectivity
Dedicated pairs of
FC switches
DWDM
Volume Y
Volume X Mirror
MetroCluster Components
Cluster License Provides automatic failover capability between
sites in case of most hardware failures
MC Component: SyncMirror
Writing to an
unmirrored
volume
Writing to a
mirrored
volume
Volume
1
A
Plex 0
Pool 0
2010 NetApp. All rights reserved.
Note:
Plex #
may change
over time
Volume1
Plex 0
Pool 0
Plex 1
Pool 1
Disk Ownership
Software
Disks assigned to pools by administrator
FAS3040, FAS3070, FAS31xx, FAS6000
FAS3020, FAS3050 (Stretch MC only)
Hardware
Determined by physical connection
900 series, FAS3020, FAS3050
NetApp
V-Series
Open Storage
Controller
V-Series MetroCluster
FAS
MetroCluster
Up to 30km distance
Up to 100km
Dedicated
Brocade or Cisco
Switches
Brocade
Fabric or Stretch
Configurations
Fabric or Stretch
Configurations
NetApp Storage
MetroCluster Failure
Scenarios
19
Data
Center
#1
Data
Center
#2
Host
2
FAS 1
DC1
Primary
DC2
Mirror
Recovery
Automatic
Host #2 will
be accessing
its data
across the
data centers
FAS 2
DC2
Primary
DC 1 Primary
DC1
Mirror
DC 2 Mirror
2010 NetApp. All rights reserved.
DC2
Mirror
Data
Center
#2
Host
2
Writes go to both
plexes.
Reads come from
local by default
FAS 1
DC1
Primary
Data
Center
#1
FAS 2
provides data
services for
both data
centers
FAS 2
DC2
Primary
DC 1 Primary
DC1
Mirror
DC 2 Mirror
2010 NetApp. All rights reserved.
Recovery
Automatic
Data
Center
#1
Data
Center
#2
Recovery
Host
2
FAS 1
DC1
Primary
DC2
Mirror
Single
command
failover
FAS 2
DC2
Primary
DC 1 Primary
DC1
Mirror
DC 2 Mirror
2010 NetApp. All rights reserved.
Data
Center
#1
Data
Center
#2
Recovery
Host
2
FAS 1
DC1
Primary
DC2
Mirror
Single
command
failover
FAS 2
DC2
Primary
DC 1 Primary
DC1
MIrror
DC 2 Mirror
2010 NetApp. All rights reserved.
Data
Center
#1
Data
Center
#2
Recovery
Host
2
No failover;
mirroring disabled
Both appliance
heads will continue
to run serving their
LUNs/volumes
Resyncing
happens
automatically after
interconnect is
reestablished
FAS 1
DC1
Primary
DC2
Mirror
FAS 2
DC2
Primary
DC 1 Primary
DC1
Mirror
DC 2 Mirror
2010 NetApp. All rights reserved.
Data
Center
#1
Loss of
Entire
Data
Center
Data
Center
#2
Recovery
Host
2
Loss of Network
FAS 1
DC1
Primary
DC2
Mirror
Single
command
failover
FAS 2
DC 1 Primary
DC2
Primary
DC 2 Mirror
DC1
Primary
DC2
Mirror
Data
Center
#2
Host
2
Mirror is rebuilt
DC1 primary
reverts back to
DC1
FAS 1
DC1
Primary
Data
Center
#1
One-step
return to
normal
CF giveback
FAS 2
DC 1 Primary
DC2
Primary
DC 2 Mirror
DC1
Primary
Mirror
Support Matrix
Model
3020
3040
3050
3070
3140
3170
6030
6040
6070
6080
Stretch
Supported Max. Disks
YES
168
YES
336
YES
336
YES
504
YES
420
YES
840
YES
840
YES
840
YES
1008
YES
1166
Fabric
Supported
Max. Disks
YES
168
YES
336
YES
336
YES
504*
YES
420
YES
672*
YES
672*
YES
672*
YES
672*
YES
672*
31xx Configurations
NOTE:
1.Single controller chassis
only
2.FC/VI card required in both
Stretch and Fabric
configuration
2010 NetApp. All rights reserved.
No SPOF
High failure resiliencyno failover required for
any single component failures:
Disks, disk shelves, disk loop, MetroCluster
switches, ISLs, cables, interconnects, single
Exchange node, appliance head
No storage failover required, even in case of
failure of both Exchange nodes
No/minimal disruption to service availability for
any of above failure conditions*
Failover to the DR site within minutes after sitewide failure
No loss of data
Easy failover operation aided by server cluster
and storage cluster
Seamless backup and recovery across cluster
resources with SME
Support of active-active configurationsfully
utilized HW at both sites
Minimal performance degradation during and after
recovery
Whats New?
MetroCluster TieBreaker Solution
Vmware Certification with HA
Vmware Certification with FT
TieBreaker Diagram
ESX Server
ESX Server
ESX Server
ESX Server
ESX Server
VMotion Technology
Simplifies operations
Zero interdependencies
Tested interoperability
Leverages Virtualization
Guarantee Program
2010 NetApp. All rights reserved.
39
Challenges
Business requiring 24x7 operations
Risk of downtime was unacceptable
Replace its ERP system based on Oracle
with an ERP infrastructure based on SAP
Implement a VMware server virtualization
solution
Solution
Stretch MetroCluster
Data Center
Proposing a MetroCluster
Information to be gathered
Distance between locations
Number of disks consider mirroring, growth
MetroCluster Positioning
Qualification questions
Is continuous data availability important to one
or more of your product applications?
Would providing geographic protection be of
value?
What distance between controllers would
provide the best protection (building, floor
campus, metro, or regional)?
If you are employing an HA solution for a
virtualized environment, why not have the same
resilience at the storage level?
2010 NetApp. All rights reserved.
Deduplication
SnapMirror Sync
MetroCluster
IP or FC
FC only
Yes
No limit
Up to 200 KM
(latency driven)
100 KM
Yes
No
CLI
Yes (FlexClone)
No (dump)
No restrictions
Yes
References
TR3548 - MetroCluster Design and Implementation Guide
http://media.netapp.com/documents/tr3548.pdf
45
Value of MetroCluster
Data Center
Simple
Set it Once: Simple to deploy, with no scripts
or dependencies on the application or operating
system
Zero change management through automatic
mirroring of changes to user, configuration, and
application data
Continuous Availability
Zero unplanned downtime through
transparent failover with protection for hardware
plus network and environmental faults
Zero planned downtime through nondisruptive
upgrades for storage hardware and software
Cost Effective
50% lower cost, no host-based clustering
required, added efficiency through data
deduplication and server virtualization
2010 NetApp. All rights reserved.
MetroCluster
Thank You
Copyright 2009 NetApp, Inc. All rights reserved. No portions of this document may be
reproduced without prior written consent of NetApp, Inc. NetApp, the NetApp logo, Go
further, faster, Data ONTAP, NOW, RAID-DP, SnapMirror, and SyncMirror are trademarks
or registered trademarks of NetApp, Inc. in the United States and/or other countries.
Microsoft is a registered trademark of Microsoft Corporation. Oracle is a registered
trademark of Oracle Corporation. Veritas is a trademark of Symantec Corporation or its
affiliates in the U.S. and other countries. SAP is a registered trademark of SAP AG. VMware
and VMotion are registered trademarks of VMware, Inc. All other brands or products are
trademarks or registered trademarks of their respective holders and should be treated as
such.