
VMware vCenter Site Recovery Manager 5.x
with EMC VNX Arrays & MirrorView

By Dave O’Sullivan

David.C.OSullivan@emc.com

EMC CONFIDENTIAL—INTERNAL USE ONLY 1


Intended Audience:

- VNX Block
- CLARiiON Block

- This training is designed to give an overview of SRM and explain how the relevant software plugins and hardware all interact with each other.

- This will cover:

- Pre-requisites
- Design
- Test Failover / Failover / Recovery [DEMO]
- Required Logs
- Troubleshooting



Assumptions:

- You are familiar (not expert) with:

- VMs
- vCenter
- MirrorView/A and MirrorView/S
- VNX / CLARiiON arrays
- SRA(s)?

- Before the customer does any work with SRM, confirm that:

- MirrorView is working (check zoning).
- All the appropriate software is installed, including enablers.



What is SRM?

- SRM provides simple, reliable disaster protection for virtualized applications.
- Site recovery plans can be tested non-disruptively as frequently as required to ensure that they meet business objectives.
- At the time of a site failover or migration, Site Recovery Manager automates both the failover and failback processes, giving fast and predictable recovery point objectives (RPOs) and recovery time objectives (RTOs).



Pre-requisites

- SRM is heavily reliant on DNS, so it is assumed that DNS is fully set up and all hosts can be resolved in both directions.
- IP connectivity is required between all SPs, vCenter (VC) and ESX hosts on both sites.
- SRM is also reliant on databases; in this setup there are 4 in total:
- 1 DB for VC
- 1 DB for SRM
- This applies to both sites.
- This document covers the DB setup in full detail:

Virtual How to Install and Configure SQL Express 2005
For use with Site Recovery Manager V4
Rob Nourse, Sr. Consultant
VMware Consulting Services

http://communities.vmware.com/servlet/JiveServlet/download/11547-1-32136/Install%20%26%20Configure%20SQL%20Express%20for%20use%20with%20SRM4%20v1.3.pdf
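
The DNS pre-requisite can be spot-checked from the SRM server with a few lines of Python. This is only a minimal sketch, assuming that "both directions" means forward (name to IP) and reverse (IP to name) lookups; the function name and the strict name comparison are illustrative choices, not part of any SRM tooling.

```python
import socket


def check_dns_both_ways(hostname, forward=socket.gethostbyname,
                        reverse=socket.gethostbyaddr):
    """Return (ok, detail) after a forward and a reverse lookup of hostname.

    forward and reverse are injectable so the check can be exercised
    without a live DNS server.
    """
    try:
        ip = forward(hostname)
    except OSError as exc:  # socket.gaierror is a subclass of OSError
        return False, f"{hostname}: forward lookup failed: {exc}"
    try:
        reverse_name = reverse(ip)[0]
    except OSError as exc:  # socket.herror is a subclass of OSError
        return False, f"{hostname} -> {ip}: reverse lookup failed: {exc}"
    # Strict, case-insensitive comparison; loosen it if short names are in use.
    ok = hostname.lower().rstrip(".") == reverse_name.lower().rstrip(".")
    return ok, f"{hostname} -> {ip} -> {reverse_name}"
```

Calling `check_dns_both_ways(name)` for every vCenter, SRM, ESX and SP name on both sites, from both SRM servers, gives a quick picture before any SRM configuration is attempted.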



Design:

- ….



Design considerations : IP / DNS



Design considerations : Software / Plugins



Design considerations: MirrorView
- In order for SRM failover to work, the “protected” VMs must be located on a LUN that is replicated from the production site to the DR site.
- This is handled by MirrorView/A or MirrorView/S (we are using MirrorView/A in this setup).
- Below is the LUN info for my setup:



Design considerations: MirrorView Zoning
- For MirrorView to work, we need to ensure that the appropriate ports are zoned together.
- In this setup, the FC ports used for MirrorView are zoned to the opposite array.



Design considerations: MirrorView



Design considerations: MirrorView
- The LUN is created first on the Prod (Athena) side.
- Then use the MirrorView option “create secondary mirror” and follow through the wizard.
- I used the Mirror Wizard to complete this task.



Design considerations: MirrorView
- When it’s working, it should look like this:



Design considerations: Reserved LUN Pool
- You must add LUNs with adequate capacity to the Reserved LUN Pool before proceeding.
- This will be used when the SRA creates a snapshot for the SRM failover test (only!).



Design considerations: DEMO



Design considerations:
- That’s pretty much it on the VNX side.
- Once MirrorView is up and running, you should be good to go with the SRM Windows / VMware side of the setup.

- Next:

- What is the SRA?



SRA [Storage Replication Adapters]
- What is the SRA?
- The SRA is a Windows .exe installed on the same Windows box as SRM as part of the SRM setup.
- vCenter “talks” to the SRA, and the SRA sends navi commands to the array.
- This is why naviseccli must be installed on the same box as SRM (check the path!).
- Each array vendor has its own SRAs.
- The SRAs are EMC code, so we support them!
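
The "check path!" reminder can be turned into a quick test on the SRM server. The sketch below only looks up the executable on the PATH of whoever runs the script; the helper names are illustrative, and the account the SRM service runs under may see a different PATH, so treat a positive result as a hint rather than proof.

```python
import shutil


def find_cli(name="naviseccli", which=shutil.which):
    """Return the full path of the CLI if it is found on PATH, else None."""
    return which(name)


def describe_cli(name="naviseccli", which=shutil.which):
    """Return a one-line, human-readable result of the PATH lookup."""
    path = find_cli(name, which)
    if path is None:
        return f"{name} not found on PATH; the SRA cannot issue navi commands"
    return f"{name} found at {path}"
```

Running `print(describe_cli())` on each SRM server is enough; a missing entry shows up in the SRA logs as "'naviseccli' is not recognized as an internal or external command".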

SRAs for SRM 5.x


For the full list of storage replication adapters supported by SRM 5.x, see
http://www.vmware.com/resources/compatibility/search.php?deviceCategory=sra

SRA [Storage Replication Adapters]
- These are the most recent supported EMC SRAs:



SRA [Storage Replication Adapters]
- On both sites, the following should be installed:

- Note that there are 2 SRA components:

- VNX SRA for vCenter
- MirrorView enabler for VNX SRA

- As we are only doing block replication, we only need the MirrorView enabler.
- NFS replication is also possible using the EMC_VNX_Replicator_Enabler_for_VNX_SRA_v5.0.xx



SRA [Storage Replication Adapters] DEMO



Test Failover – sequence of events
We will look at what happens on both sites concerning:

• SRM
• VNX
• ESX



Test Failover – sequence of events [PROD]



Test Failover – sequence of events [PROD]
- Be sure to check the output from:
Recovery Plan History Report
VMware Site Recovery Manager 5.0



Test Failover – sequence of events [PROD]
- This is a cosmetic issue which *should* be fixed in later versions of SRM:

Warning: Failed to update embedded paths in virtual machine file '/vmfs/volumes/507432e1-3a92a2a4-027e-b8ac6f866cc6/2008-1/2008-1_1.vmdk'. A general system error occurred: No such device
Failed to update embedded paths in virtual machine file '/vmfs/volumes/507432e1-3a92a2a4-027e-b8ac6f866cc6/2008-2/2008-2_1.vmdk'. A general system error occurred: No such device



Test Failover – sequence of events [PROD ESX]
- Some fairly serious-looking errors appear in the vmkernel log on the prod ESX host; these can be ignored.

)WARNING: VMW_SATP_LIB_CX: satp_lib_cx_otherSPIsHung:338:Path "vmhba2:C0:T1:L3" Peer SP is hung.
)WARNING: VMW_SATP_LIB_CX: satp_lib_cx_otherSPIsHung:338:Path "vmhba3:C0:T0:L3" Peer SP is hung.
)ALERT: NMP: vmk_NmpVerifyPathUID:1166:The physical media represented by device naa.600601609da02e0012ce6a8f930de211 (path vmhba3:C0:T1:L9) has changed. If this is a data LUN, this is a critical error. Detecte[0$
)ALERT: NMP: vmk_NmpVerifyPathUID:1166:The physical media represented by device naa.600601609da02e0012ce6a8f930de211 (path vmhba2:C0:T0:L9) has changed. If this is a data LUN, this is a critical error. Detecte[0$
)NMP: nmp_DeviceUpdatePathStates:547: Activated path "vmhba2:C0:T1:L9" for NMP device "naa.600601609da02e0012ce6a8f930de211".

- Watch out for messages like this; customers could open cases based on these errors alone…
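
When a customer sends a vmkernel log full of these, it can help to separate the message types shown above from everything else. The sketch below is only a triage aid: the two patterns are taken from the excerpt on this slide, the function name is made up for illustration, and a match is only safe to ignore if its timestamp lines up with an SRM test failover or cleanup.

```python
import re

# Signatures copied from the vmkernel excerpt above; extend as needed.
KNOWN_PATTERNS = [
    re.compile(r"satp_lib_cx_otherSPIsHung.*Peer SP is hung"),
    re.compile(r"vmk_NmpVerifyPathUID.*physical media represented by device"),
]


def split_known(lines):
    """Split log lines into (known, review) lists.

    known  : lines matching a signature seen during SRM test failover
    review : every other line, which still needs a human look
    """
    known, review = [], []
    for line in lines:
        if any(pattern.search(line) for pattern in KNOWN_PATTERNS):
            known.append(line)
        else:
            review.append(line)
    return known, review
```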



Test Failover – sequence of events [DR]



Test Failover – sequence of events [DR ESX]
LVM: 8445:00:00 Device naa.6006016054502e005e32bf721a12e211:1 detected to be a snapshot:

LVM: 8452:00:00 queried disk ID: <type 2, len 22, lun 11, devType 0, scsi 0, h(id) 14674641011867526612>

LVM: 8459:00:00 on-disk disk ID: <type 2, len 22, lun 1, devType 0, scsi 0, h(id) 15441737393007327004>

LVM: 8445:00:00 Device naa.6006016054502e005e32bf721a12e211:1 detected to be a snapshot:

LVM: 8452:00:00 queried disk ID: <type 2, len 22, lun 11, devType 0, scsi 0, h(id) 14674641011867526612>

LVM: 8459:00:00 on-disk disk ID: <type 2, len 22, lun 1, devType 0, scsi 0, h(id) 15441737393007327004>

LVM: 8445:00:00 Device naa.6006016054502e005e32bf721a12e211:1 detected to be a snapshot:

LVM: 8452:00:00 queried disk ID: <type 2, len 22, lun 11, devType 0, scsi 0, h(id) 14674641011867526612>

LVM: 8459:00:00 on-disk disk ID: <type 2, len 22, lun 1, devType 0, scsi 0, h(id) 15441737393007327004>

LVM: 8445:00:00 Device naa.6006016054502e005e32bf721a12e211:1 detected to be a snapshot:

LVM: 8452:00:00 queried disk ID: <type 2, len 22, lun 11, devType 0, scsi 0, h(id) 14674641011867526612>

LVM: 8459:00:00 on-disk disk ID: <type 2, len 22, lun 1, devType 0, scsi 0, h(id) 15441737393007327004>

LVM: 8825:00:00 Device naa.6006016054502e005e32bf721a12e211:1 unsnapped

LVM: 5510:00:00 Snapshot LV <snap-37ce81f0-503f25d9-c56a845d-4ee3-0026b9736706> successfully resignatured

LVM: 13188 : One or more LVM devices have been discovered.



Test Failover – sequence of events [DR VNX]
• A 10/09/12 14:20:23 SnapCopy 7100000a Snapshot Logical Unit device CopyDisk0000 has been created.
• B 10/09/12 14:20:23 SnapCopy 7100000a Snapshot Logical Unit device CopyDisk0000 has been created.
• A 10/09/12 14:20:24 4600 'Create a SnapShot LU' called by 'admin' (10.64.29.93) on 'Navi_SnapCopyFeature' with result: Success (Successfully created SnapShot LU.)
• A 10/09/12 14:20:27 4600 '' called by 'admin' (10.64.29.93) on 'Navi_SnapCopyFeature' with result: Success (Started SnapView session successfully. Session name - async-25_SRM-TEST-FAILOVER_session)
• A 10/09/12 14:20:27 SnapCopy 71000003 SnapView persistent session async-25_SRM-TEST-FAILOVER_session has been started on LUN 25.
• B 10/09/12 14:20:27 SnapCopy 71000003 SnapView persistent session async-25_SRM-TEST-FAILOVER_session has been started on LUN 25.
• A 10/09/12 14:20:30 4600 'Activate' called by 'admin' (10.64.29.93) on 'SnapShot WWN: 60:06:01:60:54:50:2E:00:92:7D:58:76:1C:12:E2:11' with result: Success (Successfully activated snapshot LU: 60:06:01:60:54:50:2E:00:92:7D:58:76:1C:12:E2:11 (async-25_SRM-TEST-FAILOVER_session))
• A 10/09/12 14:20:35 4600 'ExecuteClientRequest' called by ' Navi User admin' (10.64.29.93) on 'CLIFeature' (Result: Success). snapview -storagegroup -addsnapshot -gname SG_dellpr710-g.emcvmw.ctc -hlu 9 -snapshotname async-25_SRM-TEST-FAILOVER -compatibilitymode called by 'admin'
• A 10/09/12 14:20:39 RemoteMirror 71050115 MirrorView quiesce LU request.
• A 10/09/12 14:20:39 RemoteMirror 71050139 RM_ADMIN_INFO_WILL_REBIND the object.
• A 10/09/12 14:20:39 RemoteMirror 71050111 MirrorView rebind request for LUN 60060160b9502e00:a4cc7fc81507e211.
• A 10/09/12 14:20:39 RemoteMirror 71050115 MirrorView quiesce LU request.
• A 10/09/12 14:20:39 SnapCopy 71000006 SnapView has been bound to device 00000017.
• B 10/09/12 14:20:39 RemoteMirror 71050136 Quiesce request from peer SP.
• B 10/09/12 14:20:39 RemoteMirror 71050136 Quiesce request from peer SP.
• B 10/09/12 14:20:39 RemoteMirror 71050120 Rebind request from peer SP for LUN 60060160b9502e00:a4cc7fc81507e211.
• B 10/09/12 14:20:39 SnapCopy 71000006 SnapView has been bound to device Disk0001.
• B 10/09/12 14:20:39 RemoteMirror 71050120 Rebind request from peer SP for LUN 60060160b9502e00:a4cc7fc81507e211.
• A 10/09/12 14:20:40 4600 'Create a SnapShot LU' called by 'admin' (10.64.29.93) on 'Navi_SnapCopyFeature' with result: Success (Successfully created SnapShot LU.)
• A 10/09/12 14:20:40 SnapCopy 7100000a Snapshot Logical Unit device CopyDisk0001 has been created.
• B 10/09/12 14:20:40 SnapCopy 7100000a Snapshot Logical Unit device CopyDisk0001 has been created.
• A 10/09/12 14:20:43 4600 '' called by 'admin' (10.64.29.93) on 'Navi_SnapCopyFeature' with result: Success (Started SnapView session successfully. Session name - sync-24_SRM-TEST-FAILOVER_session)
• A 10/09/12 14:20:43 Bus1 Enc0 DskE 60a A logical unit has been enabled [ALU 2] 0 ffff0000 2100e
• A 10/09/12 14:20:43 SnapCopy 71000003 SnapView persistent session sync-24_SRM-TEST-FAILOVER_session has been started on LUN 24.



Test Failover – sequence of events [DR VNX]
• B 10/09/12 14:20:43 Bus1 Enc0 DskE 606 Unit Shutdown for Trespass [ALU 2] 0 ffff0000 2100e
• B 10/09/12 14:20:43 SnapCopy 71000003 SnapView persistent session sync-24_SRM-TEST-FAILOVER_session has been started on LUN 24.
• A 10/09/12 14:20:46 4600 'Activate' called by 'admin' (10.64.29.93) on 'SnapShot WWN: 60:06:01:60:54:50:2E:00:32:0B:08:80:1C:12:E2:11' with result: Success (Successfully activated snapshot LU: 60:06:01:60:54:50:2E:00:32:0B:08:80:1C:12:E2:11 (sync-24_SRM-TEST-FAILOVER_session))
• A 10/09/12 14:20:51 4600 'ExecuteClientRequest' called by ' Navi User admin' (10.64.29.93) on 'CLIFeature' (Result: Success). snapview -storagegroup -addsnapshot -gname SG_dellpr710-g.emcvmw.ctc -hlu 10 -snapshotname sync-24_SRM-TEST-FAILOVER -compatibilitymode called by 'admin'
• A 10/09/12 14:20:55 RemoteMirror 71050115 MirrorView quiesce LU request.
• A 10/09/12 14:20:55 RemoteMirror 71050111 MirrorView rebind request for LUN 60060160b9502e00:fe606cb30d03e211.
• A 10/09/12 14:20:55 RemoteMirror 71050115 MirrorView quiesce LU request.
• A 10/09/12 14:20:55 SnapCopy 71000006 SnapView has been bound to device 0000000f.
• A 10/09/12 14:20:55 RemoteMirror 71050139 RM_ADMIN_INFO_WILL_REBIND the object.
• B 10/09/12 14:20:55 RemoteMirror 71050136 Quiesce request from peer SP.
• B 10/09/12 14:20:55 RemoteMirror 71050120 Rebind request from peer SP for LUN 60060160b9502e00:fe606cb30d03e211.
• B 10/09/12 14:20:55 RemoteMirror 71050136 Quiesce request from peer SP.
• B 10/09/12 14:20:55 SnapCopy 71000006 SnapView has been bound to device Disk0002.
• B 10/09/12 14:20:55 RemoteMirror 71050120 Rebind request from peer SP for LUN 60060160b9502e00:fe606cb30d03e211.
• A 10/09/12 14:20:56 4600 'Create a SnapShot LU' called by 'admin' (10.64.29.93) on 'Navi_SnapCopyFeature' with result: Success (Successfully created SnapShot LU.)
• A 10/09/12 14:20:56 SnapCopy 7100000a Snapshot Logical Unit device CopyDisk0002 has been created.
• B 10/09/12 14:20:56 SnapCopy 7100000a Snapshot Logical Unit device CopyDisk0002 has been created.
• A 10/09/12 14:20:59 4600 '' called by 'admin' (10.64.29.93) on 'Navi_SnapCopyFeature' with result: Success (Started SnapView session successfully. Session name - sync-0_SRM-TEST-FAILOVER_session)
• A 10/09/12 14:20:59 SnapCopy 71000003 SnapView persistent session sync-0_SRM-TEST-FAILOVER_session has been started on LUN 0.
• A 10/09/12 14:20:59 Bus1 Enc0 DskE 60a A logical unit has been enabled [ALU 3] 0 ffff0001 3100e
• B 10/09/12 14:20:59 Bus1 Enc0 DskE 606 Unit Shutdown for Trespass [ALU 3] 0 ffff0001 3100e
• B 10/09/12 14:20:59 SnapCopy 71000003 SnapView persistent session sync-0_SRM-TEST-FAILOVER_session has been started on LUN 0.
• A 10/09/12 14:21:02 4600 'Activate' called by 'admin' (10.64.29.93) on 'SnapShot WWN: 60:06:01:60:54:50:2E:00:54:39:77:89:1C:12:E2:11' with result: Success (Successfully activated snapshot LU: 60:06:01:60:54:50:2E:00:54:39:77:89:1C:12:E2:11 (sync-0_SRM-TEST-FAILOVER_session))
• A 10/09/12 14:21:06 4600 'ExecuteClientRequest' called by ' Navi User admin' (10.64.29.93) on 'CLIFeature' (Result: Success). snapview -storagegroup -addsnapshot -gname SG_dellpr710-g.emcvmw.ctc -hlu 11 -snapshotname sync-0_SRM-TEST-FAILOVER -compatibilitymode called by 'admin'



Test Failover – sequence of events [DEMO]



Cleanup – sequence of events [PROD]
Recovery Step | Result | Step Started | Step Completed | Execution Time
1. Power Off Test VMs at Recovery Site | Success | 2012-10-09 14:46:24 (UTC 0) | 2012-10-09 14:46:30 (UTC 0)
1.1. 2008-1 | Success | 2012-10-09 14:46:24 (UTC 0) | 2012-10-09 14:46:30 (UTC 0)
1.1.1. Power Off | Success | 2012-10-09 14:46:24 (UTC 0) | 2012-10-09 14:46:25 (UTC 0)
1.1.2. Reset Storage | Success | 2012-10-09 14:46:30 (UTC 0) | 2012-10-09 14:46:30 (UTC 0)
1.2. 2008-2 | Success | 2012-10-09 14:46:24 (UTC 0) | 2012-10-09 14:46:30 (UTC 0)
1.2.1. Power Off | Success | 2012-10-09 14:46:24 (UTC 0) | 2012-10-09 14:46:27 (UTC 0)
1.2.2. Reset Storage | Success | 2012-10-09 14:46:30 (UTC 0) | 2012-10-09 14:46:30 (UTC 0)
2. Resume Non-critical VMs at Recovery Site | Inactive
3. Discard Test Data and Reset Storage | Success | 2012-10-09 14:46:30 (UTC 0) | 2012-10-09 14:47:25 (UTC 0)
3.1. Protection Group test7 | Success | 2012-10-09 14:46:30 (UTC 0) | 2012-10-09 14:47:25 (UTC 0)

Device "Mirror of dellpr710-c.w2k8.emcvm...": Success
Device "Mirror of dellpr710-c.w2k8.emcvmw.ctc RDM-23": Success
Device "Mirror of dellpr710-c.w2k8.emcvmw.ctc SRM_VMs": Success



Cleanup – sequence of events [PROD ESX]
- Some fairly serious-looking errors appear in the vmkernel log on the prod ESX host; these can be ignored.

)WARNING: VMW_SATP_LIB_CX: satp_lib_cx_otherSPIsHung:338:Path "vmhba2:C0:T1:L3" Peer SP is hung.
)WARNING: VMW_SATP_LIB_CX: satp_lib_cx_otherSPIsHung:338:Path "vmhba3:C0:T0:L3" Peer SP is hung.
)ALERT: NMP: vmk_NmpVerifyPathUID:1166:The physical media represented by device naa.600601609da02e0012ce6a8f930de211 (path vmhba3:C0:T1:L9) has changed. If this is a data LUN, this is a critical error. Detecte[0$
)ALERT: NMP: vmk_NmpVerifyPathUID:1166:The physical media represented by device naa.600601609da02e0012ce6a8f930de211 (path vmhba2:C0:T0:L9) has changed. If this is a data LUN, this is a critical error. Detecte[0$
)NMP: nmp_DeviceUpdatePathStates:547: Activated path "vmhba2:C0:T1:L9" for NMP device "naa.600601609da02e0012ce6a8f930de211".

- Watch out for messages like this; customers could open cases based on these errors alone…



Cleanup – sequence of events [DR ESX]



Cleanup – sequence of events [DR VNX]
A 10/09/12 14:46:49 4600 'storagegroup' called by ' Navi User admin' (10.64.29.93) with result: Success (Navisphere CLI command: ' storagegroup -removesnapshot -o -gname SG_dellpr710-g.emcvmw.ctc -snapshotname async-25_SRM-TEST-FAILOVER ')
A 10/09/12 14:46:50 4600 'Stop' called by 'admin' (10.64.29.93) on 'Session Name: async-25_SRM-TEST-FAILOVER_session' with result: Success (Deactivated snapshot LU successfully: 60:06:01:60:54:50:2E:00:92:7D:58:76:1C:12:E2:11 (async-25_SRM-TEST-FAILOVER_session)Stopped session su
A 10/09/12 14:46:50 SnapCopy 71000004 SnapView session async-25_SRM-TEST-FAILOVER_session has been stopped on LUN 25 with status of 0.
B 10/09/12 14:46:50 SnapCopy 71000004 SnapView session async-25_SRM-TEST-FAILOVER_session has been stopped on LUN 25 with status of 0.
A 10/09/12 14:46:52 SnapCopy 7100000b Snapshot Logical Unit device CopyDisk0000 has been removed.
A 10/09/12 14:46:52 NaviCimom 71288021 Failing Command: Set LUN.
B 10/09/12 14:46:52 SnapCopy 7100000b Snapshot Logical Unit device CopyDisk0000 has been removed.
A 10/09/12 14:46:53 4600 'Destroy a SnapShot' called by 'admin' (10.64.29.93) on 'SnapShot WWN: 60:06:01:60:54:50:2E:00:92:7D:58:76:1C:12:E2:11' with result: Success (Destroy snapshot successfully: 60:06:01:60:54:50:2E:00:92:7D:58:76:1C:12:E2:11)
B 10/09/12 14:46:53 NaviCimom 71288021 Failing Command: Set LUN.
A 10/09/12 14:47:00 4600 'storagegroup' called by ' Navi User admin' (10.64.29.93) with result: Success (Navisphere CLI command: ' storagegroup -removesnapshot -o -gname SG_dellpr710-g.emcvmw.ctc -snapshotname sync-24_SRM-TEST-FAILOVER ')
A 10/09/12 14:47:01 4600 'Stop' called by 'admin' (10.64.29.93) on 'Session Name: sync-24_SRM-TEST-FAILOVER_session' with result: Success (Deactivated snapshot LU successfully: 60:06:01:60:54:50:2E:00:32:0B:08:80:1C:12:E2:11 (sync-24_SRM-TEST-FAILOVER_session)Stopped session succ
A 10/09/12 14:47:01 Bus1 Enc0 DskE 606 Unit Shutdown for Trespass [ALU 2] 0 ffff0000 2100e
A 10/09/12 14:47:01 SnapCopy 71000004 SnapView session sync-24_SRM-TEST-FAILOVER_session has been stopped on LUN 24 with status of 0.
B 10/09/12 14:47:01 SnapCopy 71000004 SnapView session sync-24_SRM-TEST-FAILOVER_session has been stopped on LUN 24 with status of 0.
B 10/09/12 14:47:01 Bus1 Enc0 DskE 60a A logical unit has been enabled [ALU 2] 0 ffff0000 2100e
A 10/09/12 14:47:02 SnapCopy 7100000b Snapshot Logical Unit device CopyDisk0001 has been removed.
B 10/09/12 14:47:02 SnapCopy 7100000b Snapshot Logical Unit device CopyDisk0001 has been removed.
A 10/09/12 14:47:03 4600 'Destroy a SnapShot' called by 'admin' (10.64.29.93) on 'SnapShot WWN: 60:06:01:60:54:50:2E:00:32:0B:08:80:1C:12:E2:11' with result: Success (Destroy snapshot successfully: 60:06:01:60:54:50:2E:00:32:0B:08:80:1C:12:E2:11)
A 10/09/12 14:47:03 RemoteMirror 71050115 MirrorView quiesce LU request.
A 10/09/12 14:47:03 RemoteMirror 71050115 MirrorView quiesce LU request.
A 10/09/12 14:47:03 RemoteMirror 71050139 RM_ADMIN_INFO_WILL_REBIND the object.
A 10/09/12 14:47:03 RemoteMirror 71050111 MirrorView rebind request for LUN 60060160b9502e00:a4cc7fc81507e211.



Cleanup – sequence of events [DR VNX]
B 10/09/12 14:47:03 RemoteMirror 71050136 Quiesce request from peer SP.
B 10/09/12 14:47:03 RemoteMirror 71050120 Rebind request from peer SP for LUN 60060160b9502e00:a4cc7fc81507e211.
B 10/09/12 14:47:03 RemoteMirror 71050136 Quiesce request from peer SP.
B 10/09/12 14:47:03 SnapCopy 71000007 SnapView has been unbound from device Disk0001.
B 10/09/12 14:47:03 RemoteMirror 71050120 Rebind request from peer SP for LUN 60060160b9502e00:a4cc7fc81507e211.
A 10/09/12 14:47:09 4600 'storagegroup' called by ' Navi User admin' (10.64.29.93) with result: Success (Navisphere CLI command: ' storagegroup -removesnapshot -o -gname SG_dellpr710-g.emcvmw.ctc -snapshotname sync-0_SRM-TEST-FAILOVER ')
A 10/09/12 14:47:10 4600 'Stop' called by 'admin' (10.64.29.93) on 'Session Name: sync-0_SRM-TEST-FAILOVER_session' with result: Success (Deactivated snapshot LU successfully: 60:06:01:60:54:50:2E:00:54:39:77:89:1C:12:E2:11 (sync-0_SRM-TEST-FAILOVER_session)Stopped session succes
A 10/09/12 14:47:10 Bus1 Enc0 DskE 606 Unit Shutdown for Trespass [ALU 3] 0 ffff0001 3100e
A 10/09/12 14:47:10 SnapCopy 71000004 SnapView session sync-0_SRM-TEST-FAILOVER_session has been stopped on LUN 0 with status of 0.
B 10/09/12 14:47:10 SnapCopy 71000004 SnapView session sync-0_SRM-TEST-FAILOVER_session has been stopped on LUN 0 with status of 0.
B 10/09/12 14:47:10 Bus1 Enc0 DskE 60a A logical unit has been enabled [ALU 3] 0 ffff0001 3100e
A 10/09/12 14:47:12 4600 'Destroy a SnapShot' called by 'admin' (10.64.29.93) on 'SnapShot WWN: 60:06:01:60:54:50:2E:00:54:39:77:89:1C:12:E2:11' with result: Success (Destroy snapshot successfully: 60:06:01:60:54:50:2E:00:54:39:77:89:1C:12:E2:11)
A 10/09/12 14:47:12 RemoteMirror 71050115 MirrorView quiesce LU request.
A 10/09/12 14:47:12 RemoteMirror 71050139 RM_ADMIN_INFO_WILL_REBIND the object.
A 10/09/12 14:47:12 RemoteMirror 71050111 MirrorView rebind request for LUN 60060160b9502e00:fe606cb30d03e211.
A 10/09/12 14:47:12 RemoteMirror 71050115 MirrorView quiesce LU request.
A 10/09/12 14:47:12 SnapCopy 7100000b Snapshot Logical Unit device CopyDisk0002 has been removed.
B 10/09/12 14:47:12 SnapCopy 7100000b Snapshot Logical Unit device CopyDisk0002 has been removed.
B 10/09/12 14:47:12 RemoteMirror 71050120 Rebind request from peer SP for LUN 60060160b9502e00:fe606cb30d03e211.
B 10/09/12 14:47:12 RemoteMirror 71050136 Quiesce request from peer SP.
B 10/09/12 14:47:12 SnapCopy 71000007 SnapView has been unbound from device Disk0002.
B 10/09/12 14:47:12 RemoteMirror 71050120 Rebind request from peer SP for LUN 60060160b9502e00:fe606cb30d03e211.
B 10/09/12 14:47:12 RemoteMirror 71050136 Quiesce request from peer SP.



Cleanup – sequence of events [PROD]



Failover - DEMO



Troubleshooting: Obtaining the correct logs
- Be sure to capture the SRM and SRA logs.
- Please use VMware KB 1009253:
- “Export System Logs”
- Please complete these actions on both sites!

- http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&docType=kc&externalId=1009253



Troubleshooting: Obtaining the correct logs
If the issue is related to a Test Failover or an actual Failover, the export log of the failed Recovery Plan will also be invaluable in troubleshooting. To generate the export log for the failed Recovery Plan:

1. In the left pane, click Recovery Plans and select the Recovery Plan which had the issue.
2. Select the Plan Name which is showing an Error in the Result column.
3. On the Plan Name with the error, click the Export action to generate the report for the failed Test Failover or actual Failover.
4. Save the file to your desktop and upload it with the SRM system logs.



Troubleshooting: Obtaining the correct logs
- The exported information will look like this. Take note of the timestamps, as these are what we will use to search through the SPCOLLECT.
- The errors listed here are extremely useful in the actual diagnosis of the issue.



Troubleshooting: log files of interest:
- There are 2 main folders of interest within the exported log bundles.
- The Logs folder, surprisingly enough:
- This contains all the activity from the SRM application on that particular site.
- Extract all .gz archives in case the errors you are searching for occurred a while back.
- The main file of interest in this folder is called “vmware-dr-XX”.
- Sort by date and review the most recent, or search for the timestamp obtained from the exported Recovery Plan .html page described earlier.
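
Extracting the archives by hand gets tedious on a large bundle, so here is a small sketch that decompresses every .gz file in a folder using only the Python standard library. The function name is illustrative, and it assumes the archives are plain gzip files sitting directly in the Logs folder.

```python
import gzip
import shutil
from pathlib import Path


def extract_gz_logs(folder):
    """Decompress every *.gz file in folder next to the original.

    Returns the list of decompressed paths. Files that were already
    extracted are left untouched.
    """
    extracted = []
    for gz_path in sorted(Path(folder).glob("*.gz")):
        target = gz_path.with_suffix("")  # drops only the trailing ".gz"
        if not target.exists():
            with gzip.open(gz_path, "rb") as src, open(target, "wb") as dst:
                shutil.copyfileobj(src, dst)
        extracted.append(target)
    return extracted
```

After running it on the Logs folder, the usual sort-by-date or timestamp search works across the older logs as well.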



Troubleshooting: log files of interest:
- Please note that the dates in the SRM logs are in the following format:
- 2012-09-02T09:10:48.508+01:00
- The dates in the exported .html page are in:
- 2012-09-02 09:10:29 (UTC 0)

- So adjust accordingly when searching for errors across logs.

- In my example I’ll search through the SRM logs with:

- 2012-09-02T09

- This will be a good starting point.

- For the Linux heads, this is what I’m using to make the logs more human readable:

- grep "2012-09-02T09" vmware-dr-3* | grep -v "<" | less
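
For anyone who would rather not do the time arithmetic in their head, the two formats above can be converted with the Python standard library. This is a sketch under assumptions: it takes the "+01:00" suffix in the SRM log and the "(UTC 0)" label in the report at face value, and the function names and the `offset_hours` parameter are made up for illustration.

```python
from datetime import datetime, timedelta, timezone


def srm_log_to_report_time(stamp):
    """Convert an SRM log stamp (ISO 8601 with offset) to the report's UTC form."""
    utc = datetime.fromisoformat(stamp).astimezone(timezone.utc)
    return utc.strftime("%Y-%m-%d %H:%M:%S") + " (UTC 0)"


def report_time_to_grep_prefix(stamp, offset_hours):
    """Convert a report stamp to an hour-level search prefix for the SRM logs.

    offset_hours is the UTC offset used by the SRM log being searched.
    """
    naive = datetime.strptime(stamp.replace(" (UTC 0)", ""), "%Y-%m-%d %H:%M:%S")
    local = naive.replace(tzinfo=timezone.utc).astimezone(
        timezone(timedelta(hours=offset_hours))
    )
    return local.strftime("%Y-%m-%dT%H")
```

The offset to apply depends on how each server's clock and time zone are configured, so it is worth confirming it against one known event before relying on it for a whole case.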



Troubleshooting: log files of interest:
- Most of the information in these vmware-dr-XX logs is really of more interest to VMware than EMC, as it’s just really verbose logging of the SRM application and database interaction.
- No harm in having a peek to see if there is anything jumping out, though.
- The log files we want to focus on next are the SRA logs, and there are a few different ones.
- Location = srm-support\Logs\SRAs\EMC VNX SRA
- sra_discoverArrays_08-30-2012_11-12-08.359
- sra_discoverDevices_08-30-2012_12-46-07.042
- sra_failover_08-29-2012_14-34-05.253
- sra_prepareFailover_09-02-2012_22-21-46.604
- sra_prepareReverseReplication_09-02-2012_22-26-51.711
- sra_queryCapabilities_02-24-2012_14-02-20.738
- sra_queryConnectionParameters_02-24-2012_14-02-23.035
- sra_queryErrorDefinitions_02-24-2012_14-02-25.676
- sra_queryInfo_02-24-2012_14-02-09.816
- sra_queryReplicationSettings_08-30-2012_17-47-54.968
- sra_queryStrings_02-24-2012_14-02-24.160
- sra_querySyncStatus_09-02-2012_22-19-09.277
- sra_reverseReplication_09-02-2012_22-17-17.440
- sra_syncOnce_09-02-2012_22-20-26.700
- sra_testFailoverStart_08-29-2012_12-52-23.461
- sra_testFailoverStop_09-02-2012_10-21-37.204
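
With that many files in the folder, picking the right one is easier if the file names are parsed rather than eyeballed. The sketch below assumes the naming pattern visible in the list above (sra_<operation>_MM-DD-YYYY_HH-MM-SS.mmm, optionally followed by .log); the function names are illustrative.

```python
import re
from datetime import datetime

SRA_LOG_NAME = re.compile(
    r"^sra_(?P<op>[A-Za-z]+)_"
    r"(?P<stamp>\d{2}-\d{2}-\d{4}_\d{2}-\d{2}-\d{2}\.\d{3})(?:\.log)?$"
)


def parse_sra_log_name(name):
    """Return (operation, datetime) for an SRA log file name, or None."""
    match = SRA_LOG_NAME.match(name)
    if not match:
        return None
    when = datetime.strptime(match.group("stamp"), "%m-%d-%Y_%H-%M-%S.%f")
    return match.group("op"), when


def logs_for(names, operation, day):
    """Return the names for one operation on one date, oldest first.

    day is a datetime.date; names that do not follow the pattern are skipped.
    """
    hits = []
    for name in names:
        parsed = parse_sra_log_name(name)
        if parsed and parsed[0] == operation and parsed[1].date() == day:
            hits.append((parsed[1], name))
    return [name for _, name in sorted(hits)]
```

For a failed Test Failover, `logs_for(names, "testFailoverStart", date(2012, 9, 2))` narrows the folder down to the files worth opening first.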



Troubleshooting: log files of interest:
- The timestamp format is similar here again, so just take note of it.
- Get the timestamp from the .html page as before:

- The SRA log folder will be full of logs, so I’m just going to focus on the logs from Sept 02 2012.
- The above error was received when trying to do a Test Failover.
- We need to check the following log:

- sra_testFailoverStart_09-02-2012_08-42-09.811.log

- Note the UTC time adjustment.



Troubleshooting: log files of interest:
- Looking at the log file, we can pull some very useful information:
- [sra_testFailoverStart_09-02-2012_08-42-09.811.log]

- grep / search for:

- com.emc.mirrorview.platform.naviseccli.NaviseccliConnection

- This will show the actual navi commands that are being issued by the
SRA to the SP



Troubleshooting: log files of interest:
- Within the same log file search / grep for :

- Command result:

- This should give a clear indication of where the error lies.


- In this case, it looks like we have an issue with SnapView.
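
The two searches above (for NaviseccliConnection and for "Command result:") can be combined so that each navi command is shown next to its result. This is a sketch, assuming the SRA log line layout seen in the examples in this deck (timestamp, bracketed class name, then "Executing command:" or "Command result:"); the function names are illustrative.

```python
import re

ENTRY_START = re.compile(r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3} \[")
MARKER = "NaviseccliConnection]:"


def navi_exchanges(lines):
    """Return a list of (command, result) pairs from SRA log lines.

    result is None when no "Command result:" entry follows the command.
    """
    # A line without a leading timestamp continues the previous entry.
    entries = []
    for line in lines:
        line = line.rstrip("\n")
        if ENTRY_START.match(line) or not entries:
            entries.append(line)
        else:
            entries[-1] += " " + line.strip()

    pairs, command = [], None
    for entry in entries:
        if MARKER not in entry:
            continue
        if "Executing command:" in entry:
            if command is not None:
                pairs.append((command, None))
            command = entry.split("Executing command:", 1)[1].strip()
        elif "Command result:" in entry and command is not None:
            pairs.append((command, entry.split("Command result:", 1)[1].strip()))
            command = None
    if command is not None:
        pairs.append((command, None))
    return pairs


def is_failure(result):
    """True if stdout starts with Error: or stderr is not empty."""
    return result is not None and (
        "stdout(Error:" in result or not result.endswith("stderr()")
    )
```

Filtering the pairs with `is_failure` usually points straight at the failing command.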



Troubleshooting: log files of interest:
- Switch over to the SPCOLLECTs for both sites, and grep out any messages related to SnapCopy.
- On the DR site, we can see:

- Dave@QQWWQQWW /cygdrive/c/Users/Dave/Documents.backup/Logs/SRM_LOGS_PPTX/Pandora
- $ grep SnapCopy "TRiiAGE_full_SPlogs.txt"

- B 09/02/12 07:54:35 NaviCimom 7100808b Failing Command: K10SnapCopyAdmin DBid 0 Op 1046.
- A 09/02/12 07:54:43 NaviCimom 7100808b Failing Command: K10SnapCopyAdmin DBid 0 Op 1046.
- A 09/02/12 07:54:44 4600 'Create a SnapShot LU' called by 'admin' (10.64.29.93) on 'Navi_SnapCopyFeature' with result: Failure (Could not create SnapShot LU.. [0x7100808B] A SnapView snapshot already exists with the specified name (0x7100808b))
- A 09/02/12 07:54:45 SnapCopy 71008031 You must add LUNs with adequate capacity to the Reserved LUN Pool before you can use this feature.
- A 09/02/12 07:54:46 4600 '' called by 'admin' (10.64.29.93) on 'Navi_SnapCopyFeature' with result: Failure (Could not start SnapView session. Session name - sync-0_SRM-TEST-FAILOVER_session. [0x71008031] You must add LUNs with adequate capacity to the Reserved LUN Pool before you
- A 09/02/12 07:54:46 NaviCimom 71008031 Failing Command: K10SnapCopyAdmin DBid 0 Op 1038.

- This indicates that no Reserved LUN Pool is set up, as described under Design considerations: Reserved LUN Pool.



Troubleshooting: Logs needed - Recap.
• So for all SRM / SRA cases you will need the following logs:

– SPCOLLECT from both sites
– SRM logs from both sites
– SRA logs from both sites
– “Recovery Plan Export Log.html”, as explained in the log collection steps above

• Seriously, don’t proceed until you have everything listed above.

• http://kb.vmware.com/selfservice/microsites/search.do?cmd=displayKC&docType=kc&externalId=1009253



Troubleshooting: Workflow
OK, so for every SRM / SRA case that comes in, the following should apply as a valid workflow towards resolution of the case.

1. Collect logs.
2. Check MirrorView and confirm it is actually working.
3. Have the customer reconfirm that DNS and IP connectivity are OK:
   a. All hosts should be DNS resolvable on both sites.
   b. All hosts / SPs should have IP connectivity on the same VLAN; all hosts / SPs should be able to ping each other.
4. Confirm the software requirements listed under Design considerations: Software / Plugins.
5. Check the error that is reported in Recovery Plan Export Log.html.
6. Search in TRiiAGE_full_SPlogs for any errors at the time reported in Recovery Plan Export Log.html.
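
Step 6 can be sketched as a small time-window filter over the SP log. It assumes the line layout seen in the SP log excerpts in this deck (SP letter, MM/DD/YY date, time, then the message) and that the caller has already adjusted the report time to the SP clock; the function name and the five-minute default are illustrative.

```python
import re
from datetime import datetime, timedelta

SP_LINE = re.compile(
    r"^(?P<sp>[AB]) (?P<stamp>\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}) (?P<rest>.*)$"
)


def sp_events_near(lines, when, minutes=5, keyword=None):
    """Return (sp, timestamp, text) for SP log lines within +/- minutes of when.

    If keyword is given, only lines whose text contains it are kept.
    """
    window = timedelta(minutes=minutes)
    hits = []
    for line in lines:
        match = SP_LINE.match(line.strip())
        if not match:
            continue
        stamp = datetime.strptime(match.group("stamp"), "%m/%d/%y %H:%M:%S")
        if abs(stamp - when) > window:
            continue
        if keyword and keyword not in match.group("rest"):
            continue
        hits.append((match.group("sp"), stamp, match.group("rest")))
    return hits
```

Passing `keyword="SnapCopy"` gives the same kind of result as the grep shown earlier, restricted to the minutes around the failure.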



Log error / message examples

• In this section we will provide examples of errors and informative messages that may assist in troubleshooting your issue.



Log error / message examples
Search for errors that are returned from the navi commands within the SRA logs.

In particular, look for: Command result: stdout(Error:

Search the whole folder of SRA logs, as you will get hits in different files depending on the issue you are having.
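
Searching the whole folder is easy to script. The sketch below counts, per file, the lines containing the search string above; the function name is illustrative, and it assumes the SRA logs are plain text files under one folder.

```python
from pathlib import Path

NEEDLE = "Command result: stdout(Error:"


def sra_logs_with_errors(folder, needle=NEEDLE):
    """Map each file name under folder to its number of lines containing needle.

    Files without any hit are left out of the result.
    """
    counts = {}
    for path in sorted(Path(folder).rglob("*")):
        if not path.is_file():
            continue
        with open(path, encoding="utf-8", errors="replace") as handle:
            hits = sum(1 for line in handle if needle in line)
        if hits:
            counts[path.name] = hits
    return counts
```

The file names in the result also tell you which SRA operation failed, since the operation is part of each log's name.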

Some examples: [sra_testFailoverStart_08-30-2012_16-40-15.999.log]

2012-08-30 16:40:31,693 [com.emc.mirrorview.platform.snapshot.HashMapSnapviewSnapshotRepository]: Caching SnapView snapshot with name sync-0_SRM-TEST-FAILOVER
2012-08-30 16:40:31,693 [com.emc.mirrorview.platform.snapshot.session.SnapviewSessionServiceImpl]: Starting SnapView session with name: sync-0_SRM-TEST-FAILOVER_session, for snapshot: sync-0_SRM-TEST-FAILOVER
2012-08-30 16:40:31,693 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Executing command: snapview -startsession "sync-0_SRM-TEST-FAILOVER_session" -snapshotname "sync-0_SRM-TEST-FAILOVER" -persistence
2012-08-30 16:40:33,003 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Command result: stdout(Error: snapview command failed
You must add LUNs with adequate capacity to the Reserved LUN Pool before you can use this feature. (0x71008031)), stderr()
2012-08-30 16:40:33,003 [com.emc.mirrorview.platform.snapshot.session.SnapviewSessionServiceImpl]: Retrieving info for SnapView session with name sync-0_SRM-TEST-FAILOVER_session
2012-08-30 16:40:33,003 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Executing command: snapview -listsessions -name "sync-0_SRM-TEST-FAILOVER_session"
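
The folder-wide search described on this slide can be sketched in Python as follows. This is a rough illustration only: the function names are invented for this example, and it assumes the SRA logs are plain-text files ending in .log in a single folder whose location you supply.

```python
from pathlib import Path

# The marker text recommended on this slide.
MARKER = "Command result: stdout(Error:"


def error_lines(lines):
    """Return (line number, text) for every line containing the marker."""
    return [(number, line.rstrip())
            for number, line in enumerate(lines, start=1)
            if MARKER in line]


def find_navi_errors(log_dir):
    """Map each *.log file name in log_dir to its list of marker hits."""
    hits = {}
    for path in sorted(Path(log_dir).glob("*.log")):
        with path.open(encoding="utf-8", errors="replace") as handle:
            found = error_lines(handle)
        if found:
            hits[path.name] = found
    return hits
```

Calling find_navi_errors on the collected SRA log folder returns only the files that contain a failed Navi command, which shows quickly which SRA operation (discoverArrays, discoverDevices, testFailoverStart and so on) hit the problem.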



Log error / message examples
sra_discoverArrays_08-30-2012_11-15-50.833.log

2012-08-30 11:15:51,457 [com.emc.sra.SraController]: Building SRM command response...
2012-08-30 11:15:51,457 [com.emc.sra.ResponseBuilder]: Building discoverArrays response...
2012-08-30 11:15:51,473 [com.emc.sra.mirrorview.MirrorviewCommands]: MirrorView Enabler Version: 5.0.22
2012-08-30 11:15:51,473 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Executing command: arrayname
2012-08-30 11:15:51,504 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Command result: stdout(), stderr('naviseccli' is not recognized as an internal or external command,
operable program or batch file.)
2012-08-30 11:15:51,504 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: Unknown error occurred while opening naviseccli connection.
2012-08-30 11:15:51,504 [com.emc.sra.mirrorview.MirrorviewCommands]: Unable to connect using SPA, trying SPB...
2012-08-30 11:15:51,504 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Executing command: arrayname
2012-08-30 11:15:51,520 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Command result: stdout(), stderr('naviseccli' is not recognized as an internal or external command,
operable program or batch file.)
2012-08-30 11:15:51,520 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: Unknown error occurred while opening naviseccli connection.
2012-08-30 11:15:51,520 [com.emc.sra.ResponseBuilder]: Unable to get SRA Enabler for this connection info Unable to get SRA Enabler for this connection infocom.emc.sra.ResponseBuilder.getEnabler(ResponseBuilder.java:346)

From the above, it would seem that naviseccli is not installed properly on the SRM
host, or is not on its path. Check the path!
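
A quick way to confirm the path problem on the SRM host is sketched below in Python. It is an illustration only: locate_cli is a made-up helper name, and it only tells you whether the executable is visible on the PATH of the account that runs the script, which may differ from the account the SRM service runs under.

```python
import shutil


def locate_cli(name="naviseccli", which=shutil.which):
    """Report whether the named executable can be found on PATH.

    The `which` parameter exists only so the lookup can be stubbed out;
    by default shutil.which searches the current PATH.
    """
    path = which(name)
    if path is None:
        return f"{name} was not found on PATH"
    return f"{name} found at {path}"


if __name__ == "__main__":
    print(locate_cli())
```

If the result is "not found", the SRA will fail in the same way as the discoverArrays log above, before it ever contacts either SP.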



Log error / message examples
If MirrorView is working, it would look like this in sra_discoverDevices_08-29-2012_12-20-26.942.log:

2012-08-29 12:20:32,823 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.83 Executing command: mirror -sync -info -systems
2012-08-29 12:20:34,102 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.83 Command result: stdout(Remote systems that can be enabled for mirroring:
Remote systems that are enabled for mirroring:
Array UID: 50:06:01:60:C7:20:0A:2D
Status: Enabled on both SPs), stderr()
2012-08-29 12:20:34,102 [com.emc.sra.response.ReplicatedDevicesBuilder]: Attempted discovery of peer array:50:06:01:60:C7:20:0A:2Dfailed.
2012-08-29 12:20:34,102 [com.emc.mirrorview.platform.mirror.MirrorServiceImpl]:
************* SYNC MIRRORS ***************
2012-08-29 12:20:34,102 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.83 Executing command: mirror -sync -list
2012-08-29 12:20:35,381 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.83 Command result: stdout(MirrorView Name: Mirror of dellpr710-c.w2k8.emcvmw.ctc Datastore_1
MirrorView Description:
MirrorView UID: 50:06:01:60:BE:A0:39:93:03:00:00:00:00:00:00:00
Logical Unit Numbers: 0
Remote Mirror Status: Mirrored
MirrorView State: Active
MirrorView Faulted: NO
MirrorView Transitioning: NO
Quiesce Threshold: 60
Minimum number of images required: 0
Image Size: 125829120
Image Count: 2
Write Intent Log Used: YES
Images:
Image UID: 50:06:01:60:BE:A0:39:93
Is Image Primary: YES
Logical Unit UID: 60:06:01:60:9D:A0:2E:00:52:49:31:9F:C9:D7:E1:11
Image Condition: Primary Image
Preferred SP: A

Image UID: 50:06:01:60:C7:20:0A:2D
Is Image Primary: NO
Logical Unit UID: 60:06:01:60:B9:50:2E:00:FA:DE:7D:16:AB:F1:E1:11
Image State: Synchronized
Image Condition: Normal
Recovery Policy: Manual
Preferred SP: A
Synchronization Rate: Medium
Image Faulted: NO
Image Transitioning: NO
Synchronizing Progress(%): 100), stderr()
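
As a rough aid for reading output like the above, the following Python sketch pulls the "Name: value" fields out of the mirror -sync -list text and flags any that differ from the values shown in this working example. The function names are invented for this illustration, and treating these four values as the "healthy" ones is an assumption based solely on the sample above.

```python
# Values taken from the working example on this slide (assumed healthy).
EXPECTED = {
    "Remote Mirror Status": "Mirrored",
    "MirrorView Faulted": "NO",
    "Image State": "Synchronized",
    "Image Faulted": "NO",
}


def parse_fields(text):
    """Split each 'Name: value' line on its first colon."""
    fields = []
    for line in text.splitlines():
        key, separator, value = line.partition(":")
        if separator:
            fields.append((key.strip(), value.strip()))
    return fields


def mirror_problems(text):
    """Return 'Name: value' strings for fields that differ from EXPECTED."""
    return [f"{key}: {value}"
            for key, value in parse_fields(text)
            if key in EXPECTED and value != EXPECTED[key]]
```

An empty list from mirror_problems means none of the checked fields deviates from the sample; anything else lists the fields to look at first in Unisphere or with naviseccli.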



Log error / message examples
sra_discoverDevices_08-29-2012_12-54-59.945.log

2012-08-29 12:55:13,486 [com.emc.mirrorview.platform.snapshot.SnapviewSnapshotServiceImpl]: Retrieving SnapView snapshot information...
2012-08-29 12:55:13,486 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Executing command: snapview -listsnapshots
2012-08-29 12:55:13,861 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Command result: stdout(This version of Core Software does not support Snapview), stderr()
2012-08-29 12:55:13,861 [com.emc.mirrorview.platform.snapshot.HashMapSnapviewSnapshotRepository]: Caching SnapView snapshot with name null
2012-08-29 12:55:13,861 [com.emc.sra.SraController]: Writing XML response...

In the above example, I did not have the correct SnapView enabler installed on the VNX; there is a
new one for INYO.

Reference Slide 7



Log error / message examples
sra_testFailoverStart_08-30-2012_16-43-48.316.log

2012-08-30 16:43:58,722 [com.emc.mirrorview.platform.snapshot.SnapviewSnapshotServiceImpl]: Retrieving SnapView snapshot information...
2012-08-30 16:43:58,722 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Executing command: snapview -listsnapshots
2012-08-30 16:43:59,923 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Command result: stdout(SnapView logical unit name: sync-0_SRM-TEST-FAILOVER
SnapView logical unit ID: 60:06:01:60:54:50:2E:00:78:4E:71:B9:BA:F2:E1:11
Target Logical Unit: 0
State: Inactive), stderr()
2012-08-30 16:43:59,923 [com.emc.mirrorview.platform.snapshot.HashMapSnapviewSnapshotRepository]: Caching SnapView snapshot with name sync-0_SRM-TEST-FAILOVER
2012-08-30 16:43:59,923 [com.emc.mirrorview.platform.snapshot.SnapviewSnapshotServiceImpl]: Creating SnapView snapshot with name: sync-0_SRM-TEST-FAILOVER, of LUN: 0
2012-08-30 16:43:59,923 [com.emc.mirrorview.platform.snapshot.SnapviewSnapshotServiceImpl]: Searching for current SP owner of lun: 0
2012-08-30 16:43:59,923 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Executing command: snapview -createsnapshot 0 -snapshotname "sync-0_SRM-TEST-FAILOVER"
2012-08-30 16:44:02,029 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Command result: stdout(Error: snapview command failed A SnapView snapshot already exists with the specified name (0x7100808b)), stderr()
2012-08-30 16:44:02,029 [com.emc.mirrorview.platform.snapshot.session.SnapviewSessionServiceImpl]: Starting SnapView session with name: sync-0_SRM-TEST-FAILOVER_session, for snapshot: sync-0_SRM-TEST-FAILOVER
2012-08-30 16:44:02,029 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Executing command: snapview -startsession "sync-0_SRM-TEST-FAILOVER_session" -snapshotname "sync-0_SRM-TEST-FAILOVER" -persistence
2012-08-30 16:44:03,355 [com.emc.mirrorview.platform.naviseccli.NaviseccliConnection]: 10.64.29.85 Command result: stdout(Error: snapview command failed You must add LUNs with adequate capacity to the Reserved LUN Pool before you can use this feature. (0x71008031)), stderr()

Both of the above errors occurred when I did not have the Reserved LUN Pool set up.
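
To summarise which Navi error codes appear in an SRA log, something like the following Python sketch may help. It is an illustration only: the function name is invented, the regular expression assumes the "stdout(Error: ... (0x...))" shape seen in the excerpts on these slides, and the two descriptions are copied from those same log messages rather than from any official error-code list.

```python
import re
from collections import Counter

# Matches the hex code that closes a failed command result, for example
# "stdout(Error: snapview command failed ... (0x71008031))".
ERROR_RE = re.compile(r"stdout\(Error:.*?\((0x[0-9a-fA-F]+)\)\)", re.DOTALL)

# Descriptions taken from the log messages shown on these slides.
SEEN_ON_SLIDES = {
    "0x71008031": "Reserved LUN Pool has no LUNs with adequate capacity",
    "0x7100808b": "A SnapView snapshot already exists with the specified name",
}


def error_codes(log_text):
    """Count each Navi error code found in the text of an SRA log."""
    return Counter(code.lower() for code in ERROR_RE.findall(log_text))
```

Reading a log file into a string and passing it to error_codes gives a per-code count; codes that are not in SEEN_ON_SLIDES still need to be looked up separately.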



Log error / message examples [ESX]
On the ESX side, we will see errors in the vmkernel log during an HBA rescan of the MirrorView
devices. This is expected behaviour:

2012-10-03T13:11:19.187Z cpu9:2057)NMP: nmp_ThrottleLogForDevice:2318: Cmd 0x28 (0x4124413ba440) to dev "naa.60060160b9502e00a4cc7fc81507e211" on path "vmhba5:C0:T0:L5" Failed: H:0x0 D:0x2 P:0x2 Possible sense data: 0x5 0x25 0x1.Act:NONE
2012-10-03T13:11:19.187Z cpu9:2057)ScsiDeviceIO: 2316: Cmd(0x4124413ba440) 0x28, CmdSN 0x5c5e6 to dev "naa.60060160b9502e00a4cc7fc81507e211" failed H:0x0 D:0x2 P:0x2 Possible sense data: 0x5 0x25 0x1.
2012-10-03T13:11:19.187Z cpu14:2833)Partition: 484: Read of GPT header failed on "naa.60060160b9502e00a4cc7fc81507e211": I/O error
2012-10-03T13:11:19.188Z cpu9:2057)ScsiDeviceIO: 2316: Cmd(0x4124413ba440) 0x28, CmdSN 0x5c5e7 to dev "naa.60060160b9502e00a4cc7fc81507e211" failed H:0x0 D:0x2 P:0x2 Possible sense data: 0x5 0x25 0x1.
2012-10-03T13:11:19.188Z cpu14:2833)WARNING: Partition: 944: Partition table read from device naa.60060160b9502e00a4cc7fc81507e211 failed: I/O error



Log error / message examples [ESX]
Taking a closer look at those SCSI sense codes:

H:0x0 D:0x2 P:0x2 Possible sense data: 0x5 0x25 0x1

http://10.241.217.72/vmdecoder/index.php
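
If the decoder page is not to hand, the fields can be split out with a short Python sketch like the one below. It is only a partial illustration: decode is a made-up function name, the device status and sense key names come from the standard SCSI tables, and the host status, plugin status, ASC and ASCQ are returned as raw numbers because their interpretation depends on VMware-specific and array-specific tables not reproduced here.

```python
import re

HEX = r"(0x[0-9a-fA-F]+)"
STATUS_RE = re.compile(
    rf"H:{HEX}\s+D:{HEX}\s+P:{HEX}\s+Possible sense data:\s+{HEX}\s+{HEX}\s+{HEX}"
)

# Standard SCSI device status codes.
DEVICE_STATUS = {
    0x00: "GOOD", 0x02: "CHECK CONDITION", 0x08: "BUSY",
    0x18: "RESERVATION CONFLICT", 0x28: "TASK SET FULL",
}

# Standard SCSI sense keys.
SENSE_KEYS = {
    0x0: "NO SENSE", 0x1: "RECOVERED ERROR", 0x2: "NOT READY",
    0x3: "MEDIUM ERROR", 0x4: "HARDWARE ERROR", 0x5: "ILLEGAL REQUEST",
    0x6: "UNIT ATTENTION", 0x7: "DATA PROTECT", 0xB: "ABORTED COMMAND",
}


def decode(line):
    """Split an 'H: D: P: Possible sense data:' string into named fields."""
    match = STATUS_RE.search(line)
    if match is None:
        return None
    host, device, plugin, key, asc, ascq = (int(v, 16) for v in match.groups())
    return {
        "host_status": host,
        "device_status": DEVICE_STATUS.get(device, hex(device)),
        "plugin_status": plugin,
        "sense_key": SENSE_KEYS.get(key, hex(key)),
        "asc": asc,
        "ascq": ascq,
    }
```

For the string above, decode reports a device status of CHECK CONDITION with sense key ILLEGAL REQUEST, that is, the array rejected the read command rather than the host or path failing.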



Questions
