Professional Documents
Culture Documents
DR Drill No. Activity Start Date End Date Primary Data Near Data
Center (PDC) Center (NDC)
SAP-2021-02-21 Failover 20-FEB-2021 20-FEB-2021 Samanvay DC Naranpura DC
SAP-2021-02-28 Failback 27-FEB-2021 27-FEB-2021 Naranpura DC Samanvay DC
List of Nodes
Following is the list of servers (referred as Node-X) that were tested along with their physical location.
System PDC NDC
ECC AECCDB, AECC00 BECCDB, BECC00
BW ABWDB, ABWA0 BBWDB, BBWA0
EP AEPDBCS BEPDBCS
PO APROORCH BPROORCH
SOLMAN ASOLMAN BSOLMAN
DMS ADMS BDMS
FIORI AFFEDB BFFEDB
Web Dispatcher AWEBDIS BWEBDIS
IBM Spectrum Backup ATSM BTSM
Node-X tag Node-A Node-B
Prepared By Approved By
Sign: Sign:
16-Mar-2021
Date: ____________ Name: Umang Patel Date: __/ __ / ____ Name: Shri. Jagdish Trivedi (VP-IT)
Page 1 of 4
Functional Observation/Issues (if any):
(List here any non-technical or non-BASIS issue or observation that understood during testing)
1. No functional issues reported by end-users or IT support team
EP Issue:
1. EP cluster was failover successfully from Site A to Site B, but application did not start automatically.
Observations / Findings:
• AIX cluster of EP system was moved successfully from Site A (AEPDBCS) to Site B (BEPDBCS).
• When the cluster went to Site B, we observed that the application was taking time to start.
• For initial startup load of EP, it was needing more memory resources at HANA tenant database TPP.
• Issue has been resolved after increased memory from 100GB to 125GB of HANA tenant TPP.
PO Issues:
1) First Issue: PO cluster did not failover by cluster move resource group method.
Observations / Findings:
• We stopped sap application and executed command “smitty clstop” at Site A (APROORCH) and
selected option: Move resource group.
• The cluster started moving to Site B (BPROORCH) but got stuck at one point with below warning
message.
WARNING: Cluster has been running recovery program 'TE_RG_MOVE' for 360 seconds. Please
check cluster status.
• We waited for 45 minutes, but it was stuck at the above point only.
• To resolve this, we restarted both the nodes and again started cluster at Site A.
• Executed halt at Site A and cluster successfully moved to Site B.
• We had raised case: TS005063238 to IBM for the error faced in the beginning.
• Feedback from IBM that this behaviour is a bug.
• Support quoted: “Here what I understood is the application was stopped by the user in advance,
since there was no monitor script configured the Power HA works as usual. When user-initiated
cl_stop command Power HA called the stop script and wait for the execution completion.”
Observations / Findings:
• Situation: The cluster was online at site B (BPROORCH) and Site A (APROORCH) was in halt mode.
• When we started the Site A (APROORCH) node from HMC, the Site B (BPROORCH) node
automatically got restarted.
• We created support case: TS005065070 to IBM for this issue.
Page 2 of 4
• Feedback from IBM: “When node APROORCH was restarted after the HALT, it caused node
BPROORCH to be rebooted by the RSCT (Reliable Scalable Cluster Technology) subsystem. This is a
known issue with RSCT subsystem and is resolved by installing apar IJ02843.”
Observations / Findings:
• The cluster was successfully moved from Site A (ASOLMAN) to Site B (BSOLMAN).
• Due to old profile (ASE sysbase DB) file on bsolman host, Application did not connect HANA
database.
• After copied new profile from asolman to bsolman issue has been resolved.
TSM Issue:
2. We will failback TSM-Backup Server cluster to Site-A after resolve “en2 adapter” issue on ATSM host.
Observations / Findings:
• We executed cluster failback command at Site B (BTSM) to Site A (ATSM).
• The cluster command returned error as below: “The PowerHA System Mirror adapter en2 is not
available on node atsm.”
Page 3 of 4
• Raised issue to IBM with case number: TS005108185.
• Initial Observation: The en2 adapter is in defined state at Site A (ATSM) and we must put the adapter
in available state. We are awaiting further instructions from IBM.
Page 4 of 4