Professional Documents
Culture Documents
RCA - SEADRILL - E2-PM00051911 - Hyperion - Task Flows Are Not Running - E2-IM...
RCA - SEADRILL - E2-PM00051911 - Hyperion - Task Flows Are Not Running - E2-IM...
1. EXECUTIVE SUMMARY
Incident Resolution:
Hyperion team re-registered the HFM applications with shared services.
Customer Impact: Business Impact: There was no Business impact for the Hyperion business
users.
Service Impact: Corporate Application will not get the data and consolidation
schedules will not run.
System Criticality: High
Impacted Locations: None of the location were impacted. There was only
service impact of the Hyperion application.
Number of users impacted : N/A
After the e-mail monitoring is set, the alerting will be done automatically and
will notify the team on any issues with the state of the task flows.
Cause Code1 Application
Sub-Cause Code1 App: Human Error
Key actions to Eliminate
Initiating Root Cause: Action 1: DXC Hyperion functional support team and the Seadrill Hyperion
functional administrator to ensure that the task flows are updated with a
correct password every 2 weeks. – I need to understand the issue first
before I agree. Piling up the Task flows breaks the Task flow functionality. In
this case the password expiry was the cause for the task flow pileup.
Action 2: Calendar alert to be set in order to re-schedule the task flows
every week. – can we not find an automated solution? As we do not receive
the email alert for the password expiry to DXC email id’s, Either we have to
schedule Task flows with generic account so that password will never expire
or We need to have this calendar alert.
Contributory Causes: Action 3: DXC Hyperion team to set up an e-mail monitoring to alert the
team regarding the status of the task flows. – Please share the design with
me. We have steps for receiving Email alerts for Tasks which are either
completed or failed but the issue happens when the task flow remains in the
running status. We are checking with Oracle for further steps.
1 From the standard Cause and Sub-Cause Codes documented in PRBM Cause Codes Sub-
Cause Codes.
Q:2 Why there were a lot of active task flows in active state in the Hyperion application.
A: The task flows run with the functional team's Active Directory ID. This particular ID was
locked out, hence all the task flows were not able to authenticate and remained in the
active state. This has led to piling up of the task flows, which were not getting cleared
correctly and this caused the connection breakdown between the HFM module and the
Hyperion shared services.
Action: Action 2:Calendar alert to be set in order to re-schedule the task flows every week. Can
we not have an automated solution? As we do not receive the email alert for the
password expiry to DXC email id’s, Either we have to schedule with generic
account so that password will never expire or We need to have this calendar
alert.
Q:3 Why the particular active directory ID got locked out?
A: This is an individual ID to a person from the functional team. The password of this
individual ID was expired, which has caused the ID to get locked.
Action:
Q:4 Why was an individual user-id able to break the system? Why are jobs running under
user-id names and dependent upon have active credentials to enable jobs to run?
The functionality which was scheduled with the user ID caused the pileup and
break the connection specific to that functionality but not the entire system. As
the product behavior, it uses the active credentials to run the jobs.
Q: 4 Why the password was expired and did not get renewed?
A: It is the responsibility of the individual from the functional team to renew such
passwords and not let them expire and lock their ID accounts.
Action: Action 1:DXC Hyperion functional support team and the Seadrill Hyperion functional
administrator to ensure that the task flows are updated with a correct password every 2
weeks.
Q:5 How the DXC Hyperion support team is monitoring the task flows?
A: At that the time of the Incident there was no monitoring on the state of the task flows.
After the Incident was resolved, the DXC Hyperion support team has raised a SR
(Severity 2 SR 3-16157666521: HFM Task flows progress alert) to the Oracle vendor. –
Why raise an SR? Are DXC not able to identify how to monitor task flows? As per the
functionality it is possible to send an alert after a step in the Task flow completes or
Failure but not when it remains in running state. We are reaching Vendor if there is a
Q:6 Can we re-schedule task flows every week to avoid password expire issue?
A: The DXC Hyperion support team will set a calendar alert in order to re-schedule the
task flows every week.
Action: Action 2:Calendar alert to be set in order to re-schedule the task flows every week.
Resolution List
Completion
# Action Statement Action deliverable Action Owner Target date
date
DXC Hyperion functional support team Mitigation
and the Seadrill Hyperion functional
nageswararao.korra
1 administrator to ensure that the task 01/12/2017
pati@hpe.com
flows are updated with a correct
password every 2 weeks.
Calendar alert to be set in order to re- Corrective nageswararao.korra
2 24/11/2017
schedule the task flows every week. pati@hpe.com
DXC Hyperion team to set up an e- Corrective
nageswararao.korra
3 mail monitoring to alert the team 15/12/2017
pati@hpe.com
regarding the status of the task flows.
DXC Hyperion team to ensure Corrective
following the INCM process for the nageswararao.korra
4 30/11/2017
correct prioritization of Incidents pati@hpe.com
Checkpoint meeting: Held between DXC Hyperion support team, Scott Ainslie and Daniel Arciniega on the
13th of November
Problem Manager - Spas Tsanov
DXC ADM - Abhranil Dhar
Application Lead - Nageswararao Korrapati
DXC on call person that attended the War room calls – No WAR room was organized
Main CIM representative that managed the War room – Georgi Todorov
Main Seadrill representative that attended the War room calls – No WAR room was organized