Professional Documents
Culture Documents
net/publication/343678095
CITATIONS READS
0 94
7 authors, including:
Some of the authors of this publication are also working on these related projects:
Optimización de la Gestión del Mantenimiento y Análisis crítico de Indicadores de Benchmarking bajo el enfoque integral de la Gestión de Activos (ISO 55000). View project
All content following this page was uploaded by Carlos Parra on 15 August 2020.
Carlos Parraa, Adolfo Crespoa, Vicente Gonzáleza, Fredy Kristjanpollerb, Pablo Viverosb,
Gabriel Llortc, Alfredo Aguilarc
a
Department Industrial Management School of Engineering of the University of Seville, España
parrac@ingecon.net.in, adolfo@esi.us.es, vicente.gonzalezprida@gdels.com
a
Department of Industrial Engineering, University Federico Santa María, Valparaíso, Chile
pablo.viveros@usm.cl, fredy.kristjanpoller@usm.cl
c
MWH Global, Construction Project: Third Set of Locks in the Panamá Canal, Panamá
gabriel.llort@mwhglobal.com, alfredo.aguilar@mwhglobal.com
Abstract
The purpose of this chapter book, is to provide a Maintenance and Reliability Management Model for the
project: Design and Construction of the Third Set of Locks in the Panama Canal, with the approach of the
process of asset management optimization. A practical vision of the maintenance and reliability management
process and framework is presented with the idea of:
• Structuring the maintenance management process by grouping management activities within a series of
so-called management building blocks;
• Structuring the framework grouping techniques that can be used to support decisions to be taken within
each of these building block.
The chapter is divided in three sections. The first one presents a generic model proposed for maintenance and
reliability management, which integrates other models found in the literature for built and in-use assets, and
consists of eight sequential management building blocks (Crespo, 2007, Parra and Crespo, 2012). The different
maintenance engineering techniques are playing a crucial role within each one of those eight management
building blocks. Following this path, it characterizes the “maintenance management framework”, i.e. the
supporting structure of the management process. Additionally, in this section, are described, the roles of
managers and supervisors, who will perform the activities in the maintenance management model proposed.
The second section deals an introduction to the principles of RCM (Reliability Centered Maintenance) and
describes an RCM process that incorporates risk-based decision tools. The content can be readily assimilated
by the future maintenance and operations people at all levels, in the start of operations of the third set of locks
in the Panama Canal. RCM will provide a systematic approach for defining maintenance tasks for critical
systems.
The third section is related with the three basic indicators of maintenance management, availability, reliability
and maintainability. We hereafter shall present the parameters to be used in the calculation of these indexes and
how to use these indicators in the process of maintenance optimization.
Finally, this chapter book presents not only a process but also the framework and techniques to manage and
improve maintenance and reliability effectiveness and efficiency. This report will be used to assist different
plant teams to elaborate the optimal strategies for maintenance and inspection for the assets, specified for the
project: Third Set of Locks in the Panama Canal.
Keywords: Maintenance and Reliability Management Model, Asset Management, RCM: Reliability Centered
Maintenance.
1
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
1. MAINTENANCE MANAGEMENT MODEL PROPOSED FOR THE
PROJECT: THIRD SET OF LOCKS IN THE ACP (AUTORIDAD DEL
CANAL DE PANAMÁ)
The Maintenance Management Models are frequently associated with a wide range of
difficulties. Why is this function, at least in appearance, so difficult to manage? We have
carried out a review of literature to find out some of the reasons:
- Lack of maintenance management models (Parra and Crespo, 2012). There is a lack
of models that could improve the understanding of the underlying dimensions of
maintenance. Maintenance is somewhat “under-developed” with a lack of effective
prevention methodologies and the integration of said methods in manufacturing
companies in most continents;
- Lack of time to complete the analysis required. Many managers indicate how they do
not have the required time to carry out suitable maintenance problems analysis. Day
to day actions and decision making activities distract them from these fundamental
activities to improve maintenance;
2
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
Some authors (Parra and Crespo, 2012) have worked on the characterization of the
complexity found in managing the maintenance function in a production environment,
creating tools where we are able to value each one of previously reviewed factors for a certain
organization (with a degree of fulfilment – DFi), and evaluate them according to
environmental aspects (with a relevance factor – RFi). The maintenance management
complexity index can be helpful as one way of comparing across different production
environments to help decide the relative effort and resources required to maintain them.
The maintenance management process can be divided into two parts: the definition of the
strategy, and the strategy implementation. The first part, definition of the maintenance strategy,
requires the definition of the maintenance objectives as an input, which will be derived directly
from the business plan. This initial part of the maintenance management process conditions the
success of maintenance in an organization, and determines the effectiveness of the subsequent
implementation of the maintenance plans, schedules, controls and improvements. Effectiveness
shows how well a department or function meets its goals or company needs, and is often
discussed in terms of the quality of the service provided, viewed from the customer’s
perspective. Effectiveness concentrates then on the correctness of the process and whether the
process produces the required result (Vagliasindi, 1989; Wireman, 1998 and Palmer, 1999).
The second part of the process, the implementation of the selected strategy has a different
significance level. Our ability to deal with the maintenance management implementation
problem (for instance, our ability to ensure proper skill levels, proper work preparation, suitable
tools and schedule fulfilment), will allow us to minimize the maintenance direct cost (labour
and other maintenance required resources). In this part of the process, we deal with the
efficiency of our management, which should be less important. Efficiency is acting or producing
with minimum waste, expense, or unnecessary effort. Efficiency is then understood as providing
the same or better maintenance for the same cost.
In this report, we present a generic model proposed for maintenance management integrates
other models found in the literature (Pintelon and Gelders, 1992; Vanneste and van
Wassenhove, 1995) for built and in-use assets, and consists of eight sequential management
building blocks, as shown in Figure 1 (Parra and Crespo, 2012). The first three building blocks
condition maintenance effectiveness, the fourth and fifth ensure maintenance efficiency, blocks
3
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
six and seven are devoted to maintenance and assets life cycle cost assessment, finally block
number eight ensures continuous maintenance management improvement. The maintenance
management model proposed to the project: Third Set of Locks in the ACP, is presented below
(model is based on the Asset Management Standards ISO 55000, 55001 and 55002).
In this section, we will briefly introduce each block and discuss methods that may be used to
improve each building block decision-making process (Figure 2) (Parra and Crespo 2012).
4
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
Figure 2. Sample of techniques within the maintenance management framework
Source: Crespo Marquez, A. (2007), The Maintenance Management Framework, Models
and Methods for Complex Systems Maintenance, Springer, London.
Regarding the definition of maintenance objectives and key performance indicators – KPI’s
(Phase 1), it is common the operational objectives and strategy, as well as the performance
measures, are inconsistent with the declared overall business strategy (Gelders et al., 1994). This
unsatisfactory situation can indeed be avoided by introducing the balanced scorecard – BSC
(Kaplan and Norton, 1992). The BSC is specific for the organization for which it is developed
and allows the creation of KPIs for measuring maintenance management performance which
are aligned to the organization’s strategic objectives.
Once the maintenance objectives and strategy are defined, there are a large number of
quantitative and qualitative techniques which attempt to provide a systematic basis for deciding
what assets should have priority within a maintenance management process (Phase 2), a decision
5
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
that should be taken in accordance with the existing maintenance strategy. Most of the
quantitative techniques use a variation of a concept known as the “probability/risk number” –
PRN (Moubray, 1997).
Assets with the higher PRN will be analysed first. Often, the number of assets potentially at risk
outweighs the resources available to manage them. It is therefore extremely important to know
where to apply available resources to mitigate risk in a cost-effective and efficient manner. Risk
assessment is the part of the ongoing risk management process that assigns relative priorities for
mitigation plans and implementation. In professional risk assessments, risk combines the
probability of an event occurring with the impact that event would cause. The usual measure of
risk for a class of events is then R = P x C, where P is probability and C is consequence. The
total risk is therefore the sum of the individual class-risks (Parra and Crespo, 2012). The
procedure to follow in order to carry out an assets criticality analysis following risk assessment
techniques could be then depicted as follows:
(1) define the purpose and scope of the analysis;
(2) establish the risk factors to take into account and their relative importance;
(3) decide on the number of asset risk criticality levels to establish; and
(4) establish the overall procedure for the identification and priorization of the critical
assets.
Notice that assessing criticality will be specific to each individual system, plant or business unit.
For instance, criticality of two similar plants in the same industry may be different since risk
factors for both plants may vary or have different relative importance.
Once the assets have been prioritized and the maintenance strategy to follow defined, the next
step would be to develop the corresponding maintenance actions associated with each category
of assets. Before doing so, we may focus on certain repetitive – or chronic – failures that take
place in high-priority items (Phase 3).
Finding and eliminating, if possible, the causes of those failures could be an immediate
intervention providing a fast and important initial payback of our maintenance management
strategy. The entire and detailed equipment maintenance analysis and design could be
accomplished, reaping the benefits of this intervention if successful.
There are different methods developed to carry out this weak point analysis, one of the most
well known being RCFA (Root Cause Failure Analysis). This method consists of a series of
actions taken to find out why a particular failure or problem exists and to correct those causes.
Causes can be classified as physical, human or latent. The physical cause is the reason why the
asset failed, the technical explanation on why things broke or failed. The human cause includes
the human errors (omission or commission) resulting in physical roots. Finally, the latent cause
includes the deficiencies in the management systems that allow the human errors to continue
unchecked (flaws in the systems and procedures). Latent failure causes will be our main concern
6
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
at this point of the process. Note that although informal RCFA techniques are usually used by
individual or groups to determine corrective actions for a problem, they have limitations that
can make the development of long-term solutions difficult
Designing the preventive maintenance (PM) plan for a certain system (Phase 4) requires
identifying its functions, the way these functions may fail and then establish a set of applicable
and effective PM tasks, based on considerations of system safety and economy. A formal
method to do this is the RCM (Parra and Crespo, 2012). . RCM methodology allows the
identification of real maintenance needs starting from the analysis of the 7 questions:
- What are the functions and associated performance standards of the asset in its present
operating context?
- In what ways does it fail to fulfil its functions?
- What causes each functional failure?
- What happens when each failure occurs?
- In what way does each failure matter?
- What can be done to prevent each failure?
- What should be done if a suitable preventive task cannot be found?
Optimization of maintenance planning and scheduling (Phase 5) can be carried out to enhance
the effectiveness and efficiency of the maintenance policies resulting from an initial PM plan
and program design.
Models to optimize maintenance plan and schedules will vary depending on the time horizon of
the analysis. Long-term models address maintenance capacity planning, spare parts
provisioning and the maintenance/replacement interval determination problems, mid-term
models may address, for instance, the scheduling of the maintenance activities in a long plant
shut down, while short-term models focus on resources allocation and control (Duffuaa, 2000).
Modelling approaches, analytical and empirical, are very diverse. The complexity of the
problem is often very high and forces the consideration of certain assumptions in order to
simplify the analytical resolution of the models, or sometimes to reduce the computational
needs.
For example, the use of Monte-Carlo simulation modelling can improve PM scheduling,
allowing the assessment of alternative scheduling policies that could be implemented
dynamically on the plant/shop floor (Parra and Crespo, 2012).
7
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
1.2.6 Maintenance Execution Assessment and Control
The execution of the maintenance activities, once designed planned and scheduled using
techniques described for previous building blocks have to be evaluated and deviations controlled
to continuously pursue business targets and approach stretch values for key maintenance
performance indicators as selected by the organization (Phase 6). Many of the high-level
maintenance KPIs, are built or composed using other basic level technical and economical
indicators. Therefore, it is very important to make sure that the organization captures suitable
data and that data are properly aggregated/disaggregated according to the required level of
maintenance performance analysis.
A life cycle cost analysis (Phase 7) calculates the cost of an asset for its entire life span (Figure
4) (Parra and Crespo, 2012). The analysis of a typical asset could include costs for planning,
research and development (R&D), production, operation, maintenance and disposal. Costs such
as up-front acquisition (research, design, test, production and construction) are usually obvious,
but life cycle cost analysis crucially depends on values calculated from reliability analyses such
us failure rate, cost of spares, repair times, and component costs. A life cycle cost analysis is
important when making decisions about capital equipment (replacement or new acquisition)
(Campbell and Jardine, 2001), it reinforces the importance of locked in costs, such as R&D, and
it offers three important benefits:
This section summarizes the process (the course of action and the series of stages or steps to
follow) and the framework (the essential supporting structure and the basic system) needed to
manage and optimize the maintenance strategies in theThird Set of Locks of the ACP.
This section contains the minimum requirements of the theoretical knowledge for a
maintenance and reliability manager in general. However, this document aims to fulfil the
intention to be comprehensive enough and include the essential and fundamental knowledge
that any expert in maintenance management needs to have, regardless of which company or
in which country he or she is working. The requirements cover the following areas (EFNMS
9
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
publication of June 19th 2006. The requirements of Competencies and Responsibilities for
an European Expert in Maintenance and Realibility):
Within this area, it is essential to have a very good knowledge about the importance of
maintenance for the economy in the company, for achieving production goals and for the
quality of the product, and so on. It is important to have good knowledge of how maintenance
activities are organized. Therefore, the following knowledge is necessary:
12
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
2.2. RCM IMPLEMENTATION
2.2.1 The process and de review team
Hereafter we present the proposed scheme for implementation of RCM. The success of this
process will depend basically on the selection of the proper RCM review team. This team will
have the responsibility of answering the seven basic questions of the RCM, according to the
scheme in Figure 4.
People with different functions in the organization will form the RCM review team. This
team will work jointly for a certain period of time in a positive atmosphere, to analyses
common problems with the different departments and with a common goal.
The facilitator plays an important role in this team; his/her basic function consists in guiding
and leading the implementation of the RCM process. The activities that the instructor should
undertake are:
- Guide the review team during the failure modes and effects analysis (FMEA) and
the selection of maintenance activities;
- Help to select the decision level to be used in the failure mode and effects analysis;
13
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
- Help to identify the critical assets that should be analyzed under this methodology;
- Ensure that the work meetings are led in a professional manner;
- Ensure real consensus;
- Motivate the team;
- Ensure that all required information is in place when needed;
- Ensure that results are correctly recorded.
The selection of the assets, systems or equipment where RCM will be applied can be carried
out using techniques explained in the previous chapters. The correct definition of the
operational context will be a requirement to do this phase properly. Operational context
definition will require certain information to be gathered. A graphical tool that eases the
visualization of the overall operational context is the IPO (Input—Process—Output)
diagram, which can be synthesized and represented as in Figure 5. In Figure 5, insumes are
raw materials or resources to be transformed or converted. The process will be divided into
systems that will have a certain function (or group of functions). Maintenance efforts will be
then concentrated on each one of the systems functions.
FMECA is recognized as the most fundamental tool that is used in RCM. Due to its practical
and qualitative approach, it is also the most widely understood and applied form of reliability
and risk analysis found throughout industry. Given a specific process, FMECA deals with
the identification of its failure modes, failure causes and frequencies (reliability), and the
14
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
effects that might result if any specific failure occurs during the process operation (risk).
Based on the information provided by FMECA, design and management personnel are better
informed about the way to determine what can be done in order to avoid or mitigate failure
modes.
1. Description of functions;
2. Description of functional failures;
3. Failure modes (failure rate data) definition;
4. Description of failure mode effects and criticality (RPN: Risk priority
number).
Each item or equipment usually has more than one function. They can be divided into five
categories:
- Primary Functions. These are functions required to fulfil the intended purpose of the
item;
- Auxiliary Functions. These are functions that support the primary function;
- Protective and Control Functions. These are functions intended to control a process
and protect people, equipment, or the environment;
- Information Functions. These are related to alarms, and the monitoring of several
conditions;
- Interface Functions. These are functions that apply to the interface between two items.
The interface may be active or passive.
FMECA
Item: Heat Exchanger without Change of Phase
15
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
2.2.3.2 Defining Functional Failures
A functional failure is defined as the inability of the equipment to keep a desired standard of
performance (function). Functional failures vary in degree of magnitude; for example, a
pump may have no output or may have its output restricted. An example of definition of
functional failures for heat exchangers is shown in Table 2.
FMECA
Item: Heat Exchanger without Change of Phase
Auxiliary -Contain cooling and heating fluids Unable to contain cooling and heating fluids
/ catastrophic
A failure mode is defined, in RCM as the physical cause of the functional failure. Only failure
modes with a high occurrence possibility are recorded. It is not recommended to list every
single failure possibility. Reasonably likely failure modes include the following:
Additionally, we need to estimate the mean time to failure (MTTF) for each failure mode.
The MTTF is expressed as the expected mean time to failure expected of a given failure
mode. In this case, the ideal situation is to have valid historical data for the equipment in the
operational context. In most cases, plant-specific data is unavailable or may have a low
reliability level to allow its use without corroborating it. The uncertainties of data selection
can be reduced by learning as much possible about data sets, taxonomy, equipment
boundaries, used equipment type, equipment design and construction, process medium, plant
operation, maintenance programs, and failure modes. OREDA, IEEC, PERD - Std 500-1994,
Reliability Data Book for components in Chemical Process (CCPS), etc ., are examples of data
sets that provide details of taxonomy, data origin, treatment, and limitations. An example of
definition of failure modes and MTTF data for heat exchangers is shown in Table 3.
17
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
FMECA
Item: Heat Exchanger without Change of Phase
2.2.3.4 The Description of the Failure Modes Effects and Criticality Analysis
The failure effects describe what would happen if the failure mode occurs, and are related to
issues such as downtime, effects on product quality, evidence that the failure has occurred,
and threats to safety and environment. The description of these effects should include all the
information needed to support the evaluation of the consequences of the failure (criticality
analysis). When describing the effects of a failure, the following issues should be recorded:
The impact that a failure mode has on the organization depends on the operating context of
the asset, the performance standards, which apply to each function and the physical effects
of each failure mode. This combination of context, standards and effects means that every
failure has a specific set of consequences connected to it.
An example of failure mode effects definition for heat exchangers is shown in Table 4.
19
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
2.2.3.4.1 RCM and the Hidden Failures
Equipment, in most cases, has more than one function. When one of these functions fails and
someone may notice the failure or the fault state of the equipment, failures are said to be
evident. However, in some occasions, no one knows that the equipment is in fault state unless
another failure takes place. The first failure, the one that remained unnoticed until another
took place, was not evident on its own. These are known as hidden failures.
To be able to understand this, suppose you have two pumps in a given operational context.
Pump C (reserve pump) is not available (fault state), this fact shall not be evident under
normal circumstances, since pump B is working under normal operation. In other words, the
fault/failure of pump C shall not have any direct impact on its own unless or until pump B
also fails. Pump C failure will not be evident under normal operating conditions unless other
failures do occur. Notice that failures in pump C will only have some consequence if another
failure – in this case in pump B - also takes place. When pump C is in fault state, the failure
in pump B is known as a multiple failure. Regarding this point, the review team must know
that all sole hidden failures will not have any direct consequences; however, they shall have
an indirect consequence increasing the risk level of multiple faults/failures. “The only
consequence of a hidden failure is the consequent increase of a multiple failure”.
The appearance of hidden failures, on their own, are not evident, in the normal operation
process; therefore, in order to identify or recognize hidden faults, the RCM review team
should answer the following question:
If the functional failure is caused by a failure mode on its own, is it evident under
normal operation conditions?
If the answer to the question is NO, the failure mode is a hidden failure (not evident), and if
the answer is YES, the mode is evident.
2.2.3.4.2 Risk Priority Number (RPN). Criticality Analysis for failures modes
Risk is the potential impact (positive or negative) to an asset or characteristic of value that
may arise from some present process or from some future event. In everyday usage, "risk" is
often used synonymously with "probability" and restricted to negative risk or threat. Risk
management is the ongoing process of identifying these risks and implementing plans to
address them. Some industries manage risk in a highly-quantified and numerate way. These
include, for instance, the nuclear power and aircraft industries, where the possible failure of
a complex series of engineered systems could result in highly undesirable outcomes (Crespo,
2007, Parra and Crespo, 2012).
Often, the number of assets potentially at risk outweighs the resources available to manage
them. It is therefore extremely important to know where to apply available resources to
mitigate risk in a cost-effective and efficient manner. Risk assessment is the part of the
ongoing risk management process that assigns relative priorities for mitigation plans and
20
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
implementation. In professional risk assessments, risk combines the probability of an event
occurring with the impact that event would cause. The usual measure of RPN for a class of
events is then R = P x C, where P is probability and C is consequence. The total risk is
therefore the sum of the individual class-risks.
Risk assessment techniques can be used to prioritize assets and to align maintenance actions
to business targets at any time. By doing so we ensure that maintenance actions are effective,
that we reduce the indirect maintenance cost, the most important maintenance costs, those
associated to safety, environmental risk, production losses, and ultimately, to customer
dissatisfaction.
The procedure to follow in order to carry out an assets criticality analysis following risk
assessment techniques could be then depicted as follows:
Risk factors considered in the RPN analysis were: safety, environment affection, operation
downtime, maintenance and direct and indirect cost of operations, failure frequency and
mean time to repair (Crespo, 2007, Parra and Crespo, 2012).
RPN = F x C (1)
Where F is the frequency factor or number of failures in a certain time period (year) and C is
consequence of the failure measured as follows:
C = E x Cr x MTTR (2)
With:
E : Effect on Lane Availability
Cr : Failure Mode Criticality
MTTR: Mean Time to Repair
Concerning the frequency of failures (F), the team decided to establish the classification
and scale in Table 5, to rank the different failures modes.
21
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
Failure Failures per Model
frequency (F) hours value
Regarding the different consequence factors (defining C), they were classified and scaled as
in Tables 6, 7 y 8.
Effect on Model
Lane Consequence scale
Availability (E)
Extremely
Single point of failure (no backup): lane unavailable 5
high
Very high 2nd order failure: If backup also fails: lane unavailable 4
None No effect 1
Failure Model
Mode Criticality scale
Criticality (Cr)
High Yes 4
Low No 1
Poor >8 4
Average <8 3
Good <4 2
Excellent <2 1
22
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
As a result of the above mentioned classification, maximum value of RPN for an failure mode
was set to 320 risk dimensionless units (notice that 320=4x5x4x4 when substituting in
Equations 2 and 1). The team established three levels of assets criticality as in Table 9.
In Table 10, presents an example for calculating the RPN. A total of 16 failures modes of the
subsystem are presented already sorted by their priority resulting from their estimated RPN.
A number of 2 failures modes out of 16 were found to be extremely critical, 2 very critical,
6 critical, 2 low criticality and 3 non-critical.
23
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
2.2.3.5. Selection of Maintenance Activities within RCM
Once the FMECA has been done, the natural RCM equipment should select the maintenance
activity that aides in the appearance of each previously identified failure mode, starting at the
logical decision tree (tool designed by RCM, that permits selection of the most adequate
maintenance activity to prevent the appearance of a fault mode or decease its possible
effects). After the selection of the type of maintained activity from the logical decision tree,
one has to specify the maintenance action to undertake, associated to the selected
maintenance activity, with the respective frequency of execution, keeping in mind that one
of the main objectives of RCM, is preventing or at least reducing possible consequences to
human, environmental and operative security, that may arise from the appearance of different
fault modes (Crespo, 2007). The first step to the selection of maintenance activities, consists
on the identification of the consequences that generate failure modes (see Figure 6).
yes no
24
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
2.2.3.5.1 Preventive Activities: Condition Maintenance Tasks
Many preventive maintenance activities can be based on the equipment condition. This is
due to the fact that the equipment conditions do not change instantaneously when the failure
takes place (function loss), but they normally follow a certain continuous deterioration
process during a period of time. A potential failure can then be defined as an identifiable
physical equipment condition that indicates that a functional failure is about to happen, or is
happening, during the process. The moment in time when it is possible to detect the
occurrence of a functional failure, or that a failure is about to occur, is known as the potential
failure time. Common examples of potential failures are:
The behavior over time of the equipment condition is illustrated in Figure 7. This figure
shows how a certain equipment condition that starts to deteriorate (beginning point “I”; often
this point may not be detected); then this condition reaches a point when the failure may be
detected (potential failure point “P”); finally, if the failure is not detected nor corrected the
equipment condition gets to a point where the functional failure takes place (point “F”, where
the equipment does not fulfil the function any longer).
P
Condition
F
Time of operation
These are periodical activities carried out to restore a part of an item (system, equipment,
part) to its original condition. Of course the time interval between two consecutive scheduled
actions will be shorter than the operative life limit of the item part to be restored. During this
type of preventive maintenance action, items are taken out of service, unarmed, put aside and
inspected in a general manner, corrected and replaced if necessary, in order to prevent the
appearance of possible failure modes. In case of large equipment or systems, these scheduled
restoration tasks are generally known as “overhauls” and they are common in equipment such
as compressors, turbines, broiler, furnaces, engines, etc. Restoration includes different
actions such as: adjustment, inspection, improvement, cleaning, restoration and even
replacement.
This type of preventive activity is oriented towards replacement of components or used parts
of an active, for new ones, at intervals of time shorter than that of their useful life (before
they fail). The programmed discarding activities shall return the component to its original
condition, since the old component shall be replaced with a new one. The difference between
the discarding and the restoration tasks is that the first are applied to components and or parts
of an active and not to complex actives (actives with many components), and then the
undertaking of the programmed discarding is specifically to the replacement of an old
component for a new one. In the case for programmed restoration tasks the actions to be
undertaken may be: adjusting, inspection, improvement, cleaning, restoration and even
exchanging old parts or pieces for new ones.
As was defined earlier, the hidden fault modes are no evident under normal operation
conditions, therefore this type of fault does not have direct consequences, but these do
originate appearance of multiple faults in a determined operational context. One of the ways
to reduce the occurrence possibility of hidden faults is to periodically check if a hidden
function is working correctly. These checks are known as revision tasks for hidden faults.
In conclusion, the revision task consist on the checking action of the actives with hidden
functions at regular interval of time, in order to detect if the hidden functions are at normal
operation or at failure stage.
When preventive activities are not technically feasible, or they are ineffective, for a certain
failure mode, corrective activities shall be those that apply. Possible corrective actions can
be as below.
26
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
2.2.3.5.5 Corrective Activities: Redesign
In the case, that one may not finding preventive actions that aide in the reduction of fault
modes that affect security or the environment to an acceptable level, a re-designing is
necessary to minimize or eliminate the consequences of failure modes.
In the that there are no preventive activities that may result least costly than the possible
effects that may arise from the fault modes with safety, environment or operational
consequences (minimal risk), the decision to wait for a failure or act in a corrective manner
may be taken.
An example of maintenance activity definition from the logical decision tree is shown in
Table 11 (failures modes and RPN were taken from Table 10).
Table 11. Maintenance activity definition from the RCM (logical decision tree)
27
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
RCMs can be used effectively for several purposes, besides identifying safety or reliability
failure modes and effects. The organizations can use RCM for:
Finally, it is important to coordinate these activities, so that the most effective use can be
made of RMC in all of them, and to ensure that RCM are available at the right time and to
the right people.
KPIs selection is an important decision making process that may have many potential
implications (Parra and Crespo, 2012, Crespo, 2007, Kaplan and Norton, 1992). For this
report, the KPIs proposed, are related with the three basic technical indicators: Availability,
Reliability and Maintainability (Jardine, 1999). We hereafter shall present the parameters to
be used in the calculation of these indexes, see Figure 8 (it is recommended that these
indicators can be calculated within the software MAXIMO).
Operative time
TTF TBF
TTF
1 f1 f2 fi
TOC
0
TTR
DT Unavailable time
“The probability that a team fulfils a specific mission (no failure) under specific operation
conditions during a specific period”.
Reliability is related to the failure rate (mount of failures) and with the mean time to failure
(MTTF, operative time to failure). Whilst the number of faults of a specific asset is increasing
or while the MTTF decreases, its reliability will be less.
n = number of failures.
“The possibility for an equipment to be returned to the stage to be able to fulfill the mission
in a given time period, after the appearance of a fault, using pre-established maintenance
procedures”.
The maintainability relates to the design and complexity of the equipment, with the qualifies
personal to undergo the maintenance, with the available tools and the maintanance
procedures. The fundamental parameter to calculate maintainability is constituted by the
mean time to repair (MTTR). When the MTTR for specific equipment is high, the equipment
have a low maintainability (its time to repair shall be reduced). In the contrary, if the mean
29
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
time to repair for specific equipment is low, it is considered that the equipment have a high
maintainability.
n = number of failures.
“The possibility for an equipment to be able to fulfill its mission at any given time”
The availability relates the mean downtime (MDT , represents maintainability) and the mean
time to failure (MTTF, represents, reliability).
From the three mentioned indicators, availability constitutes information from the parameter
most represented and useful of the management. Upon calculating availability is easier in
comparison to the calculus of the other two parameters and interrelated to reliability and
maintainability.
Availability (Ao): operational availability takes into account the non-operative time of the
team in a general manner (from the time comes out of service until it is put into action again),
i.e. it includes the delay (but does not estimate nor quantifies) which bring about the logistic
of the maintenance (purchase of parts, transportation, unspecified idleness, etc.). The
equation to calculate operational availability (Ao) is:
MTTF
Ao = x 100%
MTTF + MDT (5)
Where,
n: number of failures
MTTF: mean time to failure
MDT: mean downtime
MTOC: mean time out control
30
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
MDT = MTTR + MTOC (6)
• Given the following failure distribution (Figure 9) of the equipment X, for a period
of 53 hours, calculate:
- Reliability: MTTF
- Maintainability: MTTTR and MDT
- Availability: Ao
31
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
a) to calculate MTTF (reliability), apply equation 3:
MTTF = (7 + 6 + 5 + 7+ 8+ 7) / 6 = 40/6 = 6.66 hours
The KPIs plays an important role in the efficiency of the maintenance organization (Jardine,
1999). Generally, equipment records are classified under four categories: inventory,
maintenance cost, files, and maintenance work performed. KPIs are used in various areas
including troubleshooting breakdowns, investigating incidents, procuring new equipment to
determine operating performance trends, performing life cycle cost and design studies,
conducting replacement and modification studies, and conducting reliability and
maintainability studies. Progressive maintenance organizations measure their performance
on a regular basis through various means. KPIs analyses play an important role in
maintenance organization efficiency and are useful in revealing equipment downtime,
peculiarities in operational behavior of the organization, and so on. Finally, maintenance
management must ensure calculate the three KPIs shown in this section, in order to minimize
uncertainty in the process of decision making in the areas of: reliability, maintainability and
availability.
This section presents some general recommendations to MWH Global for optimizing the
processes in the areas of Maintenance and Reliability Management within the project: Design
and Construction of the Third Set of Locks in the ACP (AUTORIDAD DEL CANAL DE
PANAMÁ). The following key aspects to consider in this project are shown below (Peters,
2006 and Crespo, 2007):
33
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
resources and support to ensure that maintenance has the total technical capability to
maintain all equipment and systems.
- Continuously improve reliability and maintainability. Machines and systems will be
specified, designed, retrofitted, and installed with greater reliability and ease of
maintainability. Equipment design will focus on maintainability and reliability and
not primarily on performance. Design for maintainability will become an accepted
philosophy that fully recognizes the high cost of maintenance in the life-cycle cost of
equipment. The causes for high life-cycle costs will be reduced through the
application of good maintainability and reliability principles during design.
- Manage life-cycle cost and obsolescence. The life-cycle costs of physical assets and
systems will be closely monitored, evaluated, and managed to reduce total costs. A
profit- and customer-centered strategy will achieve significant reductions in total life-
cycle costs through an effective design process prior to purchase and installation.
During the equipment’s operating life, systems will be developed to continually
monitor equipment costs. Information to identify trends will be available to highlight
equipment with high maintenance costs.
- Minimize uncertainty and eliminate root causes. Uncertainty will be minimized
through effective preventive/predictive maintenance programs and through
continuous application of reliability-centered maintenance techniques and continuous
monitoring systems. Effective preventive/predictive maintenance programs will be
used to anticipate and predict maintenance problems in order to eliminate the
uncertainty of expected breakdowns and high repair costs. Preventive/predictive
maintenance tasks will be adequately planned based upon criticality of failure and
will cover all major assets within the operation.
- Maximize use of computerized maintenance management and enterprise asset
management. Systems that support the total maintenance operation will improve the
quality of maintenance and physical asset management and be integrated with the
overall business system of the organization. Computerized maintenance management
systems (CMMS) will provide greater levels of manageability to maintenance
operations. CMMS will cover the total scope of the maintenance operation providing
the means to improve the overall quality of maintenance management. Enterprise
asset management (EAM) will provide a broader scope of integrated software to
manage physical assets, human resources, and parts inventory in an integrated system
for maintenance management, maintenance, procurement, inventory management,
human resources, work management, asset performance, and process monitoring.
- Use maintenance information to manage the business of maintenance. The
maintenance information system and database will encompass the total maintenance
function and provide real-time information to improve maintenance management.
The implementation of CMMS and EAM provides the opportunity for improved
maintenance information systems. With CMMS and EAM, the maintenance
information system can be developed and tailored to support maintenance as a true
“business operation.”
- Ensure an effective maintenance storeroom operation. The maintenance storeroom
will be orderly, space efficient, labor efficient, responsive, and provide the effective
cornerstone for maintenance excellence. The maintenance storeroom for maintenance
repair operations (MRO) items will be recognized as an integral part of a successful
34
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
maintenance operation. Initial storeroom design or modernization will include
effective planning for space, equipment, and personnel needs while providing a
layout that ensures efficient inventory control and includes maximum loss control
measures. It will be professionally managed and maintained in a clean, orderly, and
efficient manner.
- Establish a safe and productive working environment. Successful maintenance
operations will be safe, clean, and orderly because good housekeeping is an indicator
of maintenance excellence. Maintenance leaders will provide a working environment
where safety is a top priority, which in turn allows maintenance to set the example
throughout the organization. Good housekeeping practices in maintenance will
provide the basic foundation for safety awareness. Maintenance will provide support
throughout the organization to ensure that all work areas are safe, clean, and orderly.
- Aggressive support compliance to environmental, health, and safety requirements.
Maintenance must provide proactive leadership and support to regulatory compliance
actions. Maintenance leaders must maintain the technical knowledge and experience
to support compliance with all state and federal regulations. The issue of indoor air
quality must receive constant attention to eliminate potential problems. Maintenance
must work closely with other staff groups in the organization such as quality and
safety to provide a totally integrated and mutually supportive approach to regulatory
compliance.
- Continuously evaluate, measure, and improve maintenance performance and service.
Broad-based measures of maintenance performance and customer service will
provide a continuous evaluation of the value of maintenance. CMMS will allow for a
broad range of measurement for maintenance performance and service. Investment in
best maintenance practices will require valid return on investment. Projected savings
will be established and results will be validated. Measures will be developed in areas
such as labor performance/utilization, compliance to planned repair and
preventive/predictive maintenance schedules, current backlog levels, emergency
repair hours, storeroom performance, asset uptime, and availability.
5. REFERENCES
Campbell, J.D. and Jardine, A.K.S. (2001), Maintenance Excellence, Marcel Dekker, New
York, NY.
35
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
Duffuaa, S.O. (2000), “Mathematical models in maintenance planning and scheduling”, in
Ben-Daya, M., Duffuaa, S.O. and Raouf, A. (Eds), Maintenance, Modelling and
Optimization, Kluwer Academic Publishers, Boston, MA.
Kaplan, R.S. and Norton, D.P. (1992), “The balanced scorecard – measures that drive
performance”, Harvard Business Review, Vol. 70 No. 1, pp. 71-9.
Palmer, R.D. (1999), Maintenance Planning and Scheduling, McGraw-Hill, New York, NY.
Kaplan RS, Norton DP, 1992. The Balanced Scorecard - measures that drive performance.
Harvard Business Review, 70(1): 71-9.
Pintelon, L.M. and Gelders, L.F. (1992), “Maintenance management decision making”,
European Journal of Operational Research, Vol. 58, pp. 301-17.
Peters, R. A (2006), Maintenance Benchmarking and Best Practices, First Edition, McGraw-
Hill,, New York, NY.
Prasad Mishra, R., Anand, D. and Kodali, R. (2006), “Development of a framework for
world-class maintenance systems”, Journal of Advanced Manufacturing Systems, Vol. 5 No.
2, pp. 141-65.
36
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01
Vanneste, S.G. and van Wassenhove, L.N. (1995), “An integrated and structured approach
to improve maintenance”, European Journal of Operational Research, Vol. 82, pp. 241-57.
37
www.linkedin.com/in/carlos-parra-6808201b
https://www.linkedin.com/groups/4134220 Draft-v-01