Professional Documents
Culture Documents
Abstract—High value asset vehicles, or vehicles where The growth of fatigue cracks has been studied extensively
safety/operational readiness is important, can greatly benefit from [6] and several models have been developed to explain high
an accurate remaining useful life (RUL) estimate. For these assets, cycle fatigue degradation. As reported in [5], crack growth can
RUL allows operators to realize revenue because of improved be modeled as a linear elastics fracture where the crack surface
availability. This paper uses an improved hybrid, high cycle is forced directly apart. This is known as an opening mode
fracture mechanics model to more accurately estimate the RUL. (Mode 1) failure model.
The hybrid method’s performance is validated using trendability
and monotonicity against two existing models using a real-world In a Mode 2 failure model, the crack surfaces move normal
data set. The improved model allows a longer time period to to the crack from and remain in the crack plane. Head’s theory
marshal the resources needed for repair or give operations is one such model that used this failure mode. Finally, if the
personnel an extended window to bring other assets to cover crack surface moves parallel to the crack as it forms and remains
missions that would otherwise be unavailable. in the crack plain, this is a Mode 3 failure. One such modeling
Keywords-RUL; PHM; Condition Monitoring; High Cycle
using this process is Dislocation theory [5].
Fatigue In “Contending Remaining Useful Life Algorithms” [9], the
authors explored RUL algorithms based on Mode 1, 2 and 3
I. INTRODUCTION failure models. While no model was perfect in its estimation, the
Often the true cost of unscheduled maintenance is not the data suggested that the actual degradation mode was a
repair cost of the asset, but the opportunity cost. Opportunity combination of Mode 1 and Mode 3 fatigue failure. It is
cost is the lost revenue or ability to perform a mission because hypothesized that because of the components often have
the asset is not available. While condition-based maintenance complex shapes, as gears/bearings, the component suffers
practices can often improve readiness and safety, it may not fatigue damage as a combination of modes. Hence, it is proposed
address the need for a prognostic. A prognostic capability that fatigue failure is a combination of Mode 1 and Mode 3
estimates the remaining useful life (RUL) of a monitoring damage. Furthermore, a combined, hybrid model would
component. The ability to inform an operator of an asset’s RUL facilitate a better prediction of crack growth. This, in turn, allows
allows the operator to order long lead time parts, schedule the for an improved estimate of the RUL.
asset for maintenance, and have available the skill set requires to
restore the assets’ damaged component to service. This then The other improvement is observation that while crack
allows the asset to be availability to generate revenue as opposed growth is nonlinear (e.g., the rate of change of a crack length
to being down some indefinite time for maintenance. This grows faster as the crack length increases), the RUL is linear.
proactive maintenance policy reduces the opportunity cost of That is, for any given load, the RUL rate of change (dRUL/dt) is
unscheduled maintenance by shifting it into scheduled always -1. For example, if the life of a component is given as
maintenance. 100 hours, after one hour of usage, the life should logically be
99 hours. This seemingly obvious observation can be used
While several RUL techniques have been reported, this mathematically to improve the estimate of the RUL calculation
proposed new algorithm improves upon exiting, model-based as it is an extra observable when using state reconstruction
RUL methods. The term improvement implies that the estimate algorithms.
of RUL is more consistently accurate over time (monotonicity
and trendability). This improvement was achieved using two II. BACKGROUND OF THE IMPROVEMENT
novel ideas. Commercially, there are several applications of condition
The first improvement is the realization that fatigue damage, monitoring where the estimate of RUL would be beneficial. For
such as spalling of a bearing, gear tooth root bending, or shaft example, in the wind power generation industry, condition
coupling failure, could result from a combination of degradation monitoring allows better balance of plant, and optimization of
modes. resources. The opportunity cost is the lost revenue when the
machine is down. When a wind turbine fails, because of the cost
of the crane for major maintenance, knowing the RUL of a
𝜇 = 𝛽$𝜋⁄2 , 𝜎 ! = (2 − 𝜋⁄2)𝛽! (1) where S is the sample covariance of the CIs used for the HI
algorithm. The HI algorithm become:
This is useful, as it shows that for nominal components, the
spectrum should be Rayleigh. Note that both the mean value and 𝐻𝐼 = 𝑠𝑐𝑎𝑙𝑒⁄𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 √𝑪𝑰𝑻 𝑳𝑳𝑻 𝑪𝑰 (4)
variance share the term b, and that the PDF can be described It can be show [3] that the distribution the HI in (4) is a
solely by this value b. Hence as the component degrades, as Nakagami distribution. The critical value is set by using the
measured by an increasing mean value or variance, there will be inverse cumulative probability function (ICPF). The ICPF
evidence to reject the null hypothesis that the component is requires a probability of false alarm. Usually, this is taken to be
healthy. small, such as 1e-6. Additionally, it requires the descriptive
As we noted, there are several different failure modes for any statistics for the Nakagami (h, w) where h is the n, number of
give components. It is also the case that not ever CI is sensitive CIs used in the HI algorithm and:
to each failure mode. For example, a bearing will have a cage, 𝜔 = 2𝑛⁄(2 − 𝜋⁄2) (5)
ball (roller element), inner race or outer race fault. To evaluate
the condition of the bearing, these four CIs would be needed to The scale value is used to define the HI value when the
evaluate the component health. Hence, from the perspective of component is deemed no longer nominal. That is, if the scale
the prognostic architecture, there is a choice. Should an RUL be value is 0.35, the component is identified at damaged. For a HI
estimated for each CI, or should the CIs be fused into a health using four CIs, the critical value for a PFA of 1e-6 is: 9.9744.
indicator (HI).
V. RUL CALCULATION
In the case of a CI approach to RUL, each CI is propagated The RUL will be defined at the time from the current HI
forward in time to calculate its RUL. The component CI is then value from (4), until the HI meets some upper threshold level.
the minimum of all the calculated RUL. This increases the This level represents damage where it is appropriate to do
computation burden and from a display perspective is not ideal maintenance. For aviation, and most application where the
– its far easier to calculate one HI and generate one RUL which desire is to optimize operational readiness, the RUL does not
is assigned directly to the component. represent the time of failure. The RUL is simply the time where
In using the HI approach, we can control the false alarm rate to not perform maintenance significantly reduces the reliability
by observing the HI is a function of the CIs distribution. This of the system. This paradigm ensures the continued safety and
will only be true if the CIs are independent (no correlation) and availability of the asset and supports existing maintenance task.
identical distributions (a term call IID). As we can show that Consider how the OEM develops a scheduled maintenance
many of the CIs are nearly Rayleigh (or can be transformed to interval. By analysis of the expected loads, the OEM determines
be Rayleigh), the issues resolve to making each Rayleigh CI both an inspection interval that should catch any propagating fault
identical and independent. By dividing by the CIs standard prior to the next inspection. Essentially, the interval is chosen
deviation, b will be: 1.5264 = 1⁄$2 − 𝜋⁄2. Hence, this is a such that the total system reliability remains high. For aviation,
method to make the CIs identical. the original equipment manufacturer (OEM) designs the
components to a reliability of “6-9s”, or that the probability of a
The process of removing correlation between CIs is known
as whitening. Whiting ensures that the realized false alarm rate component failure is less than 1e-6 per flight hour.
30
20
RUL (Hours)
10
-10
-20
-30
-40
-300 -250 -200 -150 -100 -50
Error (Hours)
700
Hybrid Model -1.96 hours 10.72 hours
500
capture the dynamics and complexity of the propagation of the
RUL over time. Therefore, a more satisfactory metric was given
400
in [8], where Dr. Coble introduces the concept of monotonicity
300 and trendability as RUL performance metrics.
200 Monotonicity is usually adopted to state the underlying
100
positive or negative trend of a series of health indicators. It is
also used to evaluate the fitness of the extracted health indicator
for RUL prediction. In [8], monotonicity can be measured given
0
-700 -600 -500 -400 -300 -200 -100 0
Remaining Useful Life (Hours)
by
Figure 3. Comparison of RUL Model.
æ h+ h- ö
I monotonicity =mean ç indicator - indicator ÷
Note that on this fault, which is a high-speed bearing, both è n -1 n -1 ø
(14)
the Linear Elastic and Dislocation models overshot ideal RUL
from -600 to -300 hours. The hybrid model is much closer to the Where n is the number of observations in a particular
ideal RUL. This is due to the extra observable input of -1 for history, the monotonicity of a series of RUL is given by the
dRUL/dt. Figure 4 compares the performance from -300 to 50