You are on page 1of 7

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/335396971

ISO 26262 ASIL-Oriented Hardware Design Framework for Safety-Critical


Automotive Systems

Conference Paper · November 2019


DOI: 10.1109/ICCVE45908.2019.8965235

CITATIONS READS

8 11,937

2 authors:

Yung-Yuan Chen Kuen-Long Lu


National Taipei University Foxtron
73 PUBLICATIONS 384 CITATIONS 27 PUBLICATIONS 132 CITATIONS

SEE PROFILE SEE PROFILE

All content following this page was uploaded by Kuen-Long Lu on 24 November 2019.

The user has requested enhancement of the downloaded file.


ISO 26262 ASIL-Oriented Hardware Design
Framework for Safety-Critical Automotive Systems

Kuen-Long Lu and Yung-Yuan Chen


Department of Electrical Engineering, National Taipei University
SanXia, New Taipei City, Taiwan, R.O.C
s810476101@webmail.ntpu.edu.tw
chenyy@mail.ntpu.edu.tw

Abstract—In this paper, we base on the fault tree analysis ISO-26262 standard adopts the ASILs (Automotive
(FTA) to propose an Automotive Safety Integrity Level (ASIL)- Safety Integrity Levels) to measure if the target system has
oriented hardware design framework for safety-critical achieved the demanded level of safety or not. There are four
automotive systems, where ASIL plays a key component in the ASILs identified by the standard - from A to D where the
ISO 26262 safety standard to measure risk of a specific system ASIL A defines the lowest and ASIL D defines the highest
component. There are two contributions in this framework: safety level. More strict requirements need to be fulfilled if the
FTA-based weak-point analysis and ASIL-oriented fault- higher ASIL is specified.
tolerant design methodologies. The former can rapidly identify In this study, we base on the ISO-26262 functional safety
the weak-points for safety through the fault tree analysis, and
standard to propose an ASIL-oriented hardware design
the latter can effectively introduce the safety mechanisms in the
hardware design to fulfill the requirements of target ASIL. We
framework for safety-critical automotive systems. The
use the autonomous emergency braking (AEB) system to effectiveness of the proposed design framework is
demonstrate the effectiveness of the proposed design framework. demonstrated by taking an autonomous emergency braking
(AEB) system as the case study. The AEB system integrates
Keywords—Fault tree analysis; Safety-critical system; Safety the brake-by-wire system proposed in [4] with automotive
mechanism; ISO 26262; Automotive safety integrity level. radar sensors and speed sensor to implement a front-vehicle
collision avoidance system.
I. INTRODUCTION This paper is organized as follows: related work is
Safety-critical automotive systems like autonomous discussed in the next section. Then in Section III, we introduce
driving systems, advanced driver assistant systems, the proposed ASIL-oriented hardware design framework
autonomous emergency braking systems, and driver-by-wire including fault tree construction, weak-point analysis and
systems require stringent dependability while the systems are fault-tolerant design. Case study demonstration is also
in operation. Therefore, safety and reliability issues must be presented in Section III. Conclusions appear in Section IV.
addressed in the development of safety-critical systems. When
II. RELATED WORK
the vehicle control functions are implemented by electronic
control systems, functional safety issues become so critical In this section, related works which focus on the PMHF
that such systems must be developed with strict safety (Probabilistic Metric for random Hardware Failures, one of
requirements. Furthermore, safety issues must be considered hardware metrics for measuring if demanded ASIL is
with the highest priority during the whole lifecycle of a safety- achieved or not) calculation, FTA and fault-tolerant design for
critical system. To carry out such safety-oriented system safety-critical automotive systems are discussed. In [5], an
development, the functional safety standard, ISO-26262, was accurate and well-explained practical guide to the specific
established [1]. techniques appropriate for PMHF calculation using FTA was
ISO-26262 was first published in 2011 for needs specific proposed. This paper presented a structured and systematic
to the application sector of electrical and/or electronic (E/E) quantitative FTA while showing various techniques for
systems within road vehicles. The primary purpose of this calculating the PMHF considering both single point faults and
standard is to conduct a safety life cycle for the electronic dual point latent faults.
systems. ISO-26262 has been accepted world-wide as the In [6], the author performed the quantitative assessments
technical “state-of-the-art” for safety-critical automotive for ISO-26262 hardware architecture metrics by means of
systems [2]. In ISO-26262, a safety life cycle includes the fault tree analysis. A custom coverage gate was firstly
concept phase, product development phase, and the proposed to represent the diagnostic coverage related to a
production and operation planning phase. During the safety safety mechanism. The advantage of introducing coverage
life cycle, considered issues cover initialization of the product gate is to reduce the complexity of fault tree construction
concept, specification establishment, product design and pre- The author of [7] presented generalized formulas for the
production test. All these issues put emphasis on functional calculation of PMHF in non-redundant and redundant
safety consideration. At the product development phase, V- subsystems using observable parameters, such as the failure
model [3] is adopted as the primary design, verification and rate of a mission function and a safety mechanism, the
validation flow. This phase is further divided into three diagnostic coverages of the primary and secondary safety
different levels: system level, hardware level and software mechanisms and the diagnostic periods of the secondary
level. For these levels, functional safety requirements are safety mechanisms to expand the scope of the application
verified and validated through failure mode and effect analysis according to ISO 26262.
(FMEA), fault tree analysis (FTA) and safety-related metrics. A mixed model based on FTA and Markov chain was
proposed in [8] to evaluate random hardware failures of the

978-1-7281-0142-2/19/$31.00 ©2019 IEEE


whole-redundancy system in ISO 26262. The mixed model exemplify how each step to be performed and the safety
presented in this study tried to solve the problem of calculating mechanisms compliant to be attained.
the PMHF in the whole-redundancy system, whose fault tree
only contains several dynamic logic gates. A. Case Study: an autonomous emergency braking system
All above studies have well applied fault tree analysis
and formulations to calculate the PMHF for automotive
systems with or without fault-tolerant design. However, the
previous approaches do not address how to effectively
conduct the fault-tolerant design to achieve the required ASIL.
Therefore, in this paper we attempt to propose an ISO-26262-
compliant design framework which well utilizes the results of
fault tree analysis to enhance the efficiency of fault-tolerant
design for automotive hardware systems.
III. ASIL-ORIENTED HARDWARE DESIGN FRAMEWORK
Fig. 2. Functional block diagram of AEB system
In this work, we propose an ASIL-oriented hardware
design framework shown in Fig. 1. Fig. 2 shows the functional block diagram of the AEB
system. The primary function of the AEB system is to warn
drivers about emergent situations and autonomously brake the
vehicles to avoid serious collisions if drivers do not react to
the warning massage.
For implementing the warning and autonomous braking
function, radar will continuously monitor the distance
between the subject vehicle and the front vehicle, and provide
the distance information to the central AEB control node.
Fig. 1. ASIL-oriented Hardware Design Framework Once AEB control node is aware that the current distance falls
The goal of this framework is to assist the designer in into the dangerous range and the collision could happen under
creating the system-level hardware architecture to meet the the relative vehicle speed, the AEB control node will firstly
ASIL requirement. The proposed framework consists of four send a warning message (a sound or a flash light) to warn the
steps: driver. If the driver does not react to the warning message and
the situation becomes more severe, then the AEB control node
1) Step 1 – fault tree construction: construct the fault tree will inform the electronic brake units to actively perform the
manually according to the system hardware architecture. vehicle braking in accordance with appropriate braking forces
2) Step 2 – fault tree based weak-point analysis: specify to avoid the serious collision.
the failure rates for each hardware component of the Fig. 3 shows the hardware architecture of the AEB
analyzed system and acquire PMHF through the system. The CAN bus is adopted as in-vehicle communication
quantitative FTA to check if the target value can be backbone. According to designer’s requirements, other
reached. If the demanded ASIL cannot be achieved, advanced in-vehicle communication protocols like FlexRay or
then the quantitative disparity between current system automotive Ethernet can also be employed.
failure probability and the PMHF target value is
calculated. Moreover, the fault tree analysis will list all
MCSs (Minimal Cut Sets). In the meantime, the MCSs,
which cause PMHF target value not fulfilled, are
identified as weak-points in the system hardware
architecture. We rank all identified weak-point
components according to their quantitative disparity
with PMHF target value. Illustration of this step can be
found in Section III sub-section C.
3) Step 3 – ASIL-oriented fault-tolerant design: based
on the analysis results derived in Step 2, this step then
applies the safety mechanisms (SMs) to the identified
weak-point components to reduce their failure
probability. For weak-point components with higher
Fig. 3. Hardware architecture of AEB system
rank, SMs that are more powerful are applied. The
applied safety mechanisms with expected diagnostic The barking function of the AEB system is implemented
coverages can refer to ISO-26262 Part 5 Annex D. with fail-operational consideration. Once any one among four
Detailed explanation and illustration of this step can be EBD nodes (whether the Brake ECU or the EBM, Electrical
found in Section III sub-section D. Brake Module) is diagnosed as failed, the AEB MCU will stop
4) Repeat step 1-3 until the system PMHF conforms to the to send braking force to the failed EBD node. To keep braking
demanded ASIL. efficiency remained, braking forces are redistributed to the
remaining three working EBD nodes, i.e. these three working
To demonstrate the effectiveness of the proposed EBD nodes will take more braking force burden than previous
framework, we employ a safety-critical AEB system to four working EBD nodes situation. Therefore, the AEB
system can tolerate one failed EBD node with the degraded real case, component failure rates can be estimated by
braking performance. application of industry reliability data books and/or estimated
by procedural (enhanced data books incorporating physics of
B. Fault Tree Construction failure elements) such as SN-29500, IEC-62380/61709 and
Fig. 4 illustrates the fault tree constructed from the HDBK-217F, etc. Then, the MCSs and their failure
hardware architecture as shown in Fig. 3. The K-out-of-N probabilities are listed in Table II.
gate reflects the fail-operational design concept. 2/4 (2-out- For the MCSs {Brake ECUi, Brake ECUj}, {EBMi,
of-4) gate for four EBD nodes means that the failure of one EBMj} and {EBMi, Brake ECUj}, i and j are both from 1 to 4
EBD node will not lead to AEB system failure. representing 4 EBD nodes, respectively, where i ≠ j.
The F(t) in (1) represents the failure probability at the
mission time t.
𝐹 𝑡 1 𝑒 (1)
where λ represents the component failure rate. In this
case study, mission time is specified to 5000 hours.
F(t) calculation is according to the following three cases
for MCS:
a. SPF (Single Point Fault): 1 component in the MCS;
b. DPF (Dual Point Fault): 2 components in the MCS;
c. MPF (Multiple Point Fault): ≥ 3 components in the
MCS.
Fig. 4. Constructed fault tree of AEB system
For SPF, F(t) of MCS is directly equal to the component
C. FT-based Weak-point Analysis failure probability; for DPF and MPF, the F(t) of MCS is
Fault tree analysis (FTA) is a well-known and widely calculated by multiplying all component failure probabilities
applied analysis methodology. FTA has been proven to be included in this MCS. F(t) of whole system is calculated by
very effective on system reliability and safety analysis. In summation of failure probabilities of all MCSs. In this case
ISO-26262, the FTA is recommended for analyzing the safety study, the system F(t) is 2.8703E-03.
of system, HW and SW. In this study, FTA is adopted In ISO-26262, system hardware can be declared to fulfill
quantitatively and qualitatively for evaluating the failure the requirements for target ASIL only when the
probability of the system, listing MCSs and calculating the corresponding target values of hardware architecture metrics
failure probability for each MCS. are achieved. The hardware architecture metrics include
C.1 Quantitative Fault Tree Analysis Single Point Fault Metric (SPFM), Latent Fault Metric (LFM)
and Probabilistic Metric for random Hardware Failures
TABLE I. COMPONENT FAILURE RATE (PMHF) [1]. In this study, only the PMFH is adopted as target
to be achieved. However, SPFM and LFM can also be applied
to the proposed design framework. Table III gives the target
values of PMHF for different ASILs.
TABLE III. TARGET VALUES OF PMHF FOR DIFFERENT ASILS

In this case study, we assume that ASIL D is the target to


TABLE II. MCS AND FAILURE PROBABILITY be achieved, i.e. the PMHF target value is 10-8h-1, and the
target failure probability of whole system is 4.99999E-05.
Apparently, there is a non-negligible gap between the
PMHF target value for ASIL D and current PMHF of AEB
system (nearly 60 times larger than the PMHF target value).
Thus, we need to improve the PMHF by introducing the
effective safety mechanisms in the hardware design.
C.2 Gap Quantification and Weak-point Classification
In this study, the “Gap” is defined as follows:
 Gap(Sys): quantitative disparity between whole system
and target failure probabilities
 𝐺𝑎𝑝 𝑆𝑦𝑠 𝐹𝑃 /𝐹𝑃 , where FPsys means
Before performing quantitative FTA, the component failure probability of whole system and FPTarget
failure rates shall be provided. The component failure rates means target value of failure probability with
only for the purpose of demonstration are listed in Table I. In required target ASIL.
 Gap(Sys) ≥ 1: PMHF for target ASIL is not achieved. First of all, the existence of MBP weak-points should be
 Gap(Sys) < 1: PMHF for target ASIL is achieved. checked. All the MBP weak-points need to be protected by
safety mechanisms until we can assure that no more MBP
 Gap(MCS): quantitative disparity between MCS and target weak-point exists. If we apply SMs to a MBP weak-point
failure probabilities which is DPF/MPF, there will be more than one component in
 𝐺𝑎𝑝 𝑀𝐶𝑆 𝐹𝑃 /𝐹𝑃 , where 𝐹𝑃 this MBP weak-point, thus the primary contributor to failure
means failure probability of kth MCS. probability shall be identified. For the case that MBP weak-
 Gap(MCSk) ≥ 1: 𝐹𝑃 must be reduced to be lower point which is DPF or MPF and constituted by hardware
than FPTarget, otherwise the target ASIL will never be redundancy, the failure probability could be primarily
achieved. We call such MCS as must-be-protected contributed by common cause failure. Thus, the mitigation
(MBP) weak-point. measures are adopted to lower the occurring probability of
 Gap(MCSk) < 1: 𝐹𝑃 is reduced on demand. common cause failure.
If Gap(Sys) is still greater than or equal to 1 after all the
Details are described as follows: MBP weak-points are resolved, then the POD weak-points are
 When Gap(Sys) > 1 and all Gap(MCS) < 1, all the next targets to be addressed. The POD weak-point with
MCSs will be ranked according to their highest Gap(MCS) will be selected. There are three cases of
Gap(MCS). MCS with greater Gap(MCS) will applying SM protection:
have higher ranking. Safety mechanism will be
introduced to the component(s) in the MCS with (a). Components in the POD weak-points have been
highest ranking so that the 𝐹𝑃 can be protected by SMs but the DCs are only at medium
reduced. Such MCS is called protected-on- level. In this case, current SMs are replaced by others
demand (POD) weak-point. with high-level DCs.
(b). There is only one component in the POD weak-point,
Table IV shows the weak-point analysis results. i.e. SPF. In this case, this single component will be
TABLE IV. weak-point analysis results for AEB system directly protected with SM.
(c). For this case, one or more components in the POD
weak-point will be protected by SMs.
To avoid the situation that the designers do not specify a
reasonable target ASIL and component failure rates so that the
number of design iterations become very large or even infinite,
the number of iterations needs to be specified an upper bound.
When the number of iterations reaches the upper bound and
the Gap(Sys) is still greater than or equal to 1, the fault-tolerant
design shall be abandoned and a FT-based weak-point
analysis report is generated with all the fault-tolerance-
induced hardware blocks. Designers can base on the provided
information to determine if the target ASIL or the component
failure rates should be revised. Otherwise, we should consider
D. ASIL-oriented fault-tolerant hardware design the redesign of hardware architecture.
Through the FT-based weak-point analysis, we have In the following sub-sections, we will illustrate the
identified the weak-points in the system and quantified the gap effectiveness of the proposed design methodology through the
to target failure probability. According to the Table IV results, AEB system case study.
we propose an ASIL-oriented fault-tolerant hardware design D.1 Iteration 1: apply SM to MBP weak-point
methodology to assist the designer in choosing the appropriate
safety mechanisms for identified weak-points. Fig. 5 depicts TABLE V. SAFETY MECHANISM ADOPTION FOR EACH MBP WEAK-POINT
the flow chart of the design methodology:

The selection of applied safety mechanisms is according


Fig. 5. ASIL-oriented fault-tolerant hardware design methodology to ISO 26262 Part 5 Annex D [1]. In this Annex, Tables D.2-
14 suggests practical safety mechanisms with expected DCs 𝐹𝑃 _ 𝐹𝑃 _ _ _
for various types of components such as processors, sensors 𝐹𝑃 _ _ _ 𝐹𝑃 (3)
and communication buses, etc. Based on these tables, we can
apply appropriate safety mechanisms to those components 𝑤ℎ𝑒𝑟𝑒 𝐹𝑃 _ _ _ 𝐹𝑃 _ _ _
included in the MBP weak-points as shown in Table V.
Fig. 6 shows the revised hardware architecture and Fig. 1 𝑒
7 illustrates the corresponding fault tree according to Table V. 𝐹𝑃 1 𝑒
We note that the β values used in table IV are only for the
purpose of illustration. Presentation of CCF in fault tree can
also be found in [11].
TABLE VI. MCS FAILURE PROBABILITY WITH CCF CONSIDERATION

Fig. 6. Revised hardware architecture of AEB system

The failure probability of current AEB system is


9.19174E-05. The Gap(Sys) is 1.83839 which is still greater
than 1. It means that the target PMHF for ASIL D still cannot
be achieved. Thus, FT-based weak-point analysis is
performed again to identify the weak-point so that the fault-
tolerant design can be employed to further improve the system
failure probability. Table VII shows the analysis results.
TABLE VII. WEAK-POINT ANALYSIS RESULTS (ITERATION 2)

Fig. 7. Revised fault tree of AEB system

D.2 Iteration 2: Apply CCF mitigation to MBP weak-point


In this design iteration, the Common Cause Failure (CCF)
should be addressed because the hardware redundancy is
adopted. To quantify the failure probability of CCF, the failure
rate of CCF must be acquired in advance. IEC-61508 provides
a feasible methodology for this end [9-10]. In IEC-61508 Part
6 Annex D, a β factor is proposed to represent the percentage
of CCF among whole failure rate of certain hardware
component and its redundancy. The β factor can be acquired
by answering about 40 Yes/No questions. These questions are
used to check if the designers use suitable measures to prevent According to Table VII, radars are the only one MBP
CCF from several aspects such as degree of physical weak-point left. Due to the high β factor, the Gap(MCS) for
separation, design diversity and environmental control, etc. If MCS {Radar 1, Radar 2}CCF,β=10% is not effectively reduced as
more questions are answered “Yes”, then the occurring expected. Through this case study, we want to demonstrate
probability of CCF, i.e. β factor will become lower. The β that the hardware redundancy could still fail to provide
factor is ranged from 0.5 % to 10%. The β factor is then used sufficient failure probability reduction if the CCF is not well
to calculate the failure rate of CCF, λC, by the following addressed. To reduce β factor, the CCF mitigation measures
equation (2). shall be adopted. ISO-26262 Part 11 provides effective
𝜆 𝜆 𝛽 (2) measures for CCF mitigation. Typical measures are external
watchdog, physical separation and design diversity, etc. We
Table VI shows the updated MCS and their failure assume that the β factor for radars is reduced to 0.5% after
probabilities with CCF consideration. In table VI, MCSs with applying the CCF mitigation measures. Fig. 8 depicts the
suffix “CCF” and β factor represent that the failure probability, revised fault tree. For space saving, only the changed part is
𝐹𝑃 _ is calculated by following equations: shown.
In this paper, only the PMHF is adopted as the target
metric. Our future goal is to improve the proposed framework
to conform to all the three hardware architecture metrics,
SPFM, LFM and PMHF. This will increase the complexity of
fault-tolerant HW design substantially. Therefore, the
challenge of next work will be how to propose a fault-tolerant
Fig. 8. Revised fault tree of AEB system (only Radar Sub-sys part is shown)
design framework to fulfill the target values of all hardware
architecture metrics and keep the number of design iterations
D.3 Iteration 3: Apply safety mechanism to POD weak-point low enough to make the approach feasible and effective.
The fault-tolerant design now enters the third iteration. Besides, the proposed methodologies are very suitable to be
After applying CCF mitigation measures to radars, Gap(MCS) implemented in an EDA tool. That is another future work.
for MCS{Radar 1, Radar 2}CCF,β=0.5% has been reduced to
7.33620E-02. Consequently, all MCSs are identified as POD
weak-points. However, Gap(Sys)=1.66268 which is still
greater than 1. Thus weak-point analysis for current AEB
system is performed and the results are shown in Table VIII.
TABLE VIII. WEAK-POINT ANALYSIS RESULTS (ITERATION 3)

Fig. 10. Revised fault tree of AEB system (final version)

ACKNOWLEDGMENT
The authors acknowledge the support of MOST
In table VIII, we identify the MCS {EBMi, EBMj} as the academic research project under Contract Number 108-2221-
highest rank. Therefore, we apply appropriate SMs to protect E-305 -004 -MY2.
EBMs. Fig. 9 depicts the revised fault tree. Again, for space
saving, only the changed part is displayed. REFERENCES
[1] ISO/DIS 26262, “ISO-26262 Road Vehicles -- Functional safety”,
International Organization for Standardization, 2018.
[2] Fabio Marchiò et al., “Automotive electronics: Application &
technology megatrends”, 44th ESSDERC, Page(s): 23 - 29, 2014.
[3] John O. Clark et al., “System of Systems Engineering and Family of
Systems Engineering From a Standards, V-Model, and Dual-V Model
Perspective”, 3rd IEEE Systems Conference, Page(s): 381 - 387, 2009.
[4] J.S. Cheon et al., “Brake By Wire Functional Safety Concept Design
Fig. 9. Revised fault tree of AEB system (only EBD node part is shown) for ISO/DIS 26262”, 2011 SAE World Congress, 2011-01-2357, 2011.
D.4 Iteration 4: target ASIL has been achieved [5] N. Das, and W. Taylor, "Quantified fault tree techniques for calculating
hardware fault metrics according to ISO 26262," in IEEE Symp.
Performing quantitative FTA for current AEB system Product Compliance Eng. (ISPCE), pp. 1–8, 2016.
under the fourth design iteration, the analysis results indicate [6] Abraham Cherfi, “Toward an Efficient Generation of ISO 26262
that the failure probability of whole system is 3.59763E-05 Automotive Safety Analyses”, Ecole Doctorale Polytechnique,
Laboratoire PRiSM, UVSQ, July 2015
and the Gap(Sys) = 7.19544E-01 which is lower than 1.
Consequently, the PMHF for target ASIL has been achieved. [7] Atsushi Sakurai, "Generalized formula for the calculation of a
probabilistic metric for random hardware failures in redundant
The final fault tree is shown in Fig. 10. subsystems", ISPCE 2017, pp. 1-5, 2017.
[8] Tong Wang, Xi Chen, Zhikai Cai, Junnan Mi, Xiaomin Lian, "A mixed
IV. CONCLUSION AND FUTURE WORK model to evaluate random hardware failures of whole-redundancy
In this study, an ASIL-oriented hardware design system in ISO 26262 based on fault tree analysis and Markov chain",
framework for safety-critical automotive system is proposed. Proceedings of the Institution of Mechanical Engineers, Part D:
Journal of Automobile Engineering, vol. 233, pp. 890, 2019
With the case study, we have illustrated how to accomplish a
[9] IEC 61508: Functional safety of electrical/electronic/ programmable
hardware design for the AEB system which complies to the electronic safety-related systems, International Electro-technical
requirements of ISO-26262 functional safety standard. Commission IEC, 2010
Through the proposed FT-based weak-point analysis and [10] Stein Hauge et al., “Common Cause Failures in Safety Instrumented
ASIL-oriented fault-tolerant design methodologies, we have Systems”, Report of SINTEF Technology and Society, May 2015
shown that how to effectively apply safety mechanisms to the [11] Vrbanic, I. et al., “Presentation of common cause failures in fault tree
system design so that the demanded ASIL can be achieved. structure of Krsko PSA : an historical overview”. NENE 2003

View publication stats

You might also like