Professional Documents
Culture Documents
ACHIEVING
RELIABILITY
THROUGH
THE DESIGN
PROCESS
Scope 3 This guide offers guidance on how to design and make a reliable
Foreword 4 product (e.g. equipment, system or service). It aims to provide everyone
Introduction to Reliability in Design Principles 5 associated with designing, manufacturing, operating or maintaining
Reliability in Design as a Business Goal 6 products with:
Designing for Reliability 7
Risk Management 9 • Best practice for a cross-industry approach to reliability
Measuring Organisational Reliability Capability 11 • A capability maturity model (CMM) that will allow an organisation
Competence and Training 13 to assess its (and its suppliers’) level of maturity in reliability
Application of Principles for Ensuring Reliability Capability 14 engineering practices
The Key Processes 15
Bibliography 31
The reliability engineering discipline is examined, including definitions,
Appendices 32
explaining the interrelationships of activities and their importance,
Definitions 33
generally and at specific times during development, manufacturing,
IMechE Position Statement on Safety and Reliability 34
Customers want products that work and go on This guide describes good reliability engineering Reliability is a defining characteristic of a product’s It is important to consider reliability at all stages
working for a ‘reasonable’ period of time. If the practice, which embeds these four factors and, if performance and life cycle costs. In the context of of the product life cycle. Reliability characteristics
customer’s expectations are not met they may followed, should ensure that expectations are met. this guide, it is understood that reliability is the should be included as an explicit design
regard the equipment or service as ‘unreliable’. It provides an aide-mémoire to all personnel with ability of an item to perform a required function requirement, as consideration at the early stages
Dissatisfaction through poor reliability is likely to experience in managing reliability, and a primer of under stated conditions, including environment of design contributes most significantly to the
lead to lost business, which would then be difficult the essentials of what to expect for those who have and usage, for a stated time. success or failure of the final product. It becomes
to recover. In today’s highly competitive markets no, or limited, experience. increasingly difficult and costly to influence the
it is therefore worth recognising that it is not The aims of the reliability engineering discipline, as reliability of a product once the detail design
‘reliability’ that sells a product, but the benefits Guidance is given as to formal techniques which part of an integrated team, are to: stage has been reached. For maximum benefit,
of having a reliable product. Benefits of good can establish and maintain confidence that the efforts should start as early in a product’s life cycle
• Specify and agree a product’s reliability
reliability include the need to keep fewer spares, product will perform as designed when required as possible, noting that empirically the 1:10:100
requirements
carry out fewer inspections, reduced maintenance, to, and for as long as necessary. Management rule applies (i.e. at each life cycle stage, the cost
increased availability, more marketable products, commitment to these techniques is vital, as is • Manage the focussed activities throughout the of change increases by an order of magnitude).
lower operating costs and ultimately increased knowledge of the circumstances in which they design of a product that will give confidence It is prudent at this stage to recognise that the
profitability. Factors that have the greatest influence should be applied. that reliability requirements can be met reliability of a product cannot be enhanced by
on reliability are: • Co ordinate and monitor the reliability maintenance, only preserved.
influencing activities within the supply chain
Design – ensuring a product is fit for purpose, Designing for reliability requires:
relative to the required use and operating • Contribute to defining through-life activities for
the product to preserve inherent reliability • Commitment and discipline from management,
environment
the team and the individual
• Risk management – assess, analyse and manage
Manufacture – ensuring a product is made risks using techniques, e.g. failure mode and • An overall integrated approach to the design
to as consistent a standard as appropriate to effects analysis, which either result in: process
economically meet customer requirements • Development, measurement and interpretation
- Acceptance, if risks are found to be tolerable
of requirements
Installation / Integration – assimilating the - Mitigation, actions to eliminate the problem,
product with all other associated products and the reduce the probability of occurrence or minimise • Methods for assurance of achieving reliability
environment the consequences upon occurrence requirements
The requirement for a quality policy and subordinate Asset management is becoming a boardroom topic,
procedures is semi-mandatory as for many and as such a recognition of the fact that taking a
businesses a pre-requisite many customers demand, longer term life cycle cost approach as opposed to Achieving Required
certainly for business-to-business transactions, a shorter term focus on minimising (as opposed to Performance
is certification to a recognised quality assurance optimising) capital expenditure will help to meet
If we return back to our definition of reliability, • Material changes due to global legislation, Here we mention another ISO management of organisational reliability engineering practices.
neither of these statements are time-bound and the e.g. standard, ISO 31000. This standard presents a The core reason for risk management is to uncover
likelihood that something will just work is rather 1. Lead-free solder – joint integrity framework that organisations can use to help potential problems as early as possible and present
absurd. The measurement of reliability can, to manage risk. Do not expect to find organisations the facts in a somehow weighted manner to allow
2. Plating materials (e.g. cadmium no longer
some, be a little abstract as it can involve statistical certified to this standard however, as it is not quality decision making at the right time. Such
allowed)
distributions – suffice to say, at this juncture, that a “certifiable standard”. The principles may be decisions will involve developing mitigating actions,
someone who has a good understanding of how used to audit your own, or other, organisations’ including appropriate contingency planning.
• Emergent technology developments
reliability can be measured and how that relates to
1. Industrial ‘internet of things’ – connectivity, practices for risk management. ISO 31010 presents
the real world should be engaged in helping to define
data collection requirements an overview, and the applicability of, various risk A best practice to be considered is the introduction
targets and the measurement system.
2. Shrinking device architectures management techniques and as such is an excellent of a central risk register at a reasonably early point
primer for individuals who are starting on a journey in product development. This register collects all
Back to the point our fictitious designer made, the 3. Obsolescence
next basic step is defining the window of design. as a risk management practitioner. It is unlikely that of the project and product risks identified through
There are numerous factors to be considered • Serviceability one individual will become an expert in all of the other discrete risk management techniques,
in achieving reliability, some of which could be risk management techniques. providing a master list which in turn can be used
1. Susceptibility to no fault found (NFF) events
inadequately taken into account during design or to drive action, decision or acceptance. This can be
2. Location The reason for the introduction of this standard is considered a list of ‘lessons learnt’ for subsequent
emerge during the life cycle. The life cycle must
be considered to be ‘cradle to grave’, therefore 3. Training and technical skill levels that robust risk management forms the bulk work projects, storing and preserving organisational
both manufacture and dismantling/disposal of 4. Information and documentation
the equipment must be given as much thought 5. Modular design
as operation and maintenance. These influencing
factors include, but are not limited to: • Other issues of note
1. Manufacturing techniques and ‘built-in latent
• Environmental conditions
defects’
1. Temperature – range and cycles
2. Counterfeit components
2. Pressures
3. Owner/operators remote from designer/
3. Humidity manufacturer
4. Arduousness of duty 4. Warranty periods
5. Altitude/Submersed depth
6. Dusts, mists and other environmental Best endeavours should be made during product
contaminants specification to identify all of the aspects that make
up the window of design, to ensure that adequate
• Effects of age and use design considerations are made that will allow the
product to meet expectations, i.e. achieve the set
1. Deterioration of electrical cable insulation
reliability target.
2. Wear
3. Corrosion
4. Fatigue
5. Misuse
Result in a Result in
strategy for achieving its performance goals. LEVEL 3: DEFINED
minor safety minor financial 2 Negligible Low Low Medium Medium
incident losses Management attention can focus on different The level 3 organisation is defined and measured. It
aspects of its business. In general, these can be has key procedures and practices in place to define
grouped into the following broad perspectives: reliability such as:
No concern No concern 1 Negligible Negligible Low Low Low • Manufacture of product • The setting of reliability requirements
• Processes (necessary for manufacturing) • The performance of risk and reliability analysis in
• Preparedness to perform processes design
Once in Once in Greater
Once in Once • Reporting, tracking and analysis of reliability data
Probability a thousand a hundred than once
ten years per year
years or less years per year Knowing this provides a basis for making
judgements on the reliability capability of an However, at this level the processes for the
Risk Assessment Matrix 1 2 3 4 5
organisation. The most capable and sustainable management of risk and reliability improvement
organisations will be those adopting a balanced are limited to immediate project issues. There is
Figure 1 approach, investing effort at all steps through a some learning by individuals exposed to failures and
product’s life cycle. The highest levels of capability performance information, but it is not fully organised
1. Define 5. Engineering Or, more simply, ‘planning’, ‘doing’, and ‘sustaining’. These key processes are summarised in table 1, and
9. Testing 13. First build
requirements specification described in detail in the subsequent sections.
The implementation of reliability through design centres on the deployment of core engineering design
principles supported by the factors determined during the planning phase. This section focuses on key
processes that, when deployed in support of design activities, can help achieve the required reliability level.
Typical activities
Purchaser Supplier
4. Define high-level R&M goals and link 4. Assist in R&M strategy and plans
these to the business risks
Related basic design stages: define requirements, refine requirements and set parameters, generate
concepts, concept appraisal, engineering specification, evaluation, re-design and further testing.
Related basic design stages: refine requirements and set parameters, generate concepts, concept
appraisal, engineering specification, initial design, modelling, prototyping, testing, evaluation, re-design
and further testing, manufacturability review, final design.
1. Conduct or commission analyses 1. Interpret requirements and prepare strategy Typical activities
to deliver
Purchaser Supplier
2. Review supplier outputs 2. Establish an R&M programme
1. Assessment of analyses results 1. Suggest potential opportunities for improvement
in design
3. Revise strategy 3. Produce case reports
2. Development of mitigating actions 2. Advise plans for controlling and minimising risk
4. Review case reports 4. Participation in analyses
during design and development
Related basic design stages: concept appraisal, engineering specification, initial design, modelling, 4. Update risk register
prototyping, testing, evaluation, re-design and further testing, manufacturability review.
Related basic design stages: concept appraisal, initial design, modelling, prototyping, testing, evaluation,
re-design and further testing, manufacturability review, final design, first build, field trials.
The majority of equipment is developed, In the context of a reliability management capability, The purpose of reliability testing is to explore and characteristic life. The data set can be analysed
manufactured, assembled and installed within a project risk management activities will typically validate performance characteristics and failure according to the Weibull, Duane or Crow-AMSAA
project environment. Project goals are generally involve: processes. This is quite distinct from qualification procedures. These techniques are covered in
focused on the delivery of equipment within a testing, where the goal is to confirm that a specified various sources, and the results of the analysis will
• Identification of project tasks
specified time and to a set budget. These goals performance requirement can be met. Reliability enable the ability to meet the reliability goals to be
can be in conflict with the goal of developing • Identification of resources required to testing is carried out to reveal weaknesses, which assessed.
reliable equipment due to the increased cost from successfully achieve the task goal are then corrected, and has several goals:
commissioning studies or purchasing higher • Identification of threats to the successful Reliability testing can also include a number of
• Verification of failure modes (or lack of) identified
specification equipment or materials. execution of each task (e.g. lead time to highly specialised methods, for example accelerated
in FMEA activities
purchase, availability of labour or resources, cost life testing (ALT), highly accelerated life testing
Project risk management provides some assurance pressures from material price fluctuations) • Identification of unpredicted physical failure (HALT), reliability environmental testing (RET),
that equipment reliability will not be compromised modes reliability growth trials (RGT), test automation
• Identification of consequences of each threat (e.g.
in order to meet project goals, or as a minimum catastrophic failure, increased equipment cost) • Model and simulation validation framework (TAF), step stress testing (SST).
where compromises are made the effects of such • Determining the probability of each consequence • Learning about physical failure mechanisms
compromises are understood. Suppliers will be occurring (qualitative or quantitative) where the mechanism is poorly understood Halt and similar techniques must be properly
required to demonstrate a strong project risk • Demonstrating reliability improvements from considered and designed to avoid introducing or
• Assessment of overall risk and underlying risk
management capability to ensure that equipment design changes causing additional non-representative problems,
drivers
reliability as well as project requirements are which can disguise the real issues. Expert advice
• Creation of a risk register to aid control and • Generation of reliability and equipment life data
achieved. from experienced test personnel should be sought.
mitigation
Due to cost and practicality, reliability testing is
• Periodic review to ensure that actions taken are A robust FRACAS (see KP 8) is an essential and
usually conducted with a small sample population
mitigating risk integral part of this activity.
and a limited testing period. Even with limited
results, it is still possible to estimate failure
Typical activities characteristics such as infant mortality, ageing and
Purchaser Supplier
Typical activities
1. Review areas of potential risk based on past 1. Review and advise areas of potential risk,
experiences e.g. new techniques or components
Purchaser Supplier
2. Determine supply chain risk management 2. Update risk register 1. Specify testing requirement 1. Propose testing methods
protocols
2. Validate results 2. Analyse results
3. Document process in a risk management
procedure 3. Agree corrective actions 3. Propose corrective actions
Typical activities 1. Define desired methods and plans for support 1. Confirm understanding of requirements
requirements
Purchaser Supplier
2. Review and agree proposals 2. Ensure design and development plans will meet
1. Agree proposed methods of monitoring and 1. Define methods and techniques to be adopted compliance requirements
control
3. Review corrective actions 3. Produce analyses of results and any recovery 4. Revise plans if required
plans if required
Related basic design stages: define requirements, refine requirements and set parameters, generate
Related basic design stages: testing, evaluation, re-design and further testing, first build, field trials,
post-handover reviews. concepts, concept appraisal, engineering specification, initial design, modelling, prototyping, testing,
evaluation, re-design and further testing, manufacturability review, final design, first build, field trials,
handover, post-handover reviews.
1 FRACAS: failure reporting, analysis and corrective action system; DRACAS: defect reporting, analysis and corrective action system.
2. Monitor associated risk aspects and configuration 2. Advise methods for advising where owners when Purchaser Supplier
control procedures changes affect component interchangeability
1. Develop, or manage development of, build 1. Design for manufacture/maintenance
procedures and quality control plans
3. Review implications for training and 3. Advise implications for training and
documentation documentation, etc.
2. Monitor progress to confirm capability of 2. Early involvement of production engineering
manufacturer
Related basic design stages: define requirements, refine requirements and set parameters, generate
3. Accept standard of manufactured items 3. Implement appropriate systems and
concepts, concept appraisal, engineering specification, initial design, modelling, prototyping, testing,
procedures to ensure smooth transition
evaluation, re-design and further testing, manufacturability review, final design, first build, field trials,
handover, post-handover reviews.
Related basic design stages: concept appraisal, engineering specification, initial design, modelling,
prototyping, testing, evaluation, re-design and further testing, manufacturability review, final design,
first build.
Purchaser Supplier
Typical activities
1. Identify the necessary reliability related skills for 1. Develop and deliver appropriate training for staff
Purchaser Supplier the type of organisation
1. Organise regular progress review sessions to 1. Advise procedures for monitoring and analysis
2. Develop and deliver appropriate training for
analyse and understand progress of accumulating data and results
project staff and advisers
2. Assist as appropriate with any resulting revisions 2. Advise methodologies for feedback of information
required to the programme produced into the programme Related basic design stages: define requirements, refine requirements and set parameters, generate
concepts, concept appraisal, engineering specification, initial design, modelling, prototyping, testing,
3. Update systems and documents with learning as 3. Produce reports accordingly evaluation, re-design and further testing, manufacturability review, final design, first build, field trials,
it arises handover, post-handover reviews.
Related basic design stages: initial design, modelling, prototyping, testing, evaluation, re-design and
further testing, manufacturability review, final design, first build, field trials, handover, post-handover
reviews.
Typical activities
Purchaser Supplier
1. Develop, introduce and monitor programmes 1. Discuss relevant previous performance of similar
to record equipment operational performance equipment and advise information which would
and identify particular effects, e.g. frequency be useful
and environment
2. Inform the company R&D strategy according to 2. Research other sources of useful information
where value from reliability improvement can and debate with purchaser
be gained
Related basic design stages: modelling, prototyping, testing, evaluation, re-design and further testing,
manufacturability review, final design, first build, field trials, post-handover reviews.
NB: this bibliography is just a selection of the many Wong, W.; 2002, How did it Happen? Engineering DEFINITIONS However, for this guide, we accept and work with
books and publications available on the subject, and Safety and Reliability, PEP. ISBN 1860583598 the following definitions:
should not be regarded as definitive. Definitions of reliability
Of the above publications, Carter is out of print but Safety is the visible and demonstrable absence of
Andrews, J. D. and Moss, T. R.; 2002, Reliability and new or used copies are available from, or via Amazon There are numerous and various accepted unacceptable risk of undesirable events affecting
Risk Assessment, Wiley-Blackwell. online store. Cost is generally around £60-£100 each. definitions of reliability. For example, ISO 8402 the health and safety of the workforce and the
ISBN 1860582907 quality vocabulary defines reliability as “the ability public at large (HSE definition).
to perform a stated function under stated conditions
Carter, A.; 1986, Mechanical Reliability (2nd edition), Restricted availability (i.e. not on the ISBN for a stated period of time”. Reliability is the ability of an item to perform a
Macmillan Education. ISBN 0333405870 system) – UK source: required function under stated conditions, including
Reliability is an operational parameter; simply, environment and usage, and for a stated time.
Carter, A.; 1997 Mechanical Reliability and Design, Reliability: a Practitioner’s Guide, Intellect/Relex it is sustained performance. The achievement
Wiley-Blackwell. ISBN 0470237198 Software Corporation, 2003. of reliability results, in the first instance, from a Maintainability is the ability of an item, under
structured, coherent and knowledgeable approach stated conditions of use, to be retained in, or
Davidson, J.; 1994, The Reliability of Mechanical British Standard 5760. Part 0.R Reliability of Systems, to the initial derivation of the level of reliability restored to, a state in which it can perform its
Systems (2nd edition); IMechE Guides for the Process Equipment and Components. Introductory Guide to demanded by the operational requirement. required functions, when maintenance is performed
Industry, Wiley-Blackwell. ISBN 0852988818. Reliability. BSI, 1986; R reissued in 2014. Subsequently, the design and development under stated conditions and using prescribed
of the equipment to meet that need must be procedures and resources.
Hecht, H.; 2003, Systems Reliability & Failure underpinned by a contract strategy that delivers
Prevention, Artech House. ISBN 1580533728 Restricted availability (i.e. not on the ISBN and demonstrates that the requirement has been Availability is the ability of an item (under
system) – US source: satisfied. combined aspects of its reliability, maintainability
Kleyner, A. and O’Connor, P. D. T; 2012, Practical and maintenance support) to perform its required
Reliability Engineering (5th edition), RIAC (reliability information analysis centre) has a Successful performance is an absence of undesired function at a stated instant of time or over a stated
Wiley-Blackwell. ISBN 047097981x. very good range of publications effects. Although reliability failures can be time.
at reasonable prices. measured, the absence of data does not prove that
Moubray, J.; 1999, Reliability Centred Maintenance, the system has achieved its aims. As previously Dependability is the collective term used to
Butterworth-Heinemann. ISBN 0750633581 Reliability toolkit: Commercial Practices Edition stated, when new projects are initiated, there is a describe the availability performance and its
System Reliability Toolkit large number of design, functional and operability influencing factors: reliability performance,
O’Connor, P. D. T.; 2002, Practical Reliability Maintainability Toolkit requirements set by the customer and imposed on maintainability performance and maintenance
Engineering (4th edition), Wiley-Blackwell. Supportability Toolkit the project contractors and system suppliers. So support performance.
ISBN 0470844639 Quality Toolkit how will companies interpret reliability? It is the
Mechanical Applications in Reliability Engineering function of the design and engineering teams to Generally, these and other aspects, referred to
Smith, D. J.; 2011, Reliability, Maintainability & Risk, Applied Reliability Engineering (Vols I and II) provide an engineering solution which best meets collectively as reliability, are core aspects of risk
Butterworth-Heinemann. ISBN 008096902x FRACAS Application Guidelines the technical requirements without compromising management in engineering.
the core business values identified above.
Thompson, G.; 1999, Improving Maintainability and
Reliability Through Design, Wiley-Blackwell. ISBN
1860581358
1 Birdcage Walk
Westminster
London SW1H 9JJ
imeche.org
Issue 3 - March 2019