Understanding Availability and Reliability

Reliability is the probability of failure-free system operation over a specified time in a given environment for a given purpose, while availability is the probability that a system, at a point in time, is operational and able to deliver the requested services. Both can be expressed numerically: availability is usually given as the percentage of time the system is up, and reliability through metrics such as probability of failure on demand. Availability takes repair time into account when a system has to be taken out of service. Although the two are closely related, a system with low reliability may still be acceptably available if failures can be repaired quickly and do not damage data. Strategies for achieving reliability are fault avoidance, fault detection and removal before the system goes into service, and fault tolerance.

Uploaded by

Kasinathan Muniandi

Availability and Reliability

Availability and reliability, 2013

Slide 1

Principal dependability properties

Slide 2

Reliability
The probability of failure-free system operation over a specified time in a given environment for a given purpose.

Slide 3

Availability
The probability that a system, at a point in time, will be operational and able to deliver the requested services.

Slide 4

Availability specification
Both reliability and availability attributes can be expressed as numbers.
Availability of 0.999 means that the system is up and running for 99.9% of the time.
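To make an availability figure more tangible, it can be converted into expected downtime. The following is a minimal sketch, not part of the original slides; the function name `downtime_hours` and the 365-day year are assumptions chosen for illustration.

```python
HOURS_PER_YEAR = 365 * 24  # assumption: a non-leap year of 8,760 hours


def downtime_hours(availability: float, period_hours: float = HOURS_PER_YEAR) -> float:
    """Expected hours of unavailability over a period, given an availability in [0, 1]."""
    if not 0.0 <= availability <= 1.0:
        raise ValueError("availability must be between 0 and 1")
    return (1.0 - availability) * period_hours


if __name__ == "__main__":
    # Print the yearly downtime implied by a few illustrative availability levels.
    for level in (0.99, 0.999, 0.9999):
        print(f"availability {level}: about {downtime_hours(level):.2f} hours down per year")
```

Under these assumptions, an availability of 0.999 corresponds to roughly 8.76 hours of downtime per year.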

Slide 5

Reliability specification
A probability of failure on demand (POFOD) of 0.0001 means that, on average, 1 in 10,000 demands for service from a system will fail in some way.

Slide 6

Availability and reliability

Availability and reliability are closely related.
Obviously, if a system is unavailable, it is not delivering the specified system services.

Slide 7

However, it is possible to have systems with low reliability that must be available.
So long as system failures can be repaired quickly and do not damage data, some system failures may not be a problem.

Slide 8

Availability is therefore best considered as a separate attribute reflecting whether or not the system can deliver its services.
Availability takes repair time into account if the system has to be taken out of service to repair faults.
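One common way to show how repair time enters availability is the steady-state relation availability = MTTF / (MTTF + MTTR), where MTTF is the mean time to failure and MTTR is the mean time to repair. This relation and the figures below do not appear in the slides; they are assumptions used here only as an illustrative sketch.

```python
def steady_state_availability(mttf_hours: float, mttr_hours: float) -> float:
    """Availability as the fraction of time spent working rather than under repair."""
    if mttf_hours <= 0 or mttr_hours < 0:
        raise ValueError("MTTF must be positive and MTTR must be non-negative")
    return mttf_hours / (mttf_hours + mttr_hours)


# Hypothetical system A: fails often (every 10 hours) but is repaired in under a minute.
frequent_quick = steady_state_availability(mttf_hours=10, mttr_hours=0.01)

# Hypothetical system B: fails rarely (every 1,000 hours) but takes 10 hours to repair.
rare_slow = steady_state_availability(mttf_hours=1000, mttr_hours=10)

if __name__ == "__main__":
    print(f"A (low reliability, quick repair): {frequent_quick:.4f}")
    print(f"B (high reliability, slow repair): {rare_slow:.4f}")
```

With these made-up figures, the less reliable system A is the more available of the two, which is the situation the slides describe for systems whose failures can be repaired quickly.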

Slide 9

Availability perception
Availability is usually expressed as a percentage of the time that the system is available to deliver services, e.g. 99.9%.

Slide 10

Slide 11

Subjective availability
The number of users affected by the service outage.
Loss of service in the middle of the night is less important for many systems than loss of service during peak usage periods.

Slide 12

The length of the outage.
The longer the outage, the greater the disruption. Several short outages are less likely to be disruptive than one long outage. Long repair times are a particular problem.
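The two factors above can be illustrated with a hypothetical impact measure. The sketch below weights each outage by its length and by the number of users affected ("user-minutes lost"). Both the `Outage` type and this measure are assumptions made for illustration; the slides do not define such a metric.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Outage:
    minutes: float       # length of the outage
    users_affected: int  # users who tried to use the service during the outage


def user_minutes_lost(outages: Iterable[Outage]) -> float:
    """Hypothetical impact measure: outage length weighted by users affected."""
    return sum(o.minutes * o.users_affected for o in outages)


# Two outages of identical length, one at night and one at peak time (made-up figures).
night = [Outage(minutes=30, users_affected=20)]
peak = [Outage(minutes=30, users_affected=5000)]

if __name__ == "__main__":
    print("night outage:", user_minutes_lost(night), "user-minutes")
    print("peak outage:", user_minutes_lost(peak), "user-minutes")
```

Both outages contribute the same 30 minutes to a plain availability percentage, yet the peak-time outage affects far more users. A linear measure like this one does not capture the point that one long outage tends to be more disruptive than several short ones; that would need a non-linear weighting of outage length.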

Slide 13

Reliability metrics
Probability of failure on demand (POFOD)
Probability that a system will not deliver a service correctly when requested.
Used for systems where demands are infrequent and intermittent.
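POFOD can be estimated from observation as the fraction of demands that were not served correctly. The following is a minimal sketch; the function name `estimate_pofod` and the counts are assumptions for illustration.

```python
def estimate_pofod(failed_demands: int, total_demands: int) -> float:
    """Estimate probability of failure on demand as failed demands / total demands."""
    if total_demands <= 0:
        raise ValueError("total_demands must be positive")
    if not 0 <= failed_demands <= total_demands:
        raise ValueError("failed_demands must be between 0 and total_demands")
    return failed_demands / total_demands


# Hypothetical observation: 1 failed demand out of 10,000.
pofod = estimate_pofod(failed_demands=1, total_demands=10_000)

if __name__ == "__main__":
    print(f"estimated POFOD: {pofod}")
```

Because demands on such systems are infrequent, an estimate like this is only as good as the number of demands observed.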

Slide 14

Rate of occurrence of failure (ROCOF)
Number of system failures in a given time period.
Used for transaction processing systems with frequent and regular transactions.
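ROCOF can be computed directly from a failure count and an observation period. The sketch below also derives the mean time to failure (MTTF) as the reciprocal of ROCOF, a standard relation that the slides do not state; the function names and figures are assumptions for illustration.

```python
def rocof(failures: int, observed_hours: float) -> float:
    """Rate of occurrence of failure, in failures per hour of operation."""
    if observed_hours <= 0:
        raise ValueError("observed_hours must be positive")
    if failures < 0:
        raise ValueError("failures must be non-negative")
    return failures / observed_hours


def mttf_from_rocof(rate_per_hour: float) -> float:
    """Mean time to failure in hours, taken as the reciprocal of ROCOF."""
    if rate_per_hour <= 0:
        raise ValueError("rate_per_hour must be positive")
    return 1.0 / rate_per_hour


# Hypothetical observation: 2 failures in 1,000 hours of operation.
rate = rocof(failures=2, observed_hours=1000)
mttf = mttf_from_rocof(rate)

if __name__ == "__main__":
    print(f"ROCOF: {rate} failures per hour, MTTF: about {mttf:.0f} hours")
```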

Slide 15

Fault
A characteristic of a software system that can lead to a system error.

Error
An erroneous system state that can lead to system behavior that is unexpected by system users.

Failure
An event that occurs at some point in time when the system does not deliver a service as expected by its users.

Slide 16

Faults-errors-failures
Fault -> Error -> Failure

Slide 17

Faults and failures

Failures are usually a result of system errors.
The incorrect state causes undesirable system behaviour.
The incorrect state is a consequence of executing faulty code.

Slide 18

However, faults do not necessarily result in system errors.
The erroneous system state resulting from the fault may be transient and corrected before an error arises.
The faulty code may never be executed.
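The distinction between a fault that stays dormant and one that produces an error can be shown with a small, deliberately faulty class. This is a hypothetical sketch: the `Inventory` class and its `bulk` path are invented for illustration and do not come from the slides.

```python
class Inventory:
    """Tracks a stock level that should never become negative (hypothetical example)."""

    def __init__(self, stock: int) -> None:
        self.stock = stock

    def remove(self, quantity: int, bulk: bool = False) -> None:
        if bulk:
            # FAULT (deliberate, for illustration): this path forgets to check
            # that enough stock is available before subtracting.
            self.stock -= quantity
            return
        if quantity > self.stock:
            raise ValueError("not enough stock")
        self.stock -= quantity

    def report(self) -> str:
        return f"{self.stock} items in stock"


# The faulty path is never executed, so the fault causes no error.
safe = Inventory(stock=5)
safe.remove(3)

# The faulty path runs: the state becomes erroneous (negative stock).
broken = Inventory(stock=5)
broken.remove(10, bulk=True)

if __name__ == "__main__":
    print(safe.report())
    # The error turns into a failure once a user sees this unexpected report.
    print(broken.report())
```

The fault is present in both runs, but only the second run executes it, leaves the object in an erroneous state and eventually shows the user an unexpected result.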

Slide 19

Errors do not necessarily lead to system failures.
The error can be corrected by built-in error detection and recovery.
The failure can be protected against by built-in protection facilities. These may, for example, protect system resources from system errors.
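As a rough illustration of built-in error detection and recovery, the sketch below checks each incoming value against a plausible range and, when the check fails, falls back to the last value that passed. The `TemperatureMonitor` class and its limits are assumptions made for this example.

```python
from typing import Optional


class TemperatureMonitor:
    """Hypothetical monitor that detects implausible readings and recovers from them."""

    VALID_RANGE = (-50.0, 150.0)  # assumption: plausible limits in degrees Celsius

    def __init__(self) -> None:
        self.last_good: Optional[float] = None

    def accept(self, reading: float) -> Optional[float]:
        """Return the reading if plausible, otherwise the last plausible reading."""
        low, high = self.VALID_RANGE
        if low <= reading <= high:
            self.last_good = reading
            return reading
        # Error detected: recover by reusing the last known good value
        # (None if no good value has been seen yet).
        return self.last_good


monitor = TemperatureMonitor()
first = monitor.accept(21.5)        # plausible reading, accepted as is
recovered = monitor.accept(9999.0)  # implausible reading, replaced by 21.5

if __name__ == "__main__":
    print(first, recovered)
```

The erroneous reading still occurs, but it is detected and replaced before it can influence the service that users see, so no failure results.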

Slide 20

Reliability achievement
Fault avoidance
Development techniques are used that either minimise the possibility of mistakes or trap mistakes before they result in the introduction of system faults.

Slide 21

Fault detection and removal
Verification and validation techniques are used that increase the probability of detecting and correcting errors before the system goes into service.

Slide 22

Fault tolerance
Run-time techniques are used to ensure that system faults do not result in system errors and/or that system errors do not lead to system failures.
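One well-known run-time technique of this kind is redundancy with majority voting: several independently written components compute the same result and the majority answer is used. The sketch below is a minimal illustration under that assumption; the function `majority_vote` and the three `square_*` components are hypothetical and not taken from the slides.

```python
from collections import Counter
from typing import Callable, Sequence


def majority_vote(components: Sequence[Callable[[int], int]], x: int) -> int:
    """Run every component on x and return the result a strict majority agrees on."""
    results = []
    for component in components:
        try:
            results.append(component(x))
        except Exception:
            # A component that crashes simply contributes no vote.
            continue
    if not results:
        raise RuntimeError("all components failed")
    value, votes = Counter(results).most_common(1)[0]
    if votes * 2 <= len(components):
        raise RuntimeError("no majority among components")
    return value


def square_a(x: int) -> int:
    return x * x


def square_b(x: int) -> int:
    return x ** 2


def square_faulty(x: int) -> int:
    return x * 2  # FAULT (deliberate, for illustration): doubles instead of squaring


result = majority_vote([square_a, square_b, square_faulty], 3)

if __name__ == "__main__":
    print(result)
```

The faulty component still produces a wrong value, but the two correct components outvote it, so the fault does not lead to a system failure.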

Slide 23

Summary
Availability is the probability that a system will be available when a service request is made.
Reliability is the probability that a system will deliver a service as expected by users.

Slide 24

Summary
Software faults lead to state errors, which in turn lead to operational failures.
Fault avoidance, detection and tolerance are strategies for achieving reliability.

Slide 25
