
Unit IV

Storage and Other I/O


Topics
Introduction
 Storage devices – dependability
 Crashes lead to data loss
 Expandability & diversity of devices
 I/O devices can be characterized by
 Behavior: input, output, storage
 Partner: human or machine
 Data rate: bytes/sec, transfers/sec
 I/O bus connections

I/O System Characteristics
 Dependability is important
 Particularly for storage devices
 Performance measures
 Latency (response time)
 Throughput (bandwidth)
 Desktops & embedded systems
 Mainly interested in response time & diversity of devices
 Servers
 Mainly interested in throughput & expandability of devices

Dependability
 Service accomplishment: service delivered as specified
 Service interruption: deviation from specified service
 Failure: the transition from accomplishment to interruption
 Restoration: the transition from interruption back to accomplishment
 Fault: failure of a component
 May or may not lead to system failure
Dependability Measures
 Reliability: a measure of continuous service accomplishment
 Mean time to failure (MTTF)
 A related term is annual failure rate (AFR): the fraction of devices expected to fail in a year, given the MTTF
 Service interruption
 Measured by mean time to repair (MTTR)
 Mean time between failures
 MTBF = MTTF + MTTR
 Availability
 A measure of service accomplishment with respect to the alternation between accomplishment and interruption
 Availability = MTTF / (MTTF + MTTR)

 Improving Availability
 Increase MTTF: fault avoidance, fault tolerance, fault forecasting
 Reduce MTTR: improved tools and processes for diagnosis and repair
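The measures above can be combined in a short worked example. A minimal sketch in Python, using hypothetical figures (MTTF = 1,000,000 hours, MTTR = 24 hours), not any vendor's quoted numbers:

```python
def availability(mttf_hours, mttr_hours):
    """Availability = MTTF / (MTTF + MTTR) = MTTF / MTBF."""
    return mttf_hours / (mttf_hours + mttr_hours)

def annual_failure_rate(mttf_hours):
    """AFR: expected fraction of devices failing in one year, given MTTF."""
    hours_per_year = 365 * 24  # 8760
    return hours_per_year / mttf_hours

# Hypothetical disk: MTTF = 1,000,000 hours, MTTR = 24 hours
mttf, mttr = 1_000_000, 24
mtbf = mttf + mttr                        # MTBF = MTTF + MTTR
print(f"Availability: {availability(mttf, mttr):.6f}")
print(f"AFR: {annual_failure_rate(mttf):.2%}")
```

With these numbers availability is about 0.999976 and the AFR about 0.88%, illustrating why a high MTTF alone does not tell the whole dependability story.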
Disk Storage
 Nonvolatile, rotating magnetic storage
 Disk surface divided into concentric circles called tracks
 10,000 to 50,000 tracks per surface
 Each track is divided into sectors that contain information
 Each track has 100 to 500 sectors; each sector holds 512 bytes
Disk Sectors and Access
 Each sector records
 Sector ID
 Data (512 bytes; 4096 bytes proposed)
 Error correcting code (ECC)
 Used to hide defects and recording errors; redundant bits are added to detect & correct errors
 Synchronization fields and gaps
 Access to a sector involves
 Queuing delay if other accesses are pending
 Seek: move the heads (average 3ms – 13ms)
 Rotational latency: wait until the desired sector rotates under the read/write head
 Average latency is half a rotation
 For a 5400 rpm disk: 0.5 rotation / (5400/60 rotations per sec) = 0.0056s = 5.6ms
 Data transfer: time to transfer a block of bits (70–125 MB/s)
 Controller overhead: time the controller itself takes to manage the I/O access
Disk Access Example
 Given
 512-byte sector, 15,000rpm, 4ms average seek time, 100MB/s transfer rate, 0.2ms controller overhead, idle disk (no waiting time). Calculate the average read time.
 Average read time
 = seek time + average rotational delay + transfer time + controller overhead
 = 4ms seek time
 + ½ rotation / (15,000/60 rotations per sec) = 2ms rotational latency
 + 0.5KB / 100MB/s = 0.005ms transfer time (512B = 0.5KB)
 + 0.2ms controller delay
 = 6.2ms
 If the actual average seek time is 1ms
 Average read time = 1 + 2.0 + 0.005 + 0.2 = 3.2ms
 Rotational latency is then the largest component
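The calculation above can be sketched as a small function (the parameter values below are the ones given in the example, not general disk constants):

```python
def avg_read_time_ms(seek_ms, rpm, sector_bytes, transfer_MBps, controller_ms):
    """Average read time = seek + rotational delay + transfer + controller."""
    rotations_per_sec = rpm / 60
    rotational_ms = 0.5 / rotations_per_sec * 1000   # half a rotation, on average
    transfer_ms = sector_bytes / (transfer_MBps * 1e6) * 1000
    return seek_ms + rotational_ms + transfer_ms + controller_ms

# Slide parameters: 512B sector, 15,000 rpm, 100 MB/s, 0.2 ms overhead
print(avg_read_time_ms(4.0, 15000, 512, 100, 0.2))   # advertised seek: ~6.2 ms
print(avg_read_time_ms(1.0, 15000, 512, 100, 0.2))   # measured seek: ~3.2 ms
```

The function makes it easy to see which term dominates as the seek time shrinks: with a 1 ms seek, the 2 ms rotational latency is the largest component.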
Disk Performance Issues
 Manufacturers quote average seek time
 Based on all possible seeks
 Locality and OS scheduling lead to smaller actual average seek
times
 Smart disk controllers allocate physical sectors on the disk
 Present a logical sector interface to the host
 SCSI, ATA (Small Computer System Interface, Advanced Technology Attachment): standards for I/O devices
 Disk drives include caches
 Prefetch sectors in anticipation of access
 Avoid seek and rotational delay

Chapter 6 — Storage and Other I/O Topics — 11


Flash Storage
 Nonvolatile semiconductor storage
 100× – 1000× faster than disk
 Smaller, lower power, more robust
 But more $/GB (between disk and DRAM)



Flash Types
 NOR flash: bit cell like a NOR gate
 Random read/write access
 Used for instruction memory in embedded systems
 NAND flash: bit cell like a NAND gate
 Denser (bits/area), but block-at-a-time access
 Cheaper per GB
 Used for USB keys, media storage, …
 Flash bits wear out after 1000s of accesses
 Not suitable as a direct RAM or disk replacement
 Wear leveling: remap data to less used blocks



Hybrid Hard Disks
 Combine ideas of flash & hard disk
 A few GB of flash, used in laptops to boot quickly
 Saves energy: the hard disk can remain idle
 Future: flash drives used in battery-operated devices
Interconnecting Components
 Need interconnections between
 CPU, memory, I/O controllers
 Bus: shared communication channel
 Parallel set of wires for data and synchronization of data transfer
 Can become a bottleneck
 Performance limited by physical factors
 Wire length, number of connections
 More recent alternative: high-speed serial connections
with switches
 Like networks
Bus Types
 Processor-Memory buses
 Short, high speed
 Design is matched to memory organization
 I/O buses
 Longer, allowing multiple connections
 Specified by standards for interoperability
Connect to the processor-memory bus through a bridge
(backplane bus), i.e. not directly connected to memory



Bus transactions: two parts
1. Address recognition
2. Data transfer

Two types of transaction

1. Input (read): get data from an I/O device to memory

2. Output (write): put data to an I/O device from memory

Bus Signals and Synchronization
 Data lines
 Carry address and data
 Multiplexed or separate
 Control lines
 Indicate data type, synchronize transactions
 Synchronous
 Uses a bus clock
 Disadvantages:
 All devices must run off the bus clock
 Due to clock skew, a synchronous bus cannot be long
 Asynchronous
 Uses request/acknowledge control lines for handshaking
 Can include a wide variety of devices, and can be long



I/O Bus Examples

                     Firewire       USB 2.0          PCI Express      Serial ATA   Serial Attached SCSI
Intended use         External       External         Internal         Internal     External
Devices per channel  63             127              1                1            4
Data width           4              2                2/lane           4            4
Peak bandwidth       50MB/s or      0.2MB/s,         250MB/s/lane     300MB/s      300MB/s
                     100MB/s        1.5MB/s, or      (1×, 2×, 4×,
                                    60MB/s           8×, 16×, 32×)
Hot pluggable        Yes            Yes              Depends          Yes          Yes
Max length           4.5m           5m               0.5m             1m           8m
Standard             IEEE 1394      USB Implementers PCI-SIG          SATA-IO      INCITS TC T10
                                    Forum

Firewire – used by digital camcorders – high-speed digital video transfers


Hot pluggable – the device can be plugged into the system at any time and is immediately seen in the device list
Typical x86 PC I/O System
I/O Interfacing with the Processor, Main Memory & OS

 Bus: transfers data between devices
 Other tasks, such as transferring data to user space, are handled by the OS
 The responsibilities of the OS follow from the three characteristics of the I/O system
 Based on these characteristics, the OS provides three types of communication for I/O
 This prevents user programs from directly accessing I/O devices

I/O Management
 I/O is mediated by the OS
 Multiple programs share I/O resources
 Need protection and scheduling
 I/O causes asynchronous interrupts
 Same mechanism as exceptions
 I/O programming is fiddly
 OS provides abstractions to programs



I/O Commands
 Memory-mapped I/O: portions of the address space are
assigned to I/O devices
 I/O devices are managed by I/O controller hardware
 Transfers data to/from device
 Synchronizes operations with software
 Command registers
 Cause device to do something
 Status registers
 Indicate what the device is doing and occurrence of errors
 Done bit
 Error bit
 Data registers
 Write: transfer data to a device
 Read: transfer data from a device



I/O Register Mapping
 Memory mapped I/O
 Registers are addressed in same space as memory
 Address decoder distinguishes between them
 OS uses address translation mechanism to make
them only accessible to kernel
 I/O instructions
 Separate instructions to access I/O registers
 Can only be executed in kernel mode
 Example: x86



Polling
 Periodically check I/O status register
 If device ready, do operation
 If error, take action
 Common in small or low-performance real-time
embedded systems
 Predictable timing
 Low hardware cost
 In other systems, wastes CPU time
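A schematic polling loop, using a simulated device rather than real memory-mapped hardware (the Device class, bit positions, and countdown behavior are all hypothetical stand-ins):

```python
DONE_BIT, ERROR_BIT = 0x1, 0x2   # illustrative status-register bit layout

class Device:
    """Toy device that becomes ready after a fixed number of status reads."""
    def __init__(self, cycles_until_ready):
        self._countdown = cycles_until_ready
    @property
    def status(self):
        self._countdown -= 1
        return DONE_BIT if self._countdown <= 0 else 0

def poll_read(dev, max_polls=1000):
    """Spin on the status register until the done bit is set."""
    for polls in range(1, max_polls + 1):
        s = dev.status
        if s & ERROR_BIT:
            raise IOError("device error")
        if s & DONE_BIT:
            return polls          # CPU time burned = number of polls
    raise TimeoutError("device never became ready")

print(poll_read(Device(cycles_until_ready=5)))   # CPU wasted 5 polls waiting
```

The returned poll count is exactly the CPU work wasted while waiting, which is why polling suits predictable low-cost embedded systems but not general-purpose ones.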



Interrupts
 When a device is ready or error occurs
 Controller interrupts CPU
 Interrupt is like an exception
 But not synchronized to instruction execution
 Can invoke handler between instructions
 Cause information often identifies the interrupting
device(Vectored interrupt)
 Priority interrupts
 Devices needing more urgent attention get higher priority
 An enable bit set to 1 lets a device raise an interrupt; set to 0, no device can interrupt
 A higher-priority interrupt can preempt the handler of a lower-priority interrupt
 Unix – uses four to six levels
 Internally generated exceptions – Higher priority
 I/O exceptions – lower priority



Interrupt Priority Levels (IPL)
I/O Data Transfer
 Polling and interrupt-driven I/O
 CPU transfers data between memory and I/O data
registers
 Time consuming for high-speed devices
 Suited for lower bandwidth devices
 Direct memory access (DMA)
 Interrupt-driven transfer is too time consuming for disks, which move large blocks of data
 OS provides starting address in memory
 I/O controller transfers to/from memory autonomously
 Controller interrupts on completion or error



DMA/VM Interaction
 OS uses virtual addresses for memory
 DMA blocks may not be contiguous in physical
memory
 Should DMA use virtual addresses?
 Would require controller to do translation
 If DMA uses physical addresses
 May need to break transfers into page-sized chunks
 Or chain multiple transfers
 Or allocate contiguous physical pages for DMA
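The page-sized chunking option can be sketched as follows, with a plain dict standing in for the OS page table (the mapping is hypothetical, and 4 KB pages are assumed):

```python
PAGE_SIZE = 4096  # assumed page size

def dma_chunks(vaddr, length, page_table):
    """Split a virtually contiguous DMA buffer into per-page
    (physical_address, chunk_length) pairs for the controller."""
    chunks = []
    while length > 0:
        page, offset = divmod(vaddr, PAGE_SIZE)
        span = min(length, PAGE_SIZE - offset)   # stop at the page boundary
        chunks.append((page_table[page] * PAGE_SIZE + offset, span))
        vaddr += span
        length -= span
    return chunks

# Virtual pages 0 and 1 map to non-contiguous physical frames 7 and 3
pt = {0: 7, 1: 3}
print(dma_chunks(4000, 200, pt))   # [(32672, 96), (12288, 104)]
```

A 200-byte transfer straddling a page boundary becomes two physically disjoint chunks, which is exactly why the controller must either chunk, chain, or be given contiguous physical pages.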



DMA is implemented using a specialized controller: the DMA controller

The DMA controller becomes bus master and directs the transfer


DMA/Cache Interaction
 If DMA writes to a memory block that is cached
 Cached copy becomes stale
 If write-back cache has dirty block, and DMA
reads memory block
 Reads stale data
 Need to ensure cache coherence
 Flush blocks from cache if they will be used for DMA
 Or use non-cacheable memory locations for I/O



Measuring I/O Performance
 I/O performance depends on
 Hardware: CPU, memory, controllers, buses
 Software: operating system, database management
system, application
 Workload: request rates and patterns

 I/O performance also depends on the rate at which the system transfers data
 For I/O, GB is in base 10 (1 GB = 10^9 bytes)
 For memory, GB is in base 2 (1 GB = 2^30 bytes)
 I/O system design can trade off between response time and throughput
 Measurements of throughput are often done with a constrained response time
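The base-10 vs base-2 discrepancy is easy to quantify:

```python
GB_io = 10**9    # I/O and disk vendors: decimal gigabyte
GB_mem = 2**30   # memory: binary gigabyte

# A "1 GB" disk holds about 6.9% less than a 1 GB memory region
shortfall = 1 - GB_io / GB_mem
print(f"{GB_mem - GB_io} bytes difference ({shortfall:.1%})")
```

The gap grows with each prefix: it is roughly 7% at GB and about 10% at TB, so the convention matters when comparing capacities.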
Transaction Processing Benchmarks
 Transactions
Small data accesses to a DBMS

 Interested in I/O rate, not data rate

 I/O rate: performance measured in I/O accesses per unit time (e.g., reads/sec)
 Data rate: performance measured in bytes per unit time (e.g., GB/sec)


 Measure throughput
 Subject to response time limits and failure handling
 ACID (Atomicity, Consistency, Isolation, Durability)
 Overall cost per transaction
 Transaction Processing Performance Council (TPC) benchmarks (www.tpc.org)
 TPC-App: B2B application server and web services
 TPC-C: on-line order entry environment
 TPC-E: on-line transaction processing for a brokerage firm
 TPC-H: decision support — business-oriented ad-hoc queries



File System & Web Benchmarks
 SPEC System File System (SFS) – Standard
Performance Evaluation Corporation
 Synthetic workload for NFS server, based on
monitoring real systems
 Results
 Throughput (operations/sec)
 Response time (average ms/operation)
 SPEC Web Server benchmark
 Measures simultaneous user sessions, subject to
required throughput/session
 Three workloads: Banking, Ecommerce, and Support



Designing an I/O System

Two primary specifications:

1. Latency constraints
2. Bandwidth constraints

Latency measurement
1. In an unloaded system: easy to determine
2. In a loaded system: average latency is hard to determine; may require queueing theory or simulation

Bandwidth
Another typical problem: designers may be given a partially configured I/O system and asked to balance it to maintain maximum bandwidth
I/O vs. CPU Performance
 Amdahl’s Law: speeding up only part of a computation (e.g., running it in parallel on n processors) leaves the rest unchanged, so the overall speedup is limited
 Don’t neglect I/O performance as parallelism increases compute performance
 Example
 Suppose a benchmark takes 90s CPU time and 10s I/O time
 Double the number of CPUs/2 years
 I/O unchanged

 At the end of 6 years how much faster will our program run?
Year CPU time I/O time Elapsed time % I/O time
now 90s 10s 100s 10%
+2 90/2 = 45s 10s 55s 18%
+4 45/2 = 23s 10s 33s 31%
+6 23/2 = 11s 10s 21s 47%

CPU time improves by 90/11 ≈ 8×, but elapsed time only by 100/21 ≈ 4.7×,
and I/O's share of elapsed time grows from 10% to 47%
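The table's numbers follow directly from fixing I/O time while halving CPU time every two years (exact halving is used here, rather than the rounded table entries):

```python
cpu0, io = 90.0, 10.0            # benchmark: 90s CPU time, 10s I/O time
cpu6 = cpu0 / 2**3               # three CPU doublings over six years
elapsed0 = cpu0 + io             # 100 s now
elapsed6 = cpu6 + io             # 11.25 + 10 = 21.25 s at year +6

print(f"CPU speedup:     {cpu0 / cpu6:.1f}x")          # 8.0x
print(f"Overall speedup: {elapsed0 / elapsed6:.1f}x")  # 4.7x
print(f"I/O share:       {io / elapsed6:.0%}")         # 47%
```

The fixed 10 s of I/O caps the overall speedup no matter how many CPUs are added, which is the point of the slide.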
RAID
 Redundant Array of Inexpensive (Independent) Disks
 Use multiple smaller disks (c.f. one large disk)
 Parallelism improves performance
 Plus extra disk(s) for redundant data storage
 Provides fault tolerant storage system
 Especially if failed disks can be “hot swapped”
 RAID 0
 No redundancy (“AID”?)
 Just stripe data over multiple disks
 Stripe: allocation of logically sequential blocks to separate disks to allow higher performance
 But it does improve performance
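Round-robin striping can be sketched as a simple address mapping (block-level striping is assumed here for illustration):

```python
def stripe(logical_block, n_disks):
    """Map a logical block number to (disk, block-within-disk),
    spreading sequential blocks round-robin across the disks."""
    return logical_block % n_disks, logical_block // n_disks

# With 4 disks, blocks 0-3 land on disks 0-3; blocks 4-7 wrap around
print([stripe(b, 4) for b in range(8)])
```

Because consecutive logical blocks sit on different disks, a large sequential read can keep all four disks busy at once, which is where the RAID 0 performance gain comes from.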



RAID 1 & 2
 RAID 1: Mirroring
 N + N disks, replicate data
 Write data to both data disk and mirror disk
 On disk failure, read from mirror
 RAID 2: Error correcting code (ECC)
 N + E disks (e.g., 10 + 4)
 Split data at bit level across N disks
 Generate E-bit ECC
 Too complex, not used in practice



RAID 3: Bit-Interleaved Parity
 N + 1 disks
 Data striped across N disks at byte level
(Protection group)
 Redundant disk stores parity
 Read access
 Read all disks
 Write access
 Generate new parity and update all disks (Every
disk accessed)
 On failure
 Use parity to reconstruct missing data
 Suited to applications requiring large data sets, e.g. multimedia
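The parity scheme can be illustrated with byte-wise XOR: if any one data stripe is lost, it can be rebuilt from the survivors plus parity (the byte strings below are toy data, not real sector contents):

```python
from functools import reduce

def parity(stripes):
    """XOR the corresponding bytes of each stripe to get the parity stripe."""
    return bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*stripes))

data = [b"\x0f\x10", b"\xf0\x01", b"\xaa\x55"]   # three data disks
p = parity(data)

# Disk 1 fails: XOR the surviving data disks with the parity disk
rebuilt = parity([data[0], data[2], p])
print(rebuilt == data[1])   # True
```

Reconstruction uses the same XOR operation as parity generation, which keeps the redundant-disk hardware simple.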



RAID 4: Block-Interleaved Parity
 N + 1 disks
 Data striped across N disks at block level
 Redundant disk stores parity for a group of blocks
 Read access
 Read only the disk holding the required block “Small reads”
 Write access
 Just read disk containing modified block, and parity disk
 Calculate new parity, update data disk and parity disk “Small
writes”
 On failure
 Use parity to reconstruct missing data
 Not widely used
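The "small write" shortcut follows from XOR algebra: P_new = P_old ⊕ D_old ⊕ D_new, so only the modified data disk and the parity disk need to be read and written. A toy check:

```python
from functools import reduce

def xor_parity(blocks):
    """Full parity: XOR of all data blocks."""
    return bytes(reduce(lambda a, b: a ^ b, col) for col in zip(*blocks))

def small_write_parity(old_parity, old_data, new_data):
    """P_new = P_old XOR D_old XOR D_new: two reads + two writes,
    without touching the other N-1 data disks."""
    return bytes(p ^ o ^ n for p, o, n in zip(old_parity, old_data, new_data))

d = [b"\x01", b"\x02", b"\x04"]
p_old = xor_parity(d)
p_new = small_write_parity(p_old, d[1], b"\x0a")       # update disk 1
print(p_new == xor_parity([d[0], b"\x0a", d[2]]))      # True
```

The shortcut works because XORing out the old data and XORing in the new data cancels exactly one term of the parity sum.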



RAID 3 vs RAID 4



RAID 5: Distributed Parity
 N + 1 disks
 Like RAID 4, but parity blocks distributed
across disks
 Avoids parity disk being a bottleneck
 Widely used – parity is spread across all disks
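One possible rotation of the parity block can be sketched as below; actual placements vary by implementation, so this is an illustrative scheme rather than the layout any particular controller uses:

```python
def parity_disk(stripe, n_disks):
    """Rotate the parity block backwards through the array,
    one disk per stripe, so no single disk holds all parity."""
    return (n_disks - 1 - stripe) % n_disks

# With 4 disks, stripes 0..3 put parity on disks 3, 2, 1, 0
print([parity_disk(s, 4) for s in range(4)])   # [3, 2, 1, 0]
```

Since parity must be updated on every write, rotating it spreads that write traffic evenly and removes the RAID 4 parity-disk bottleneck.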



RAID 6: P + Q Redundancy
 N + 2 disks
 Like RAID 5, but two lots of parity (Another
check)
 Greater fault tolerance through more

redundancy
 Multiple RAID
 More advanced systems give similar fault
tolerance with better performance



RAID Summary
 RAID can improve performance and
availability
 High availability requires hot swapping
 Assumes independent disk failures
 Too bad if the building burns down!
 See “Hard Disk Performance, Quality and
Reliability”
 http://www.pcguide.com/ref/hdd/perf/index.htm

