Professional Documents
Culture Documents
OPT-2T01
9899_06_2004_X
Morning Schedule
9:00am10:30am
10:30am10:45am
Break
10:50am12:30pm
12:30pm1:30pm
OPT-2T01
9899_06_2004_X
Lunch
Afternoon Schedule
1:45pm3:30pm
3:30pm3:45pm
3:50pm6:00pm
Break
Storage Network Troubleshooting
Required Tools
Required Technical Skill Sets
Storage Network Architecture
Design Practices
FC Network Designs
IP SANs
SAN Extension
Implementation and Management
OPT-2T01
9899_06_2004_X
Associated Sessions
OPT-1051 Introduction to Storage Topologies
and Applications
OPT-2051 Fibre Channel Storage Area Network Design
OPT-2052 FCIP Design and Implementation
OPT-2053 iSCSI Design and Implementation
OPT-2054 Storage Networking Security
OPT-3051 Troubleshooting MDS9000 Fibre Channel SAN
OPT-3052 Troubleshooting MDS9000 IP Storage
Area SAN
OPT-4051 Design and Architecture of Storage
Networking Platforms
OPT-4052 Case Study: Cisco IT Storage Strategy
OPT-2T01
9899_06_2004_X
Reference Materials
Cisco Storage Networking
www.cisco.com/go/storagenetworking
IETFIP Storage
www.ietf.org/html.charters/ips-charter.html
INTRODUCTION TO STORAGE
AREA NETWORKING
OPT-2T01
9899_06_2004_X
Section Agenda
Storage Terms and Acronyms
Storage Networking Devices
Storage Networking Applications
Storage Networking Topologies
Introduction to Storage Protocols
OPT-2T01
9899_06_2004_X
STORAGE TERMS
AND ACRONYMS
OPT-2T01
9899_06_2004_X
Technologies Overview
(or Storage in a Nutshell)
Servers and
Mainframes
Storage Area
Network (SAN)
Technologies
SAN
Protocols
Databases
IP
CLOUD
Backup
Apps
SAN
Applications
iSCSI
Storage
Virtualization
JBODs
and NAS
iSCSI Drivers
Support
Center
RAID &
VirtualRAID
Mirroring
FSPF
IP
CLOUD
iSCSI
om
ll H
Ca
Embedded
Management
FC
HA
Virtual SAN
Enhanced
Fibre Channel
TAPE
FCIP
Generic
Fibre Channel
SAN
FC Switch
OPT-2T01
9899_06_2004_X
IP
CLOUD
FC Switch
10
RAID Levels
OPT-2T01
9899_06_2004_X
RAID Level
Description
Min Disks
Striping/Concatenation
Mirror
0+1
1+0
Hamming Code
N/A
N/A
N/A
3 without log 4
with log
11
Terminology
Direct Attached Storage (DAS)
Block level I/O
Can be internal or external
Typically SCSI or FC
Limited scalability
High cost due to management
OPT-2T01
9899_06_2004_X
12
Terminology
Network Attached Storage (NAS)
File level I/O
Used for file sharing
applications
IP
IP-based
Deployed over existing
low-cost Ethernet networks
Redundant links
NAS
NAS
NAS
Scalable
Multiple servers can share
same file system
OPT-2T01
9899_06_2004_X
13
Terminology
Storage Area Network (SAN)
Block level I/O
Deployed as separate
network
Servers share storage
subsystem
Scalable
Multiple paths for high
availability
OPT-2T01
9899_06_2004_X
14
STORAGE
NETWORKING DEVICES
OPT-2T01
9899_06_2004_X
15
SAN Components
Host Bus Adapter (HBA)
Interface between host and storage
Supports copper or optical
Typically one port; Can be multiple ports
1Gb, 2Gb and 4Gb
OPT-2T01
9899_06_2004_X
16
SAN Components
Fabric Switch
1Gb, 2Gb, and 4Gb
8-40 ports
Low latency
Can be copper or optical
OPT-2T01
9899_06_2004_X
17
SAN Components
Director Class Switch
1Gb, 2Gb, 4Gb and 10Gb
FC and FICON
256 ports and growing
Low latency
Can be copper
or optical
Multi-service
platforms
OPT-2T01
9899_06_2004_X
18
SAN Components
JBOD
Just a bunch of disks
Limited scalability
Typically 2 FC ports
SCSI or FC disks
Basic controllers
No caches
OPT-2T01
9899_06_2004_X
19
SAN Components
Storage Arrays
36GB to many TB
Typically 2 to many interfaces
Subsystems may mix interfaces
ESCON/FICON, SCSI, FC, or iSCSI
SCSI or FC disks
iSCSI
Intelligent controllers
Large caches
OPT-2T01
9899_06_2004_X
20
SAN Components
Tape Arrays
Tape speed vary 5MBs30MBs+
Capacity vary 20GB300GB+
Deployed in servers or external
libraries
SCSI, FC, Ethernet interface
DLT most common; LTO
gaining traction
OPT-2T01
9899_06_2004_X
21
STORAGE NETWORKING
APPLICATIONS
OPT-2T01
9899_06_2004_X
22
IT Storage Requirements
Scalability
Meet high growth demand for storage capacity (>80% per year)
Increase capacity utilization rates
Availability
Share data across distributed data centers via fast speed,
long distance connectivity links
Provide effective disaster recovery
Improve interoperability across heterogeneous equipment
Enhance security
Manageability
Automate storage management functions
Provide cross-vendor management tools
Managing heterogeneous environments
OPT-2T01
9899_06_2004_X
23
Homogenous Infrastructure
Isolated Islands
iSCSI
SAN
Convenient extension
of existing FC SAN to
IP-attached servers
Extensive IP services
for NAS environments
DAS
NAS
Starting Point
OPT-2T01
9899_06_2004_X
24
Business continuance
Unified management
Remote backup
FCIP
FCIP
Optical
FCIP
Lower-cost option for
asynch replication and
backup consolidation
OPT-2T01
9899_06_2004_X
Present Trend
25
Intelligent SAN
Intelligent services
into the network
Common
management
framework
Content, file, and
block awareness
Storage
Routing
Content
Delivery
Storage
Switching
Transport
independent
Host
Awareness
OPT-2T01
9899_06_2004_X
Storage
Virtualization
Storage
Management
26
STORAGE NETWORKING
TOPOLOGIES
OPT-2T01
9899_06_2004_X
27
SCSI
Parallel implementation
Bus width: 8, 16 bits
SCSI Adapter
Target
2004 Cisco Systems, Inc. All rights reserved.
28
Initiator
SCSI
Fibre
Channel HBA
Fibre Channel
Fabric
Target
29
IP access to open
storage sub-systems
iSCSI driver is loaded
onto hosts on ethernet
network
Able to consolidate servers
via iSCSI onto existing
storage arrays
Able to build ethernet-based
SANs using iSCSI arrays
Storage assigned by iSCSI
instance
OPT-2T01
9899_06_2004_X
iSCSI
iSCSI
iSCSI
iSCSI
Array iSCSI
(Target)
IP
Network
iSCSI
Router
FC
Fabric
FC HBA
Attached
Host
(Initiator)
Storage
Pool (Target)
2004 Cisco Systems, Inc. All rights reserved.
30
Backup
Server
Servers
FC
SAN
FC
SAN
Storage
Existing IP
FC Switch
FCIP
Gateway
EMC SRDF
Network
LAN/WAN/MAN
Production Site
Production
OPT-2T01
9899_06_2004_X
FCIP
Gateway
FC Switch
Storage
Backup, R&D,
Shared Storage,
Standby
Data Warehousing, Etc.
31
IP Network
SI
C
iS
Storage
Router
iS
C
SI
FC SAN
Storage
Router
FC SAN
FCIP
FCIP
Gateway
OPT-2T01
9899_06_2004_X
FCIP
Gateway
32
INTRODUCTION TO
STORAGE PROTOCOLS
OPT-2T01
9899_06_2004_X
33
Introducing SCSI
SCSI = Small Computer System Interface
SCSI is a standard that defines an interface
between an initiator (usually a computer) and
a target (usually a storage device such as a
hard disk)
INTERFACE refers to connectors, cables,
electrical signals, optical signals and the
command protocol that allow initiators
and targets to communicate
OPT-2T01
9899_06_2004_X
34
SCSI Example
In this Case, a
File is Being
Written to the
Hard Drive By an
Application on
the Workstation
Target 1
Initiator
SCSI Connector
Target 2
SCSI Cable
Disk
Tape
Sun
Sun
SCSI Command
OPT-2T01
9899_06_2004_X
35
36
Standards
SCSI has evolved since it was introduced as
SASI in 1979 by Shugart Associatesit was
approved as a standard by ANSI in 1986 and
is now referred to as SCSI-1
SCSI-2 was approved by X3 in 1990 and by
ANSI in 1994
SCSI-3 refers to a collection of standards, each
of which defines a very specific part of SCSI:
physical interface, transport interface, command
interface, architecture model, programming
interface, etc.
OPT-2T01
9899_06_2004_X
37
Initiator
Sun
Target 1
Sun
Target 2
OPT-2T01
9899_06_2004_X
38
Target 1
Target 2
Sun
Sun
39
CAM
SBC
ASPI
SSC
SES
Generic
More
SPC-2 / SPC-3
ATAPI
OPT-2T01
9899_06_2004_X
FCP
SBP
FC-xx
1394
SPI-x
40
OPT-2T01
9899_06_2004_X
41
OPT-2T01
9899_06_2004_X
42
43
OPT-2T01
9899_06_2004_X
44
45
OPT-2T01
9899_06_2004_X
46
47
48
Lets Expand on
this Portion
49
SCSI Protocol
FCP
Parallel Bus
FibreChannel
iSCSI
iFCP
FCIP
TCP
IP
Ethernet
OPT-2T01
9899_06_2004_X
50
SCSI Protocol
FCP
Parallel Bus
FibreChannel
iSCSI
iFCP
FCIP
TCP
IP
Ethernet
OPT-2T01
9899_06_2004_X
Todays In-Depth
Protocol
Discussions
51
STORAGE PROTOCOLS
IN-DEPTH
OPT-2T01
9899_06_2004_X
52
Section Agenda
Introduction to Standards
SCSI Protocol
Fibre Channel Protocol
Internet SCSI (iSCSI)
Fibre Channel over IP (FCIP)
Internet Fibre Channel Protocol (iFCP)
iSNS and SLP
OPT-2T01
9899_06_2004_X
53
INTRODUCTION TO STANDARDS
OPT-2T01
9899_06_2004_X
54
C
J11
C++
J16
OPT-2T01
9899_06_2004_X
Techincal Committee on
Lower-Level Interfaces
(T10)
Information Technology
Industry Council
(ITI)
Techincal Committee on
Device-Level Interfaces
(T11)
Techincal Committee on AT
Attachment Interfaces
(T13)
www.t10.org
www.t11.org
www.t13.org
SCSI
Fibre Channel
HIPPI
IPI
ATA (IDE)
ATAPI
55
Standards Process
Technical Committees (T10) write drafts
Drafts are sent to INCITS for approval
Once approved by INCITS, drafts become standards
and are published by ANSI
ANSI promotes american national standards to ISO
as a Joint Technical Committee member (JTC-1)
OPT-2T01
9899_06_2004_X
56
Transport AreaHas 23
WGs, One which Is the IP
Storage WG
IEFT Is the Organization
Ratifying the IPS Standards
OPT-2T01
9899_06_2004_X
IESG
Internet Engineering
Steering Group
IETF
Internet Engineering
Task Force
Transport Area
57
OPT-2T01
9899_06_2004_X
58
59
OPT-2T01
9899_06_2004_X
60
Circuit/packet
Multiple protocol
support
Highly reliable
Destination paced
Buffer credits
Scalable
High availability
Shared media
Transport flexibility
Dedicated connClass 1
MultiplexedClass 2
DatagramClass 3
Configuration flexibility
Switch
Loop
OPT-2T01
9899_06_2004_X
61
What Is It?
Channels
Networks
Connection service
Connectionless
Physical circuits
Logical circuits
Reliable transfers
Unreliable transfers
High speed
High connectivity
Low latency
Higher latency
Short distance
Longer distance
Hardware intense
Software intense
OPT-2T01
9899_06_2004_X
62
What Is It?
Fibre Channel
Channels
Networks
Connection service
Physical circuits
Reliable transfers
High speed
Low latency
Short distance
Hardware intense
Fibre Channel
Circuit and packet
switched
Connectionless
Logical circuits
Unreliable transfers
High connectivity
Higher latency
Longer distance
Software intense
Reliable transfers
High data integrity
High data rates
Low latency
High connectivity
Long distance
OPT-2T01
9899_06_2004_X
63
Cluster
HIPPI
370
OEM
FC-4s
SCSI
IP
ATM
FC-3
Common Services
FC-2
Signaling Protocol
FC-1
Transmission Code
FC-0
Physical Interface
FC-PH
N_Port
OPT-2T01
9899_06_2004_X
F_Port
64
65
Point to point
L
L
L
Arbitrated loop
L
N
F
Switched fabric
FC
F
N
OPT-2T01
9899_06_2004_X
F
F
N
66
Point to Point
Dedicated connection between N port Fibre
Channel devices
All link bandwidth is dedicated to communication
between the two nodes
Suitable for small scale scenarios when storage
devices are dedicated to file servers
N
N
N
OPT-2T01
9899_06_2004_X
67
L
L
FC
68
Data Integrity
Upper Level Protocol
OPT-2T01
9899_06_2004_X
Operation Control
and Byte Counts
Signaling Protocol
Operation
Frame counts
CRC (32 bit)
Frame delimiters
Transmission Code
8b/10b Code
Physical Media
Fibre Reliability
69
Flow Control
Back pressure technique
Frame credit
Established by receiver during LOGIN
Transmitter
Must have credit to transmit
Receiver
Reinstates credit with ACK
OPT-2T01
9899_06_2004_X
70
FIBRE CHANNEL
COMMUNICATIONS MODEL
OPT-2T01
9899_06_2004_X
71
The Model
The Fibre Channel communications model
is based on the definition of:
Physical objects
Protocol construct
72
Physical
The fundamental physical objects in Fibre
Channel are:
Ports
Link
Nodes
Fabric
OPT-2T01
9899_06_2004_X
73
74
N_Port
Host / Device
Host/ Device
Interface
N_Port
Serial Data Out
OPT-2T01
9899_06_2004_X
Serial Data In
75
Link
A link consists of
2 unidirectional fibers transmitting in opposite directions
May be either:
Optical fiber
Copper
Mbytes/sec
106.25
265.5
100
25
76
Link
Host / Device
Host/ Device
interface
N_Port
Serial Data Out
Serial Data In
Link
OPT-2T01
9899_06_2004_X
77
Node
The equipment which contains one or more
N_Port or NL_Port (topology dependent)
May be
Computer
Controller
Device
Is NOT a switch fabric
OPT-2T01
9899_06_2004_X
78
Node
Controller
N_Port
N_Port
Link
OPT-2T01
9899_06_2004_X
N_Port
Link
N_Port
Link
Link
79
Communications Model
Point to point
N_Port to N_Port
Flow control
Acknowledged
Node
Node
N_Port
N_Port
Transmitter
Receiver
Receiver
Transmitter
Transmitter
Link
OPT-2T01
9899_06_2004_X
80
Fabric
Fabric
The entity which interconnects N_Ports
Provides routing based on destination address
Fabric may be:
Point to pointNo routing required
SwitchedRouting provided by switch
Arbitrated loopRouting is distributed throughout
attached L_Ports
OPT-2T01
9899_06_2004_X
81
Terms
Topology
The physical structure of the interconnect of ports
Defines the logical behavior of transactions
Fibre channel has 3 topologies
Pt to Pt
Switched
Arbitrated loop
Fabric
The fabric is the generic item that interconnects nodes
A fabric is made of Fibre Channel topologies like Pt to Pt,
switches and loops
OPT-2T01
9899_06_2004_X
82
Point to Point
Fabric
Node
Node
N_Port
N_Port
Transmitter
Receiver
Receiver
Transmitter
Communications Model
Source to destination
Based on address routing through the fabric
OPT-2T01
9899_06_2004_X
83
Switched Fabric
N_Port
N_Port
N_Port
N_Port
N_Port
N_Port
Switch
Fabric
Communications ModelSource to Destination Based on
Address Routing through the Fabric
A
OPT-2T01
9899_06_2004_X
B
84
Arbitrated Loop
NL_Node
A
Link
NL_Node
B
B
85
FC PROTOCOL CONSTRUCTS
OPT-2T01
9899_06_2004_X
86
OPT-2T01
9899_06_2004_X
87
Construct Introduction
FC-2 defines these constructs that allow the related
information to be:
Grouped together
Coordinated
Handled in an efficient manner
Also defined are means for the Upper Level Protocols ULPs
to communicate with FC-2:
Information Units (IU)
88
Chunks
The ULPs deal with chunks of data that are
moved across the network
These chunks of data may be either
Control
Status
Real data
OPT-2T01
9899_06_2004_X
89
Frames
Frame
FC-2 layer will take this
chunk of data and move
it from
Transmitting node to
receiving node In the units
of what Fibre Channel calls
frames
Frame Size
FC-3
Common Services
FC-2
Signaling Protocol
FC-1
Transmission Code
FC-0
Physical Interface
90
Frame Structure
General FC-2 Frame Format
Frame Format
SOF
Idles
24*
Frame
Header
24
OPT-2T01
9899_06_2004_X
Data Field
CRC
0-2114
EOF
Idles
Bytes
91
Frame Header
Word
2
4
1
2
TYPE 8 bits
Data structure
4
5
2
3
R_CTL Routing
CS_CTL 8 bits
Class Spec
OPT-2T01
9899_06_2004_X
3
1
SEQ_ID 8 bits
1
6
1
5
Frame Control
92
Data Field
Data Field 0-2114
0 - 64
0 - 2112
1-3
Optional Headers
Payload
F
I
L
L
OPT-2T01
9899_06_2004_X
93
Sequence
Sequences
Each chunk of Upper Level Protocol (ULP) data is
moved within the envelope of what Fibre Channel
calls a Sequence (SEQ)
A sequence consists of a set of related frames
As expected there are lots of rules governing sequences
94
Sequence
Sequence Initiator (SI)
Fabric
SI N_Port
Data
Chunk
Da
ta
Fr
am
SR N_Port
Data
Chunk
OPT-2T01
9899_06_2004_X
95
Sequence
Initiator (SI)
Read Command
(Chunk)
Fabric
Data F
rame
Target
(SR)
Sequence
(SI)
Data Frame
Data
(Chunk)
(SR)
Sequence
Data Frame
Status
Sequence
OPT-2T01
9899_06_2004_X
96
Sequence Identifier
Sequence initiator assigns an identifier to each
sequence
This identifier is called the Sequence_Identifier or Seq_ID
The Seq_ID uniquely identifies a given sequence within
the context of the operation
Each frame is identified within this operation by Seq_ID
and Seq_CNT
OPT-2T01
9899_06_2004_X
97
98
Responder (SR)
First Data_Frame
SOF Received
ACK to first Frame
Active
Open
Active
&
Open
Frame with End_SEQ set
99
Sequence Streaming
Sequence streaming is the ability to
Begin transmission of the next sequence while one or more
previous sequences are OPEN
(This Allows for More Data in the Pipe for Distant Connections)
OPT-2T01
9899_06_2004_X
100
Exchange
Upper level protocols frequently deal with related
bits of data as:
Request/reply
Command/data/status
101
Exchange
Initiator (SI)
Read Command
(Chunk)
Fabric
Data F
rame
Target
(SR)
Sequence
(SI)
Exchange
Data Frame
Data
(Chunk)
(SR)
Sequence
Data Frame
Status
Sequence
OPT-2T01
9899_06_2004_X
102
Exchange
Exchange originator
The N_Port which transmitted the FIRST data frame for
this exchange
Exchange responder
The N_Port which is the destination of the FIRST data
frame of this exchange
The designation for the originator and responder are
fixed for the duration of the exchange
Unlike the SI and SR Which Change Roles Within the Exchange
OPT-2T01
9899_06_2004_X
103
104
Information Unit
Upper Level Protocols (ULPs) know about
Information Units (IUs) but know nothing about:
Frames
Sequences
Exchanges
105
Information Unit
The IU contains information sets with such items
as LUN, task attributes, CDB and the command
byte count
The IUs are used in protocol mapping from FC-4
to FC-2 and are assigned an identifier that is useful
to humans not used by the machine
All the information needed to support a ULP is
formed into a IU table and is listed as a first ,
middle or last IU in the exchange
We Will See More of these Tables when We
Cover SCSI Mapping onto Fibre Channel
OPT-2T01
9899_06_2004_X
106
FC-2 Hierarchy
The Hierarchy of Constructs
Construct
Frame Fields
Exchange
Information
Unit
Sequence
Frame
OPT-2T01
9899_06_2004_X
Meaning
OX_ID / RX_ID
SEQ_ID
SEQ_CNT
107
FC-2 Hierarchy
Frame Fields
EXCHANGE
SEQ_ID
SEQ_CNT
SEQUENCE
Frame
Frame
Information Unit
OPT-2T01
9899_06_2004_X
...
...
Frame
OPT-2T01
9899_06_2004_X
109
Transmission Code
Fibre Channel uses a 8b/10b transmission code
Each 8 bit data byte to be transmitted is converted into a
10 bit quantity
The 10 bit quantity is then transmitted over the FC media
The 10 bit quantity is then converted back to the 8 bit data
byte by the receiving node
110
8b/10b Code
Why 8b/10b
1. To ensure the sufficient transitions are present
in the serial bit stream to make clock recovery
possible at the receiver
2. Increase the likelihood of detecting any single
or multiple bit errors
3. To provide special characters with distinctive
and easily recognizable characters to achieve
word alignment on the incoming bit stream
OPT-2T01
9899_06_2004_X
111
8b/10b Code
Characteristics of 8b/10b
The 10 bit transmission code
Supports all 256 values of the 8 bit data byte
Contains unused code points
Illegal codes(called code violations)
Detection of code violations
May occur on the transmission character in which the error
occurred or may be detected on a subsequent character
Contains special characters
Running disparity with DC balance
112
8b/10b Code
Running Disparity
Disparity:
113
8b/10b Code
Code Notation
Each valid transmission character has been
assigned a name in the form of:
Zxx.y
Z = K or D
OPT-2T01
9899_06_2004_X
114
Conversion Table
Transmission Order
MSB
LSB
FC-2 Bits
Example
D1.0
D or K
0
0
1
0
1
0
1
0
0
1
1
FC-1 Transmission
Character
115
Special Characters
K28.5 only special character used in Fibre
Channel out of the 12 set aside
Has no 8 bit representation
The only FC transmission character with 5
consecutive 1s or 0s
Used to find word boundaries and sync
Used in ordered sets
OPT-2T01
9899_06_2004_X
116
Transmission
Transmission word consists of 4 continuous
transmission characters treated as a unit
40 bits long
Aligned on a word boundary
There is a ordered set and a data word
Transmission Order
Byte
Data Word
K28.5
Encoded Data
Byte
Encoded Data
Byte
Encoded Data
Byte
Encoded Data
Byte
Encoded Data
Byte
Encoded Data
Byte
Encoded Data
Byte
Ordered Set
OPT-2T01
9899_06_2004_X
117
Ordered Set
Transmission word starting with the K28.5 special
character
Three classifications of ordered sets are defined
Delimiters
Primitive signals
Primitive sequences
MSB
K28.5
LSB
Dxx.y
Dxx.y
Dxx.y
118
Primitive Signals
Primitive signals are ordered sets
Transmission of primitive signals are interrupted
occasionally to transmit frames
OPT-2T01
9899_06_2004_X
119
Delimiters
Delimiters are ordered sets that delineate a frame
Immediately preceding and following the contents of a frame
SOF delimiters
Identify the start of a frame
Identify the transmission class
Used to establish a Class_1 connection
Identify the beginning and continuation of a sequence
EOF delimiters
Terminate frames
Identify the end of a sequence
Terminate connections
Indicate known frame errors
OPT-2T01
9899_06_2004_X
120
FC-1 Synchronization
Procedures
Sync acquire
Initialization
Loss of sync procedure
Primitive sequences
OPT-2T01
9899_06_2004_X
121
Sync Procedures
Bit synchronization
The state in which a receiver is delivering retimed serial data at
the required bit error rate
122
Synch Acquired
Loss of Sync State
Waiting on Bit Synchronization
Bit Sync Acquired
Data Word
Rx Ordered set #1
Data Word
Rx Ordered set #2
Data Word
Rx Ordered set #3
OPT-2T01
9899_06_2004_X
Sync Acquired
123
Loss-of-Sync Procedure
Sync Acquire State
No Invalid
Words Detected
Two
Consecutive
Valid Words
One Invalid
Word in Next
2 Words
One Invalid
Word in Next
2 Words
Two
Consecutive
Valid Words
One Invalid
Word in Next
2 Words
One Invalid
Word in Next
2 Words
Loss Of Sync
124
FC-1 Constructs
Port states
Primitive sequences
NOS/OLS/LR/LRR
Relationships
Port state transition table
OPT-2T01
9899_06_2004_X
125
Port States
Four primary operational states
Active state
Link recovery state
Link failure state
Offline state
126
Primitive Sequences
Ordered set that is transmitted continuously to
indicate that specific conditions within the port
are encountered
Transmitted while the condition exist
Four primitive sequences
Not Operational Sequence (NOS)
Offline Sequence (OLS)
Link Reset Sequence (LR)
Link Reset Response Sequence (LRR)
OPT-2T01
9899_06_2004_X
127
K28.5
OPT-2T01
9899_06_2004_X
D21.1
D31.5
D5.2
128
K28.5
OPT-2T01
9899_06_2004_X
D21.2
D10.4
D21.2
129
Primitive Sequence LR
Link Reset Sequence
Transmitted by port to indicate that it is:
Initiating the link reset protocol
To recover from a link timeout
To remove a Class_1 connection
K28.5
OPT-2T01
9899_06_2004_X
D9.2
D31.5
D9.2
130
K28.5
OPT-2T01
9899_06_2004_X
D21.1
D31.5
D9.2
131
OPT-2T01
9899_06_2004_X
132
133
Active State
(AC)
Link Recovery
State (LR)
Offline
State (OL)
Idle
OPT-2T01
9899_06_2004_X
134
Meaning
Transmit in
Response
Not Operational
NOS
OLS
Link Failure
Offline State
OLS
LR
Link Reset
LR
LRR
LRR
Idles
IDLE
Idles or
R_RDY
135
Link Failure
Port A
Port B
AC
AC
LF
NOS
LF
OLS
OL
LR
LR
LRR
LR
Idle
AC = Activity State
LR = Link Recovery State
AC
Idle
OL = Offline State
AC
OPT-2T01
9899_06_2004_X
136
Offline
Port A
Port B
AC
AC
OL
OLS
OL
Request to Go
Offline
LR
LR
LRR
Request to Go
Online
LR
Idle
AC
Idle
AC
OPT-2T01
9899_06_2004_X
AC = Activity State
137
OPT-2T01
9899_06_2004_X
138
Word
3
1
2
4
R_CTL Routing
CS_CTL 8 bits
Class Spec
TYPE 8 bits
Data structure
SEQ_ID 8 bits
2
3
1
6
1
5
Frame Control
OPT-2T01
9899_06_2004_X
139
Routing Control
The Routing control field is an 8 bit field
R_CTL consist of two 4 bit sub-fields
Routing
Information category
31
28
Routing
OPT-2T01
9899_06_2004_X
27
24
Info Category
140
Routing Control
The R_CTL is used to direct the frame to the
process the frame is directed to; For example:
Frames directed to the fabric for extended link
services (0x22)
Indication of the function or purpose of the frame
payload from the upper level protocol at FC-4 (0x01)
OPT-2T01
9899_06_2004_X
141
Port Addressing
Word
3
1
2
4
R_CTL Routing
CS_CTL 8 bits
Class Spec
TYPE 8 bits
Data structure
SEQ_ID 8 bits
4
5
OPT-2T01
9899_06_2004_X
2
3
1
6
1
5
Frame Control
142
Port Addressing
D_ID and S_ID fields are 24 bits each
They provide the address or identifier of the
Source and destination port of a frame
OPT-2T01
9899_06_2004_X
143
144
Topology
Assignment
OPT-2T01
9899_06_2004_X
145
Address Identifiers
8 bits
Switch Topology
Model
Public Loop
(Connected to
Switch)
OPT-2T01
9899_06_2004_X
8 bits
8 bits
Switch Domain
Area
Device
00
00
Domain
Area
AL_PA
146
Reserved Addresses
FC-PH has defined a block of addresses for
special functions:
High order 16 addresses in the 24 bit address space
Called the well known addresses
OPT-2T01
9899_06_2004_X
FF FF FC
Directory Server
FF FF FD
Fabric Controller
FF FF FE
147
Word
0
1
2
3
4
5
OPT-2T01
9899_06_2004_X
3
1
2
4
2
3
R_CTL Routing
1
5
CS_CTL 8 bits
Class Spec
TYPE 8 bits
Data structure
SEQ_ID 8 bits
1
6
F_CTL 24 bits
DF_CTL 8 bits
Data field
Frame Control
148
Type
The TYPE is a 8 byte field
Indicates the upper level carried in the payload
of the frame
Examples:
SCSI
08h
IP
05h
SNMP 24h
Fibre Channel services 20h
OPT-2T01
9899_06_2004_X
149
Frame Control
Word
3
1
2
4
R_CTL Routing
CS_CTL 8 bits
Class Spec
TYPE 8 bits
Data structure
SEQ_ID 8 bits
4
5
OPT-2T01
9899_06_2004_X
2
3
1
6
1
5
Frame Control
150
Frame_Control
The frame control is a 24 bit field
It contains a number of flags that are used to
control the flow of the sequence
The more common flags are exchange and
sequence management, acknowledgement
control and error conditions
Bits 16-23 deal with the sequence and exchange settings
Bits 14-15 deal with X_ID
Bits 13-12 form the ACK level for class 1 & 2
Bits 5-4 used for aborting the sequence
OPT-2T01
9899_06_2004_X
151
OPT-2T01
9899_06_2004_X
152
Sequence Identifier
Word
3
1
2
4
2
3
1
6
1
5
0
R_CTL Routing
1
2
3
CS_CTL 8 bits
Class Spec
TYPE 8 bits
Data structure
SEQ_ID 8 bits
4
5
F_CTL 24 bits
DF_CTL 8 bits
Data field
Frame Control
OPT-2T01
9899_06_2004_X
153
Sequences
Sequences
Deal with chunks of upper level protocol
Are made up of one or more frames which
transport the ULP
The data phase may be subdivided into
multiple sequences
Uniquely identifiable with SEQ_ID
The command, data, and status phases of
SCSI are examples of sequences
OPT-2T01
9899_06_2004_X
154
Sequence Identifier
The Sequence Identifier (SEQ_ID) is a 8 bit field
All Frames of a sequence will carry the same
SEQ_ID value
Data content of these frames are related in some way
by the ULP
OPT-2T01
9899_06_2004_X
155
Sequence Count
Word
2
4
R_CTL Routing
CS_CTL 8 bits
Class Spec
TYPE 8 bits
Data structure
SEQ_ID 8 bits
4
5
OPT-2T01
9899_06_2004_X
3
1
2
3
1
6
1
5
Frame Control
156
Sequence Count
Sequence count (SEQ_CNT) is a 16 bit field
Identifies the order of the transmission of frames
within this sequence
Used by Sequence Recipient (SR) to account for
all transmitted frames
Used by Sequence Initiator (SI) to account for all
transmitted acknowledges (ACKs) in Class 1 and 2
OPT-2T01
9899_06_2004_X
157
Sequence Count
Within a Sequence_Initiative
The SEQ_CNT of the first data frame will be zero
The SEQ_CNT of each subsequent data frame in the
sequence will be incremented by 1
The first data frame of the next sequence may be either
zero or one more then the last data frame, this is called
continuously increasing SEQ_CNT
If streamed sequences is used, continuously increasing
SEQ_CNT is required
OPT-2T01
9899_06_2004_X
158
Sequence Count
Sequence initiator
Assigns SEQ_CNT to data frames
Keeps a record of ACK frames received
Sequence recipient
Records SEQ_CNT of data frames
Transmits an ACK frame for each valid data frame
when Rx buffer is available
Knows that sequence was received without error if all
Frames are Rx without errors and are accounted for
Sequence initiator
Knows the sequence was received without error if it
has Rx an ACK frame to all frames within the sequence
OPT-2T01
9899_06_2004_X
159
Exchange Identifiers
Word
3
1
2
4
R_CTL Routing
CS_CTL 8 bits
Class Spec
TYPE 8 bits
Data structure
SEQ_ID 8 bits
4
5
OPT-2T01
9899_06_2004_X
2
3
1
6
1
5
Frame Control
160
OPT-2T01
9899_06_2004_X
161
Parameter Field
Word
3
1
2
4
R_CTL Routing
CS_CTL 8 bits
Class Spec
TYPE 8 bits
Data structure
SEQ_ID 8 bits
4
5
OPT-2T01
9899_06_2004_X
2
3
1
6
1
5
Frame Control
162
Parameter Field
The parameter is a 4 byte field
The content of the parameter field is dependent
on the specific frame type as identified in the
routing field
FC-4 data frames
ACK link control
Port reject and frame reject frames
Port busy and fabric busy frames
OPT-2T01
9899_06_2004_X
163
LOGIN PARAMETERS
OPT-2T01
9899_06_2004_X
164
Login
Procedure to Determine the Operating Environment
for Communications between Two Ports
Exchange service parameters done with login
frame PLOGI or FLOGI
Required before communications can be
established between the two ports
Applies to all topologies
Applies to all ports, node and fabric
Bi-directional
ACCEPT Frame contains service parameters of the
port addressed
OPT-2T01
9899_06_2004_X
165
Login
Service Parameters Contain the Following
Type of Information
Version of Fibre Channel support
N_Port or F_Port functionality
Service classes supported
Size of receive buffers
Number of sequences supported
Support for Intermix
ACK capability
Error policy supported
Others
OPT-2T01
9899_06_2004_X
166
ACKs
Informs Transmitter that:
One or more valid data frames were received by the
sequence recipient for the corresponding sequence
qualifier
Interface buffer is available for another data frame,
this only applies to class 1 and class 2
Class 3 are not ACKed
Flow control
Re-instates end-to-end credit
OPT-2T01
9899_06_2004_X
167
ACKs
Frame Header
Constructed from the data frame which is being
acknowledged
S_ID and D_ID are swapped
F_CTL with both exchange and sequence context
bit inverted
SEQ_ID is unchanged
SEQ_CNT is set to the sequence count of the highest
data frame being replied to by the ACK
Parameter Field
Bit 16 = History bit
Bits 0-15 are ACK type specific
OPT-2T01
9899_06_2004_X
168
ACKs
Again there are three types of ACKs
ACK_1 default for class 1 and 2 one ACK sent for each
SEQ_CNT
ACK_N Class 1 or 2 N=ACK sent by recipient for the
support indicated during port login
ACK_0 class 1 or 2 single ACK sent at end of sequence
We could spend a lot more time discussing ACKs but
there is little or no class 1 or 2 used in networks today
and doubt if we will see any soon
OPT-2T01
9899_06_2004_X
169
OPT-2T01
9899_06_2004_X
170
OPT-2T01
9899_06_2004_X
171
172
173
174
175
Both based on
Credit
Credit_CNT
Differ in
Frames controlled and acknowledgement signal
OPT-2T01
9899_06_2004_X
176
Sequence
Recipient
Fabric
RX Buf
TX
Buf
TX
Buf
RX Buf
R_RDY
R_RDY
RX Buf
ACK
TX
Buf
ACK
RX Buf
TX
Buf
R_RDY
R_RDY
BB_C
BB_C
EE_Credit
OPT-2T01
9899_06_2004_X
177
Class of Service
Applicable to all fabric topologies
Switched
Point to point
Arbitrated loop
178
Class of Service
SOF delimiter
The required class of service along with basic sequence
management are specified in the SOF delimiter of
every frame
The SOF delimiter dedicate basic link management
functions within the fabric
The SOF delimiter identifies basic Sequence management
functions within the destination N_Port in the initial frame
of the sequence and the last frame of the sequence
EOF delimiter
Last frame of a sequence is terminated by a special EOF
Dedicated connections are removed by a special EOF
OPT-2T01
9899_06_2004_X
179
Class of Service
Class 1
Dedicated connection service
Connection oriented service between two N_Ports
Frames received in order transmitted
Guaranteed delivery with notification of non-delivery
Guaranteed throughput
Optional Intermix
Can mix Class 2 and 3 frames if allowed
OPT-2T01
9899_06_2004_X
180
Class of Service
Class 1
Requires explicit connection establishment
SOF(C1) delimiter
Flow control
Buffer to buffer on SOF(C1) frame: R_RDY
End to end for all other data frames: ACK
OPT-2T01
9899_06_2004_X
181
Recipient
Fabric
SOF(C1)
Connection Requested
R_RDY
R_RDY
ACK
Connection Established
SOF(n1)
SOF(n1)
SOF(n1)
ACK
ACK
ACK
Conn Removed
OPT-2T01
9899_06_2004_X
EOF(t)
182
Class of Service
Class 2
Multiplexed connectionless service
Connectionless oriented service between two N_Ports
Order of frame reception not guaranteed
Guaranteed delivery
Notification of non-delivery
No throughput guarantees
Optional intermix
OPT-2T01
9899_06_2004_X
183
Class of Service
Class 2
Multiplex on a frame-by-frame basis
Between different destination N_Ports
Among different sequences
OPT-2T01
9899_06_2004_X
184
Recipient
Fabric
SOF(C2)
R_RDY
R_RDY
ACK
R_RDY
SOF(n2)
R_RDY
SOF(n2)
R_RDY
ACK
R_RDY
ACK
R_RDY
R_RDY
R_RDY
OPT-2T01
9899_06_2004_X
185
Class of Service
Class 3
Datagram multiplexed connectionless service
Connectionless oriented service between two N_Ports
Order of frame reception not guaranteed
Unacknowledged
Delivery NOT guaranteed
No throughput guarantees
Optional intermix
OPT-2T01
9899_06_2004_X
186
Class of Service
Class 3
Multiplex on a frame-by-frame basis
Between different destination N_Ports
Among different sequences
OPT-2T01
9899_06_2004_X
187
Recipient
Fabric
Data Frame
R_RDY
R_RDY
Data Frame
R_RDY
OPT-2T01
9899_06_2004_X
R_RDY
188
EE Credit
Switch
EE_Credit
NL_Node A
EE_Credit
NL_Node B
OPT-2T01
9899_06_2004_X
EE_Credit
189
BB Credit
Switch
BB_Credit
NL_Node A
BB_Credit
NL_Node B
OPT-2T01
9899_06_2004_X
BB_Credit
190
FRAME PROCESSING
OPT-2T01
9899_06_2004_X
191
Tables
The N_Port will keep the following information
Available X_ID table
Exchange context table
Login table
OPT-2T01
9899_06_2004_X
192
Tables
Available X_ID Table
This table contains a list of available X_IDs
Can be used for OX_IDs or RX_IDs
A given implementation may choose to keep two tables
one for OX_ID and RX_ID
OPT-2T01
9899_06_2004_X
193
Tables
Exchange Context Table
Each exchange ID points to a unique entry in the
exchange context table
Each entry contains the context and state
information for the particular exchange
Port_ID involved in exchange
X_ID it assigned to exchange
ULP and phase within the operation
Data source or destination address
Data frames transmitted or received (SEQ_CNT)
ACK frames transmitted or received (SEQ_CNT)
OPT-2T01
9899_06_2004_X
194
Tables
Login Table
This table contains one entry for each port to
which this port is logged in with
Each entry contains service parameters and
working EE_Credit count value
OPT-2T01
9899_06_2004_X
195
Login process
If you are not logged in, initiate login process
Build logging table entry for destination port
OPT-2T01
9899_06_2004_X
196
Data Frames
Data Frame Transmission (Cont.)
Gather information
Exchange context table
Receive buffer size and destination port
Login table
Working credit count of destination port
Set-up frame header
197
Data Frames
Transmit Request
ULP passes a request to transmit a chunk
of data to the N_Port
Destination Port D_ID is made
OPT-2T01
9899_06_2004_X
198
Data Frames
The Data Transmission
ULP data chunk is moved in frames with the
use of the sequence
All within the context of the exchange
OPT-2T01
9899_06_2004_X
199
OPT-2T01
9899_06_2004_X
200
Sequence Processing
Sequence Count
ULP chunk of data is transmitted IN ORDER
All Frames are sent in order
Sequence_Count (SEQ_CNT)
Frames are assigned sequentially increasing numbers
as they are sent
The receiving N_Port will use the SEQ_CNT to insure
that Frames are reassembled in order and back in its
original chunk
OPT-2T01
9899_06_2004_X
201
OPT-2T01
9899_06_2004_X
202
Parameter
Set to offset of the first byte of payload with respects
to entire chunk
Offset = 0 on first frame and 1 + for second and
subsequent frames
OPT-2T01
9899_06_2004_X
203
OPT-2T01
9899_06_2004_X
204
Automatic Processes
These processes are automatic and are
performed by the protocol chip
Segmentation and reassembly
SEQ_CNT assignment
Higher layers are unaware of these processes
OPT-2T01
9899_06_2004_X
205
ULP Processing
The Upper Level Protocol (ULP) uses these fields
Routing 0000 = FC-4 data frame
Type 08 = SCSI/FCP
Info category
206
ARBITRATED LOOP
OPT-2T01
9899_06_2004_X
207
Attaches NL_Ports
Number of nodes on loop
directly affects performance
208
Loop Advantages
Low cost solution with copper transceivers
Eliminates the need for a discrete fabric
Fabric routing decision distributed around the loop
OPT-2T01
9899_06_2004_X
209
Loop Advantages
Port bypass network
High availability configurations possible
Supports both public and private loops
Provides access fairness
OPT-2T01
9899_06_2004_X
210
NL_Port
N_Port
Attaches to the physical transport media
Provides the Fibre Channel control and protocol
Provides the termination point for Fibre Channel
Resides within the node
NL_Port
Provides all functionality on N_Port with additional
function of the loop
An NL_Port can function as a N_Port
OPT-2T01
9899_06_2004_X
211
FL_Port
F_Port
Attaches to the physical transport media at the edge of
the switched fabric
FL_Port
The switched fabric port which attaches to a loop
F_Port functionality with additional function of the loop
OPT-2T01
9899_06_2004_X
212
Public Loop
Contains an FL_Port
Communications outside loop via Fibre Channel
is possible
Private Devices
Devices on a public loop may be private, i.e. do not login
OPT-2T01
9899_06_2004_X
213
Addressing
Arbitrated Loop Physical Address (AL-PA or PA)
Assigned during the loop initialization (soft addressing)
A unique 8 bit value
127 valid values
214
OPT-2T01
9899_06_2004_X
215
Switched Fabric
N_Port
N_Port
N_Port
N_Port
N_Port
N_Port
Switch
Fabric
OPT-2T01
9899_06_2004_X
216
Loop
= Arbitrated Loop
Additional Function
Fabric
Node
Node
Node
NL_Port
NL_Port
NL_Port
LOOP
OPT-2T01
9899_06_2004_X
NL_Port
NL_Port
NL_Port
Node
Node
Node
217
218
Arbitration
The process by which an L_Port acquires ownership
of the loop
Open
The process by which the L_Port which owns the Loop
uses to select the L_Port to which it wants to
communicate with
Close
The process by which the L_Port which owns the Loop
releases control
OPT-2T01
9899_06_2004_X
219
Fill Words
FC-PH defines two signals that may be transmitted
between frames (when no other information is
being transmitted)
Idle
R_RDY
220
Primitive sequences
Port bypass enable
Port bypass disable
Loop initialization
OPT-2T01
9899_06_2004_X
221
Credits Buffers
Loop Uses Same Credit Method as Previously
Discussed But Also Has an Alternate Credit Model
Alternate BB_Credit management requested
during login
When activated service parameter BB_Credit = number
of buffers available when circuit is established
The receiving L_Port shall transmit R_RDYs for the
additional buffers at anytime when opened
Used to pump up BB_Credit_CNT
Transmitting L_Port
Decrements BB_Credit by 1 for each data frame Tx
Increments BB_Credit by 1 for each R_RDY Rx
Stops transmitting when BB_Credit =0
OPT-2T01
9899_06_2004_X
222
OPT-2T01
9899_06_2004_X
223
Loop Commands
Loop Initialization ProcedureLIP Is an Ordered Set
Command
Bytes
Payload Contents
LISM
12
LIFA
20
LIPA
20
LIHA
20
LISA
20
LIRP
132
LILP
132
OPT-2T01
9899_06_2004_X
224
Phase B
Phase C
OPT-2T01
9899_06_2004_X
Start The
Initialization
Procedure
LIP
Select
Temporary Loop
Master
LISM
AL_PA
Mapping
Phase
LIFA,
LIPA,
LIHA,
LISA
Start The
Initialization
Procedure
Phase D
Reporting Phase
LIRP
Phase E
Distribute AL_PA
Map Phase
LILP
Close
2004 Cisco Systems, Inc. All rights reserved.
225
LIP: Phase A
Loop Initialization Primitive Sequence
Transmitted continuously by L_Port until it
receives the same LIP configuration
LIP (F7F7) the L_Port is attempting to determine if this
is a loop and to acquire an AL_PA
LIP (F8F7) the L_Port has detected a loop failure at its
receiver prior to acquiring an AL_PA
LIP (F8) the L_Port (AL_PS) had detected a loop failure
at its receiver
LIP (F7) the L_Port (AL_PS) has detected a performance
degradation
OPT-2T01
9899_06_2004_X
226
LIP: Phase B
Each L_Port will build the LISM with:
AL_PA = 00 hex if FL_Port
EF hex if NL_Port
D_ID
S_ID
= 0000hex + AL_PA
227
Loop master
Current fill word would be ARB(F0)
When ARB(F0)s are received, go to phase C
OPT-2T01
9899_06_2004_X
228
LIP: Phase C
Loop Master Will Form the Initial Bit Map as
Shown:
Lowest
AL_PA
Bit
Position
Word
31
L000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
0000
Where
24
23
16
15
Highest
AL_PA
229
LIP: Phase C
Loop master will transmit the following three
commands allowing an L_Port to choose a
desired AL_PA
LIFA
LIPA
LIHA
OPT-2T01
9899_06_2004_X
230
LIP: Phase C
Each NL_Port will
Receive, possibly modify and retransmit the four
Initialization Command frames
Set the Current Fill Word (CFW) = ARB(F0)
231
LIP: Phase D
The loop master will prime the AL_PA position
map to:
Byte 0 = 01 hex
Byte 1 = its AL_PA
Bytes 2-127 = FF hex
Then transmit the LIRP with this position map
232
LIP: Phase E
The loop master will transmit the LILP
command with
Payload = AL_PA position map
Initialization Complete
OPT-2T01
9899_06_2004_X
233
LIP: Summary
A. LIP starts the initialization procedure
B. Select a temporary loop master
Lowest AL_PA | WWN wins
234
Arbitration
The process by which L_port request ownership of
the loop based on primitive signals
Ordered Set
MSB
K28.5
ARB(x)
OPT-2T01
9899_06_2004_X
LSB
D20.4
AL_PA
AL_PA
235
Arbitration
Loop Owner
The current loop owner is responsible for
Seeds the arbitration process with ARB(F0)
Blocks propagation of the received ARB(x) until it
relinquishes the loop
Fairness variables
Access
ARB_WON
OPT-2T01
9899_06_2004_X
236
Arbitration Process
When a port is arbitrating it enters the
arbitrating state
The CFW is updated to the ports ARB(AL_PA)
if the CFW is:
1.
2.
3.
4.
IDLE
ARB(F0)
ARB(FF)
Lower-priority ARB (higher value AL_PA)
237
Fairness
Access Fairness
Ports with higher-Priority AL_PA values could lock out lower
priority ports
When they ARB they will always win
Lower Priority ports might never win Arbitration
238
Fairness
The fairness is controlled by the FC-AL fairness
algorithm called a fairness window
Window begins when the first port wins arbitration
Ends when a port discovers that it was the last arbitrating port
IDLE resets the fairness window
The variables used are
Access = 0 for fairness window open
Access = 1 when NL_Port has won arbitration
239
Open
If the Port Requires the Loop when It Wins ARB
It sends an OPN(yx) or OPN(yy)
y=destination port x=source port
Full-Duplex establishes a point to point like circuit
between the loop ports
Half-duplex restricts open recipient to transmit link
control frames only
Cannot transmit device data frames
Used by designs that can not support simultaneous
data frames Tx and Rx
OPT-2T01
9899_06_2004_X
240
Open
Selecting the Destination Port
Is the intended destination port on same loop or
connected via fabric switch?
If the upper 16 bits of destination field (D_ID) are all zeros
the port is on this private loop
If the upper 16 bits of the source(S_ID)are all zeros then the
source port is a private port and can only talk to ports on
same loop
If the upper 16 bits of the D_ID are the same as the upper 16
bits of the S_ID then they are both on the same loop or both
are public and attached to the same FL_Port
241
OPT-2T01
9899_06_2004_X
242
OPT-2T01
9899_06_2004_X
243
OPT-2T01
9899_06_2004_X
244
Switch Model
Port
Connection
Matrix
Port
Fabric Controller
Connectionless
Switch Matrix
Port
OPT-2T01
9899_06_2004_X
Port
245
Worldwide Names
Each switch element is assigned a WWN at time
of manufacture
Each switch port is assigned a WWN at the
time of manufacture
During FLOGI the switch identifies the WWN
in the service parameters of the accept frame
Fabric port and
Switch element
246
Switch Ports
Four basic types of switch ports
F_PortUses NOS/LOS to attach to single N_Port
FL_PortUses LIP to attach 1 to 126 NL_Ports
E_PortUses NOS/LOS to interconnect switches
(inter-link switch ISL)
G_PortUses NOS/LOS can be a F or E port
OPT-2T01
9899_06_2004_X
247
Fabric Addressing
The 24 bit address is partitioned into 3 fields
Device
Area
Domain
Switch Domain
8 bits
Area
8 bits
Device
248
Directory Server
Repository of information regarding the
components that make up the Fibre
Channel network
Located at address FF FF FC (Some readings
call this the name server)
Components can register their characteristics
with the directory server
An N_Port can query the directory server for
specific information
Query can be the address identifier, WWN and volume
names for all SCSI targets
OPT-2T01
9899_06_2004_X
249
Directory Server
Command Requests
Register objects
RFT_IDRegister FC-4 types
Deregister objects
DA_IDDeregister all
OPT-2T01
9899_06_2004_X
250
Fabric Controller
Each switch has a fabric controller
Assigned address FF FF FD
Every fabric controller in the fabric has the same address
It is the N_Port within the switch
Responsible for managing fabric, initialization, routing,
setup and teardown of Class-1 connections
OPT-2T01
9899_06_2004_X
251
OPT-2T01
9899_06_2004_X
252
PRLI
Process Login
OPT-2T01
9899_06_2004_X
SCN
SCR
RSCN
253
ELS: FLOGI
FLOGIFabric login
Issued by N_Port to destination FF FF FE to
Determine if fabric is present
Establish a session with the fabric
Exchange service parameters with the fabric
FLOGI assigns N_Ports 24 bit address to
N_Port or AL_PA to loop ports
OPT-2T01
9899_06_2004_X
254
ELS: PLOGI
PLOGIN_Port login
Established sessions between two N-Ports
Required before upper level protocol operations can begin
N_Port will register to the name server FF FF FC in fabric
with all required login parameters
N_Port will then query name server for other N_Ports
on the fabric
OPT-2T01
9899_06_2004_X
255
ELS: PRLI
PRLIProcess Login
Allows the FC-4 levels to exchange service parameters
for communications between each other
Process is protocol specific (type field)
SCSI-3 FCP mapping requires PRLI
OPT-2T01
9899_06_2004_X
256
ELS: FAN
FANFabric Address Notification
Used in fabric loop attached topology
Provides mechanism for FL_Port to notify
NL_Ports of addresses and names of FL_Ports
along with fabric name
Allows NL_Ports to verify configuration following
a loop initialization
OPT-2T01
9899_06_2004_X
257
ELS: SCN
SCNState Change Notification
Provides notification to ports of events that may
effect logins or process logins to ports on the fabric
SCN can be sent from
N_Port to N_Port
N_Port to fabric controller
Fabric controller to N_Ports
258
ELS: RSCN
RSCNRegistered State Change Notification
Similar to SCN but only sends change notice
to those ports registered
SCN did not define a registration method
OPT-2T01
9899_06_2004_X
259
Class_F Service
Communications between switch elements use
Class_F Service
Unique SOF delimiter and normal EOF delimiter
OPT-2T01
9899_06_2004_X
260
Inter-Switch Link
The interconnection between switches is called
the inter-switch link
E_Port to E_Port
OPT-2T01
9899_06_2004_X
261
OPT-2T01
9899_06_2004_X
262
Switch 1
Switch 3
Switch 2
Switch 5
Switch 4
Switch 6
OPT-2T01
9899_06_2004_X
263
OPT-2T01
9899_06_2004_X
264
Starting
Process
Ending
Establish Link
Parameters
and Switch
Port Operating
Mode
Select
Principle
Switch
BF or RCF SW_ILS
transmitted or
received
Switch_Names are
exchanged over all ISLs to
select a Principle Switch,
which becomes the Domain
Address Manager
Domain ID
Acquisition
Domain Address
Manager had been
selected
Path Selection
Switch has a
Domain_ID
Condition
OPT-2T01
9899_06_2004_X
Condition
265
OPT-2T01
9899_06_2004_X
266
OPT-2T01
9899_06_2004_X
267
0x11
WWN
Record Len
(0x10)
Payload Len
Reserved
or
t
E_
P
E_
Po
rt
E_
P
EFP
EFP
Domain_ID record M
rt
Po
EF
P
E_
EFP
Domain_ID record 0
EF
P
or
t
EF
P
EF
P
Priority
Priority
EFP
Domain_id
OPT-2T01
9899_06_2004_X
E_
Po
rt
EFP
Po
rt
SW
_R
JT
CC
_A
EFP
E_
SW
E_
Po
rt
SW
_A
C
SW C
_R
JT
Priority
E_
Po
rt
EFP
269
Each switch can send many RDI but once the principal switch
has granted the domain ID to the switch, the following RDI
from the switch must request the same set of domain_id
OPT-2T01
9899_06_2004_X
270
DIA (SW
_REQ)
SW_ACC
SW_RJT
Upstream port
)
RDI (SW_REQ
E_
D
IA
E_
Po
rt
E_
Po
rt
Upstream port
Po
rt
DI E
A _P
or
t
SW_ACC
SW_RJT
EFP (SW
_REQ)
Contains
DoID lis
t
SW_ACC
SW_ACC
SW_RJT
)
RDI (SW_REQ
)
RDI (SW_REQ
SW_ACC
SW_RJT
EFP (SW
_REQ)
Contains
DoID lis
t
SW_ACC
OPT-2T01
9899_06_2004_X
DIA (SW
_REQ)
SW_ACC
SW_RJT
EFP (SW
_REQ)
Contains
DoID lis
t
SW_ACC
271
272
Fabric Configuration
Disruptive/Non-Disruptive
One of the following three conditions can trigger BF
(non-disruptive) or RCF (disruptive)
Two disjoints fabric are combined together
A principal ISL fails (upstream or downstream)
A switch with Domain_ID request for another Domain_ID
273
Switch Starts
the
Reconfig
OPT-2T01
9899_06_2004_X
274
Area_ID
Port_ID
Description
00
00
00
00
00
AL_PA
00
00
NonAL_PA
Reserved
00
01-FF
00-FF
Reserved
01-EF
00-FF
00-FF
N_Port & E_Port. Port ID=00 for FL port for public devices 255 address
F0-FE
00-FF
00-FF
Reserved
FF
00-FA
00-FF
Reserved
Multicast & Broadcast
OPT-2T01
9899_06_2004_X
FF
FB
00-FF
FF
FC
00
Reserved
FF
FC
01-EF
FF
FC
F0-FF
Reserved
FF
FD-FE
00-FF
Reserved
FF
FF
00-EF
Reserved
FF
FF
F0-FC,FF
FF
FF
FD
FF
FF
FE
275
OPT-2T01
9899_06_2004_X
276
OPT-2T01
9899_06_2004_X
277
OPT-2T01
9899_06_2004_X
278
OPT-2T01
9899_06_2004_X
279
OPT-2T01
9899_06_2004_X
280
B
LSU(DB-A)
LSU(DB-B)
LSA(DB-B)
LSA(DB-A)
OPT-2T01
9899_06_2004_X
281
Fabric Configuration
Link State Record
B
LSU(LSR-A)
LSU(LSR-B)
LSA(LSR-B)
LSA(LSR-A)
OPT-2T01
9899_06_2004_X
282
Fabric Configuration
Link cost is calculated based on baud rate of the
link, plus an administratively set factor
Link cost = S * (1.0625E12/baud rate)
S is administrative factor defaults to 1
Ex: Link cost of 1G port = 1000
OPT-2T01
9899_06_2004_X
283
Fabric Configuration
Path Selection (FSPF) Operation Summary
Operation
Starting
Condition
Process
Ending
Condition
Perform initial
HELLO
Exchange
The switch
sending HELLO
has a valid
Domain_ID
Two way
communication
has been
established
Perform Initial
Database
Exchange
Two
communication
has been
established
Link State
Databases have
been exchanged
Running State
Initial Database
Exchange has
been completed
OPT-2T01
9899_06_2004_X
284
FSPF Characteristics
Uses FSPF as the routing algorithm
FSPF routes traffic based on destination domain ID
FSPF uses total cost as the metric to determine
most efficient path
Static routes can be applied
OPT-2T01
9899_06_2004_X
285
FSPF Characteristics
Paths:
Finds the shortest path to each domain, then programs the
hardware routing tables
Routes:
Dynamically
Round robin
Statically
Administrator can configure the route
Automatically re-routes upon ISL going away and static routing
will again take effect upon ISL return
Automatic failover
Fault detection 150 ms
Self heals in 500 ms
So, alternate route is live in 650 ms
OPT-2T01
9899_06_2004_X
286
OPT-2T01
9899_06_2004_X
287
Route
FC
ISL
Path
288
Selecting a Path
FC
Cost 500
FC
Cost 250
Cost 250
289
ISL Oversubscription
Multiple Nodes
1G
1G
1G
Switch
ISL
1G
Switch
OPT-2T01
9899_06_2004_X
Oversubscription
occurs when more
nodes can contend
for the use of one ISL
Oversubscription
ratio is the number of
different ports that
contend for the use
of one ISL
This a 3:1 over
subscription
290
FC ERROR MANAGEMENT
OPT-2T01
9899_06_2004_X
291
Timers
Four different timers used
Receiver-transmitter time-out (R_T_TOV)
Error detect time-out (E_D_TOV)
Resource allocation time-out (R_A_TOV)
Connection request time-out (C_R_TOV)
Used in Class 1
OPT-2T01
9899_06_2004_X
292
Timers: R_T_TOV
Receiver-Transmitter Time-out
Used to time events at the link level
Loss of synchronization
Times Responses for link reset protocol
OPT-2T01
9899_06_2004_X
293
Timers: E_D_TOV
Error Detect Time-out
Timers for events and responses at the
sequence level
Missing ACK or R_RDY when buffer credit has
reached zero
Class 1 or 2 expects response from data frames
N_Port logout
OPT-2T01
9899_06_2004_X
294
Timers R_A_TOV
Resource Allocation Time-out
Time-out value for how long to hold resources
associated with a failed operation
Needed to free shared resources for reuse
OPT-2T01
9899_06_2004_X
295
Timers: CR_TOV
Connection Request Time-out
Determines how long the fabric can hold
a class-1 request in the queue during
connection establishment
Allows for separation of the time in a stacked
queue from the E_D_TOV; This separates
queuing time from frame transit time
Helps in controlling F_BSY issues
OPT-2T01
9899_06_2004_X
296
Recovery: Class 3
Errors in class 3 sequence can only be detected by
the Sequence recipient because there are no ACKs
or rejects in class 3
Class 3 SR will discard single or multiple frames
until the exchange is terminated
The upper level recovery may retransmit the entire
Sequence or at least the sequence following the
error detection
OPT-2T01
9899_06_2004_X
297
Recovery: Class 3
Errors a class 3 operation can detect:
Out of order delivery and potential missing frame based
on SEQ_CNT
If missing frame is not Rxed within E_D_TOV
Indication of a new sequence when last frame of previous
Sequence has not been received (in-order delivery set)
Relative offset not in order with an order delivery set
OPT-2T01
9899_06_2004_X
298
OPT-2T01
9899_06_2004_X
299
Timers
Four different timers used
Receiver-transmitter time-out (R_T_TOV)
Error detect time-out (E_D_TOV)
Resource allocation time-out (R_A_TOV)
Connection request time-out (C_R_TOV)
Used in Class 1
OPT-2T01
9899_06_2004_X
300
Timers: R_T_TOV
Receiver-Transmitter Time-out
Used to time events at the link level
Loss of synchronization
Times Responses for link reset protocol
OPT-2T01
9899_06_2004_X
301
Timers: E_D_TOV
Error Detect Time-out
Timers for events and responses at the
sequence level
Missing ACK or R_RDY when buffer credit has
reached zero
Class 1 or 2 expects response from data frames
N_Port logout
OPT-2T01
9899_06_2004_X
302
Timers R_A_TOV
Resource Allocation Time-out
Time-out value for how long to hold resources
associated with a failed operation
Needed to free shared resources for reuse
OPT-2T01
9899_06_2004_X
303
Timers: CR_TOV
Connection Request Time-out
Determines how long the fabric can hold
a class-1 request in the queue during
connection establishment
Allows for separation of the time in a stacked
queue from the E_D_TOV; This separates
queuing time from frame transit time
Helps in controlling F_BSY issues
OPT-2T01
9899_06_2004_X
304
Recovery: Class 3
Errors in class 3 sequence can only be detected by
the Sequence recipient because there are no ACKs
or rejects in class 3
Class 3 SR will discard single or multiple frames
until the exchange is terminated
The upper level recovery may retransmit the entire
Sequence or at least the sequence following the
error detection
OPT-2T01
9899_06_2004_X
305
Recovery: Class 3
Errors a class 3 operation can detect:
Out of order delivery and potential missing frame based
on SEQ_CNT
If missing frame is not Rxed within E_D_TOV
Indication of a new sequence when last frame of previous
Sequence has not been received (in-order delivery set)
Relative offset not in order with an order delivery set
OPT-2T01
9899_06_2004_X
306
OPT-2T01
9899_06_2004_X
307
iSCSI
RFC 3720
OPT-2T01
9899_06_2004_X
308
Session Modules
What is iSCSI and what is the big picture?
iSCSI protocol Introduction
The iSCSI connection
Security, data integrity and error recovery
iSCSI protocol details in-depth
Simple iSCSI connection flows
Service location protocol for IP storage
OPT-2T01
9899_06_2004_X
309
What Is iSCSI?
A SCSI transport protocol that operates on
top of TCP
Encapsulates SCSI-3 CDBs (Control Descriptor Blocks)
and Data into TCP/IP byte-streams (defined by SAM-2
SCSI Architecture Model 2)
Allows IP hosts to access IP or Fibre Channel-connected
SCSI targets
Allows Fibre Channel hosts to access IP SCSI targets
Standards status
RFC 3720 (assigned May 2004)
Major industry support (Cisco, IBM, EMC, HP, Microsoft)
OPT-2T01
9899_06_2004_X
310
Storage Technology
SCSI Domain
SCSI Device
Target
Logical
Unit 1
Port
Application
Client
Service Delivery
Subsystem
Port
SCSI Device
Initiator
Device
Server
Task Request
Task Response
Task
Manager
311
SAN
iSCSI
iSCSI
Appliance Gateway
NAS
Computer System
Application
Application
Application
Application
Application
File System
File System
File System
File System
File System
Volume Manager
Volume Manager
Volume Manager
Volume Manager
I/O Redirector
NFS/CIFS
TCP/IP stack
NIC
TCP/IP stack
TCP/IP stack
NIC
NIC
File I/O
Block I/O
SCSI
SAN
IP
IP
IP
FC
NIC
TCP/IP stack
iSCSI layer
Bus Adapter
NIC
TCP/IP stack
iSCSI layer
Bus Adapter
NIC
TCP/IP stack
File System
Device driver
FC switch
OPT-2T01
9899_06_2004_X
Block I/O
312
IP Storage Networking
IP storage networking provides solution to carry storage
traffic within IP
Uses TCP: A reliable transport for delivery
Can be used for local data center and long haul applications
Two primary protocols:
iSCSIIP-SCSIUsed to Transport SCSI CDBs and Data
within TCP/IP Connections
IP TCP iSCSI
SCSI
Data
IP TCP FCIP
OPT-2T01
9899_06_2004_X
FC
SCSI
Data
313
iSCSI
Gateway
FC
Target
FC
iSCSI
Initiator Target
iSCSI
Gateway
iSCSI
Initiator
iSCSI
Target
iSCSI
FC
Initiator Target
FC
Initiator
iSCSI Components
iSCSI is an end-to-end protocol
iSCSI has human readable SCSI device (node)
naming
iSCSI includes the following base components:
IPSEC connectivity security
Authentication for access configuration
Discovery of iSCSI nodes
Process for remote boot
iSCSI MIB standards
OPT-2T01
9899_06_2004_X
315
316
iSCSI Model
SCSI CDBs carried by Fibre
Channel Exchange and Sequences
FC Storage Device
FC Target
Logical
Unit 1
Port
Target
Device Service
Request
Mapping
fc1
iSCSI Target
ge2
Application
Client
requests
data from
LUN 1
Port
Device
Server
Logical
Unit 2
Device
Server
317
iSCSI Stack
SCSI Applications (File Systems, Databases)
SCSI Device-Type
Commands
SCSI Generic
Commands
SCSI Transport
Protocols
iSCSI
SCSI Over TCP/IP
TCP
Layer 3
Network Transport
IP
Layer 2
Network
OPT-2T01
9899_06_2004_X
Fibre Channel
Ethernet
318
iSCSI
iSCSI Packet
461500 bytes
Destination Source
Address
Address
Preamble
8
Type
IP
TCP
Data
FCS
4 Octet
Well-known Ports:
21 FTP
23 Telnet
25 SMTP
80 http
iSCSI
encapsulated
Opcode
3260 iSCSI
Destination Port
Sequence Number
Acknowledgment Number
Offset Reserved U A P R S F
Window
Checksum
Urgent Pointer
TCP Header
OPT-2T01
9899_06_2004_X
319
320
iqn
Type
Date
Unique String
Organization
Subgroup Naming Authority or
Naming Authority String Defined by Organization Naming Authority
iqn.1987-05.com.cisco.1234abcdef987601267da232.betty
iqn.2001-04.com.acme.storage.tape.sys1.xyz
Date = yyyy-mm when
Domain Acquired
eui
Type
Host Name
eui.02004567a425678d
OPT-2T01
9899_06_2004_X
321
OPT-2T01
9899_06_2004_X
322
OPT-2T01
9899_06_2004_X
323
Initiator session ID
This is an initiator-defined session identifier; It will be the
same for all connections within a session; An iSCSI
initiator port is uniquely identified by the value pair (iSCSI
Initiator Name, ISID)
Target session ID
Target assigned tag for a session with a specific named
initiator that, together with the ISID uniquely identifies a
session with that initiator
OPT-2T01
9899_06_2004_X
324
It is a combination of
the following:
Is a combination of the
following:
iSCSI initiator
iSCSI host
OPT-2T01
9899_06_2004_X
Initiatortarget
session (SSID)
325
OPT-2T01
9899_06_2004_X
326
OPT-2T01
9899_06_2004_X
327
Medium-sized networks
Service Location Protocol (SLP multicast discovery)
Large-sized networks
iSNS (internet storage name service)
Includes soft zone domains
Includes database for ongoing management
OPT-2T01
9899_06_2004_X
328
iSCSI Architecture
Network Entity (iSCSI Client)
iSCSI Node
(Initiator)
Network Portal
Network Portal
10.1.30.1
10.1.40.1
Network Portal
Network Portal
10.1.30.2
10.1.40.2
iSCSI Node
iSCSI Node
(Target)
(Target)
329
iSCSI Architecture
IP Network
Network Portal
10.1.30.1
Network Portal
Network Portal
10.1.40.1
10.1.50.1
Portal Group 1
Portal Group 2
OPT-2T01
9899_06_2004_X
330
TCP/3260
TCP/3260
TCP/3260
iSCSI Routing Instance
OPT-2T01
9899_06_2004_X
iSCSI
Storage Router
2004 Cisco Systems, Inc. All rights reserved.
331
OPT-2T01
9899_06_2004_X
332
disk.cisco.com.stor.1
23
iSCSI
Driver,
Storag
e NIC
1.1.1.1
iSCSI
Node
s will
Configuration
iSCSI Initiator
address
Initiator Port
2.2.2.2
ISID
TCP
Connection
IP
SSID
TSID
make the
connections
between storage
and iSCSI Initiator
3.3.3.3
4.4.4.4
5.5.5.5
Target-4
iSCSI Network
Entity-Server
OPT-2T01
9899_06_2004_X
Target-2
Target-3
Target-5
Storage Systems
333
iSCSI Session
OPT-2T01
9899_06_2004_X
335
336
337
iSCSI Sessions
During session establishment, the target identifies
the SCSI initiator port (the I in the I_T nexus)
through the value pair (InitiatorName, ISID)
Any persistent state (e.g., persistent reservations)
on the target associated with a SCSI initiator port is
identified based on this value pair
Any state associated with the SCSI target port (the
T in the I_T nexus) is identified externally by
the TargetName and portal group tag and internally
in an implementation dependent way
OPT-2T01
9899_06_2004_X
338
OPT-2T01
9899_06_2004_X
339
340
iSCSI Security
Two types of security
IPSec secures TCP/IP nodes; setup at TCP/IP startup
before iSCSI login
Session authentication via IKE (Internet Key Exchange)
Packet by packet authentication (also provides Integrity)
Privacy via encryption (also provides Integrity)
See SEC-IPS
iSCSI techniques (done/setup during iSCSI Login)
Authentication (ensures nodes are authorized to use the
iSCSI target node) may use SRP, Chap, or Kerberos
OPT-2T01
9899_06_2004_X
341
Challenge Handshake
Authentication Protocol
In-band initiator-target authentication
IP-SEC is not assumed
No clear text password accepted
Compliant iSCSI initiators and targets MUST
implement the CHAP (RFC1994)
Implementations MUST support use of up to
128 bit random CHAP secrets
OPT-2T01
9899_06_2004_X
342
iSCSI Security
Various levels of security can fit different
topologies
Examples:
Secure main floorNo security
Campus LANiSCSI authentication and CRC32c
(digests)
Remote private WANIPSec with session/packet
authentication
Remote internet WANIPSec with privacy encryption
OPT-2T01
9899_06_2004_X
343
OPT-2T01
9899_06_2004_X
344
Digests (Checksums)
Optional header and data digests protect the integrity of
the header and data, respectively; The digests, if
present, are located, respectively, after the header and
PDU-specific data, and cover the proper data and the
padding bytes
The existence and type of digests are negotiated during
the login phase
The separation of the header and data digests is useful
in iSCSI routing applications, in which only the header
changes when a message is forwarded; In this case,
only the header digest should be recalculated
Digests are not included in data or header length fields
A zero-length data segment also implies a zero-length
data-digest
OPT-2T01
9899_06_2004_X
345
Error Recovery
Two Considerations for Errors
An iSCSI PDU may fail the digest check and be
dropped, despite being received by the TCP layer;
The iSCSI layer must optionally be allowed to
recover such dropped PDUs
A TCP connection may fail at any time during the
data transfer; All the active tasks must optionally
be allowed to be continued on a different TCP
connection within the same session
OPT-2T01
9899_06_2004_X
346
347
348
OPT-2T01
9899_06_2004_X
349
Error Levels
Level determined during logon text negotiation
Error recovery level is proposed by an originator in a
text negotiation
OPT-2T01
9899_06_2004_X
350
iSCSI PROTOCOL
DETAILS IN-DEPTH
OPT-2T01
9899_06_2004_X
351
352
Target to initiator
Ready to transfer (R2T) message to Initiator, requesting data for
a write command
Command responses
Asynchronous messages (SCSI and iSCSI) describing an unusual
or error event
OPT-2T01
9899_06_2004_X
353
iSCSI Numbering
iSCSI uses command and status numbering
Command numberingSession wide and is used for
ordered command delivery over multiple connections
within a session; It can also be used as a mechanism for
command flow control over a session
Status numberingper connection and is used to enable
recovery in case of connection failure
354
OPT-2T01
9899_06_2004_X
355
356
OPT-2T01
9899_06_2004_X
357
358
359
iSCSI PDUs
Several different types of iSCSI PDUs used, each of
the different iSCSI Operation Codes (Opcodes)
determine what iSCSI PDU to use; Some of the
more used PDUs are:
Login and logout PDU
Command and response PDU
Data-In and data-out PDU
OPT-2T01
9899_06_2004_X
360
Initiator ID
for This
Connection
If Set to 1 Indicates
Initiator Is Ready to
Transit to Next Stage
Unique ID
for This
Connection
OPT-2T01
9899_06_2004_X
361
iSCSI Login
Login Phase used to:
Enable TCP connection (Target listens on well known port)
Authentication (CHAP)
Negotiate session parameters
Open security protocols
Mark the TCP connection as a iSCSI session
and assign IDs
OPT-2T01
9899_06_2004_X
362
OPT-2T01
9899_06_2004_X
363
OPT-2T01
9899_06_2004_X
364
365
Data-Out PDU
LUN Number
for Data
Data Segment
Length Based
on Capabilities
Exchange
OPT-2T01
9899_06_2004_X
366
Data-In PDU
Acknowledge Bit
used when error
recovery level is 1
or higher
OPT-2T01
9899_06_2004_X
367
Target may satisfy the single read command with multiple iSCSI
data read PDUs (PDUs can be out-of-order)
Good status can be sent within the last iSCSI data-in PDU
All iSCSI data-in PDUs and the response PDU will be delivered
on the same TCP connection that the command was sent on
All data-in PDUs will carry the same value in the ITT field
OPT-2T01
9899_06_2004_X
368
16 bytes of
SCSI CDB,
R=1 If the
Command Is
Expected to
Input Data
CRC If
Capabilities
Required This
OPT-2T01
9899_06_2004_X
W=1 If the
Command Is
Expected to Output
Data
369
SCSI Status
per SAM2
CRC Check
Sums
OPT-2T01
9899_06_2004_X
370
OPT-2T01
9899_06_2004_X
371
OPT-2T01
9899_06_2004_X
372
Task Management
Functions to provide the initiator a way to control
management of the target device
Abort the TASK
Clear allegiance
Logical reset
Target reset
OPT-2T01
9899_06_2004_X
373
NOP-IN
Sent by a target as a response to a NOP-Out, as a
ping to an initiator
Or a means to carry a changed ExpCmdSN and/or MaxCmdSN
if there is no other PDU to carry them for a long time
NOP-OUT
Used by Initiator as a ping command, to verify that a
connection/session is still active and all its components
are operational
Used to confirm a changed ExpStatSN if there is no other
PDU to carry it for a long time
OPT-2T01
9899_06_2004_X
374
OPT-2T01
9899_06_2004_X
375
Immediate Data
Data Digest
Max Connections
Send Targets
Target Name
Initiator Name
Target Alias
Initiator Alias
Target Address
Session Type
OPT-2T01
9899_06_2004_X
376
OPT-2T01
9899_06_2004_X
377
SIMPLE ISCSI
CONNECTION FLOWS
EXAMPLE OF DISCOVERY
SESSION WITH CHAP
OPT-2T01
9899_06_2004_X
378
iSCSI Flows
Initiator
TCP port 1026
(Random)
Discovery
Session
Target
Establish Initial TCP Session Phase
0X03 CommandLogin
Key Values Are Sent, InitiatorName, InitiatorAlias,
SessionType=Discovery, AuthMethod=CHAP/none,
HeaderDigest, DataDigest
0X03 CommandLogin
Key Values Sent, InitiatorName, InitiatorAlias,
SessionType=Discovery, CHAP_A=5 (CHAP with MD5)
TCP Port
3260
This
Device
Has
Already
Initialized
Onto the
Fibre
Channel
iSCSI Driver
OPT-2T01
9899_06_2004_X
379
iSCSI Flows
Initiator
TCP port 1026
(Random)
Discovery
Session
Target
0X03 CommandLogin
Key Values Are Sent, InitiatorName, InitiatorAlias,
SessionType=Discovery, CHAP_R, CHAP_N
TCP Port
3260
iSCSI Driver
OPT-2T01
9899_06_2004_X
380
iSCSI Flows
Initiator
Target
0X04 Text Command
SendTargets=all
TCP Port
3260
Note the
Addition of
Another TCP
Session
iSCSI Driver
OPT-2T01
9899_06_2004_X
381
iSCSI Flows
Initiator
TCP Port 1027
(Random)
Target
Session #1
Target
0X03 CommandLogin
Key Values are sent, InitiatorName, InitiatorAlias,
SessionType=Normal, TargetName, CHAP_A=5
TCP Port
3260
0X03 CommandLogin
Key Values are sent, InitiatorName, InitiatorAlias,
SessionType=Normal, CHAP_R, CHAP_N
0X03 CommandLogin
iSCSI Driver
OPT-2T01
9899_06_2004_X
382
iSCSI Flows
Initiator
Target
0X23 Login Response
TCP Port
3260
iSCSI Driver
OPT-2T01
9899_06_2004_X
383
FCIP CONCEPTS
OPT-2T01
9899_06_2004_X
384
Agenda
What FCIP Is About
The Standards
Fibre Channel T11 Standards
IETF IPS Working Group Drafts
OPT-2T01
9899_06_2004_X
385
OPT-2T01
9899_06_2004_X
386
FCIP Design
FC
Server
FC Tape
Library
FC Server
FSPF Routing
Backbone
FC Tape
Library
FC Switch
FC Switch
Fiber
Channel
SAN
FCIP
Tunnel
FSPF Routing
Backbone
FC Switch
IP Network
FC Switch
FCIP
Tunnel
Fiber
Channel
SAN
Tunnel
Tunnel Session
Session
FC Switch
FC Switch
FC Switch
FC Switch
IP Services
Available at Aggregated
FC SAN Level
FC
Server
FC
JBOD
OPT-2T01
9899_06_2004_X
FC
Server
FC
JBOD
387
388
OPT-2T01
9899_06_2004_X
389
OPT-2T01
9899_06_2004_X
390
391
IETF FCIP
IETF draft standard that allows IP connectivity
to link Fibre Channel storage area networks
across WANs
Two methods can be used
1) Similar to Cisco STUNNailed up tunnel
2) Similar to DLSWDynamic peering method
We will visit the details of each in later slides
draft-ietf-ips-fcovertcpip
Draft 12 is current, will RFC Jan/Feb 2003
OPT-2T01
9899_06_2004_X
392
FCIP Link
FCIP
FCIP
FC-2
TCP
TCP
FC-2
FC-1
IP
IP
FC-1
FC-0
LINK
LINK
FC-0
PHY
PHY
Key:
SAN
IPInternet Protocol
SAN
TCP/IP
Network
OPT-2T01
9899_06_2004_X
393
FCIP
End-station addressing, address resolution,
message routing, and other fundamental elements
of the network architecture remain unchanged
from the Fibre Channel model, with IP introduced
exclusively as a transport protocol for an
inter-network bridging function
IP is unaware of the Fibre Channel payload and
the fibre channel fabric is unaware of IP
//
Ethernet
Header
IP
TCP FCIP
FCP
SCSI Data
CRC
Checksum
//
OPT-2T01
9899_06_2004_X
394
FCIP
FCIP only supports class 2, class 3, class 4,
and class F frames
No FC primitive signals or primitive
sequences supported
Physical signal sets used by FC ports to indicate
events, i.e. NOS, OLS, LR
OPT-2T01
9899_06_2004_X
395
396
397
FCIP Diagram
FC/FCIP Entity Pair
FC Entity
FCIP Entity
Virtual ISL
VE_Port
VE_Port
FCIP_LEP
FCIP_LEP
DE
TX
Dynamic
CONNECTION
PORT for FCIP
Connections
RX
FCIP Link
TCP
Ports
Non Dynamic
Connections
WKP = 3225
IP Address = 172.16.0.5
TCP
Ports
WKP = 3225
IP Address = 192.168.1.10
TCP Connection
OPT-2T01
9899_06_2004_X
FC Frames in TCP/IP
Class 3 and Class F Can Be
on Separate Ports or Connections
398
FC-BB-2
FCIP
OPT-2T01
9899_06_2004_X
399
FC-BB-2
FCIP
OPT-2T01
9899_06_2004_X
400
OPT-2T01
9899_06_2004_X
401
MIBs
draft-ietf-ips-scsi-mib
draft-ietf-ips-fcmgmt-mib
draft-ietf-ips-fcip-mib
FC-BB
Published ANSI project being superseded by BB-2
OPT-2T01
9899_06_2004_X
402
OPT-2T01
9899_06_2004_X
403
OPT-2T01
9899_06_2004_X
404
OPT-2T01
9899_06_2004_X
405
OPT-2T01
9899_06_2004_X
406
VE_Port
FCIP_LEP
DE
DE
TCP
Ports
WKP = 3225
IP Address = 192.168.1.10
TCP/IP Network Interface
FCIP Link
Class F
OPT-2T01
9899_06_2004_X
Class 3
407
408
OPT-2T01
9899_06_2004_X
409
OPT-2T01
9899_06_2004_X
410
OPT-2T01
9899_06_2004_X
411
412
Initialization of Port
B_Port
E_Port
Link initialization
Link initialization
Exchange link
parameters
Link reset
Exchange switch
capabilities
Reset link
Exchange fabric
parameters
Assign domain IDs
Establish routes
Merge zones if required
OPT-2T01
9899_06_2004_X
413
B_Port or E_Port
on FCIP Device
These Are All Special Ordered
Sets of 8B/10B Coding
LF
NOS
LF
OLS
LR = Link Reset
OL
LR
LR
LRR
LR
Idle
AC
Idle
AC
OPT-2T01
9899_06_2004_X
414
Link Capture
E_Port on
Switch
B_Port or E_Port
on FCIP Device
NOS
LR
IDLE
R-RDY
IDLE
IDLE
LR & LRR to
Initialize Flow
Control
Parameters Per
FC-PH
IDLE
R-RDY
IDLE
LRR
OPT-2T01
9899_06_2004_X
415
ISL E_Port
If It Is an E_Port FCIP Device or If the B_Port Is Now
up the Switch to Switch Exchange Continues
OPT-2T01
9899_06_2004_X
416
ELP Data
Bit 15 of flag
will be a 1 for
B_Port
RA_TOV is fabric
wide timer, ED_TOV
is per Link
Class 2 & 3
supported
OPT-2T01
9899_06_2004_X
417
FCIP E-Port
FC Switch
Exchange Fabric
Parameters
FC SAN
Exchange Fabric
Parameters
ESC
ESC
ESC
FCIP B-Port
FCIP B-Port
VB - Port
FCIP E-Port
7200 w/ PA-FC-1G
FC Switch
FCIP E-Port
7200 w/ PA-FC-1G
FC Switch
IP
Network
FC SAN
B Port Operation
FC Switch
IP
Network
FC SAN
E Port Operation
FCIP E-Port
Exchange FCIP-Link
Parameters
FC SAN
Exchange Link Parameters
418
OPT-2T01
9899_06_2004_X
419
Comparisons
OPT-2T01
9899_06_2004_X
420
Dynamic
Discovery of FCIP entities using SLPv2
OPT-2T01
9899_06_2004_X
421
422
OPT-2T01
9899_06_2004_X
423
Sending Side
First frame sent after TCP connection is established
Sending side waits for FSF echo (90 seconds)
Echo is match or non-match (Non-match terminates
TCP connection)
Creation of FCIP_LEP and FCIP_DE
Inform FC Entity of connection and usage flags
OPT-2T01
9899_06_2004_X
424
425
426
proto
0x01
version
0x01
~proto ~version
0xFE
0xFE
proto
0x01
version
0x01
~proto ~version
0xFE
0xFE
2
pFlags
Ok WWN1, I
Am WWN2
Lets Setup the
Connection
0x00
~pFlags
0x00
~Flags ~Frame
Frame
Len 0x12 0x3F Len 0x3ED
Flags
0x00
4/5
Timestamp integer/fraction
6
CRC (Reserved in FCIP)
0x00-00-00-00
7
8/9
10/11
FCIP Device
Fibre Channel
Fibre Channel
IP WAN
Reserved
0xFFFF
12/13
14
15/16
FC
Reserved
0x0000
17
Conn
Usage
flags
0x00
Connection
usage code
FC
Reserved
0xFFFF
427
pFlag Breakdown
OPT-2T01
9899_06_2004_X
428
Ones Compliment
for Synchronization
and Error Checking
FCIP header
used after FSF
exchange is
completed
OPT-2T01
9899_06_2004_X
429
Connection Options
TCP selective acknowledgement (SACK)
Per RFC 2883
OPT-2T01
9899_06_2004_X
430
431
432
433
434
Timestamps
TS are the responsibility of the FC entity
This allows transit through the FCIP entity to be
included in the measurement
This transit time should be well below R_A_TOV
OPT-2T01
9899_06_2004_X
435
Buffer Credits
Fibre channel buffer credit methods do not change
R_Rdys will be used to control flow coming from
FC switch on a per link basis
Buffer credit establishment is determined at FLOGI
Mechanisms to control flow of R_Rdys to
FC switch based on TCP/IP congestion is
per FCIP solution
FC switches do not require extended credit
methods
OPT-2T01
9899_06_2004_X
436
Error Recovery
Errors on FC side of local B_Port are not
forwarded over the IP network; Issues such as
loss of sync or a FC encapsulation error will not
be set to the FC entity
Errors on IP side are handled by TCP and frame is
dropped if checksum is in error
OPT-2T01
9899_06_2004_X
437
Summary
FCIP is the standards approach to connect Fibre
Channel ISLs over TCP/IP LAN/WAN connections
State of draft wording will most likely stay as it is
worded today
Security, network delay and error recovery will be
biggest concerns
No shipping product today conforms to the
proposed FCIP draft
Cisco will have several platforms supporting
FCIP solutions
OPT-2T01
9899_06_2004_X
438
INTERNET FIBRE
CHANNEL PROTOCOL
OPT-2T01
9899_06_2004_X
439
iFCP
TCP
FC-1
FC-0
TCP
FC-1
FC-0
IP
IP
LINK
LINK
PHY
PHY
Gateway Region
Gateway Region
IP Network
440
IP Network
IFCP
Gateway Region Gateway N_port-to-N_port session
IFCP
Gateway
Gateway Region
Client-Server architecture
iSNS functions:
Device Discovery and
fabric management
iSNS
Queries
IFCP
Gateway Region Gateway
iSNS
N_port-to-N_port session
iSNS
Queries
IFCP
Gateway
Gateway Region
441
OPT-2T01
9899_06_2004_X
442
OPT-2T01
9899_06_2004_X
443
OPT-2T01
9899_06_2004_X
444
iSNS
iS
NS
pl
re
y/
r
ue
TCP/IP
IFCP
Gateway
IFCP
Gateway
Remote GW IP
FC
NS
FC
Re
pl
NS
Re
qu
es
t
1) The N_Port
Issues a
NS Query
OPT-2T01
9899_06_2004_X
FC_ID = y.y.y
4) The Gateway
Sends Back to the
N_Port the NS
Reply (for FC_ID z.z.z)
445
Remote GW IP
Dest N_Port ID (x.x.x)
Local N_Port alias (w.w.w)
sid x.x.x
IFCP
Gateway
TCP/IP
IFCP
Gateway
4) The Receiving GW
Rewrites the S_ID of
the Incoming Request
P
si logi
d
x. I did
x.
x z.z
.z
1) The N_Port
Ia
PLOGI to
D_ID z.z.z
Remote GW IP
Dest N_Port ID (y.y.y)
Local N_Port alias (z.z.z)
FC_ID = y.y.y
446
OPT-2T01
9899_06_2004_X
447
Discovery Approach
Deploy and Interoperate in Three Stages:
1. Naming and static configuration
Configure both targets and initiators
Use SendTargets to reduce initiator config
OPT-2T01
9899_06_2004_X
448
OPT-2T01
9899_06_2004_X
449
Scaling requirements
Zero-configuration, no servers in small environments
Reduce or eliminate multicast in medium environments
Interoperate with LDAP/iSNS in large environments
OPT-2T01
9899_06_2004_X
450
UA
query/
response
SA
register
services
services
services
register
DA
OPT-2T01
9899_06_2004_X
451
SLP
UA
iSCSI
Initiator
TCP/IP
SLP
DA
Zero configuration
IP
SLP
SA
iSCSI
Target
SLP Protocol
Management Code
device
OPT-2T01
9899_06_2004_X
Optional
UDP or TCP
Minimize multicast
452
OPT-2T01
9899_06_2004_X
453
SLP Summary
Serverless discovery of targets
Optional, generic DA to scale services
Zero-configuration of hosts
SLP makes careful use of multicast
OPT-2T01
9899_06_2004_X
454
What Is iSNS
iSNS Facilitates Scalable Configuration and
Management of iSCSI, iFCP and Fibre Channel (FCP)
Storage Devices in an IP Network, By Providing a Set
of Services comparable to that Available in Fibre
Channel Networks
http://www.ietf.org/internet-drafts/draft-ietf-ips-isns-22.txt
OPT-2T01
9899_06_2004_X
455
iSNS Functions
There Are Four Main Functions of the iSNS:
1. A name server providing storage resource
discovery
2. Discovery Domain (DD) and login control service
3. State change notification service
4. Open Mapping of Fibre Channel and iSCSI devices
OPT-2T01
9899_06_2004_X
456
FC
2
2
FC
457
OPT-2T01
9899_06_2004_X
458
iSNS Components
iSNS protocol (iSNSP)
A flexible and lightweight protocol that specifies how iSNS
clients and servers communicate
459
iSNS Components
iSNS client
The iSNS client is located within storage system and talk to the
iSNS server using the iSNSP within its configured device domain;
client can belong to one or more DDs; iSNS client registers its
attributes with the iSNS server and receives notices of changes
within the domain
iSNS database
The iSNS database is the information repository for the iSNS
server; it maintains information about iSNS clients attributes; a
directory-enabled implementation of iSNS may store client
attributes in an LDAP directory infrastructure
iSNS server
iSNS servers respond to iSNS protocol queries and requests,
and initiate iSNS protocol state change notifications; properly
authenticated information submitted by a registration request is
stored in an iSNS database; listens on port 3205
OPT-2T01
9899_06_2004_X
460
461
SCN Types
Regular registrations
This type of SCN is used within a DD; The discovery
domain will control where the SCN message will go
Management registrations
Used by control nodes and can travel outside the DD
from which they came
Can be TCP or UDP messaging
(Most implementations only using TCP for now)
OPT-2T01
9899_06_2004_X
462
OPT-2T01
9899_06_2004_X
463
OPT-2T01
9899_06_2004_X
464
465
iSNSP Header
iSNSP VersionC the Current Version is 0x0001; All Other Values Are RESERVED
iSNSP Function IDDefines the Type of iSNS Message and the Operation to Be Executed; iSNSP
PDU LengthSpecifies the Length of the PDU PAYLOAD Field in bytes; The PDU Payload Contains
Attributes for the Operation
iSNSP FlagsIndicates Additional Information About the Message and the Type of Network Entity
That Generated the Message
iSNSP Transaction IDMUST Be Set to a Unique Value for Each Concurrently Outstanding
Request Message; Replies MUST Use the same TRANSACTION ID Value as the Associated iSNS
Request Message
iSNSP Sequence IDThe SEQUENCE ID Has a Unique Value for Each PDU Within a Single Transaction
iSNSP PDU PayloadThe iSNSP PDU PAYLOAD Is Variable Length and Contains Attributes Used for
Registration and Query Operations
Authentication BlockFor iSNS Multicast and Broadcast Messages, the iSNSP Provides
Authentication Capability; The iSNS Authentication Block Is Identical in Format to the SLP
Authentication Block
OPT-2T01
9899_06_2004_X
466
OPT-2T01
9899_06_2004_X
467
OPT-2T01
9899_06_2004_X
468
469
Name of Port on
the IP Gateway
Entity
IP Address of
Portal to Log to
and Ask for This
target
OPT-2T01
9899_06_2004_X
470
OPT-2T01
9899_06_2004_X
471
OPT-2T01
9899_06_2004_X
472
OPT-2T01
9899_06_2004_X
473
474
OPT-2T01
9899_06_2004_X
475
OPT-2T01
9899_06_2004_X
476
OPT-2T01
9899_06_2004_X
477
478
OPT-2T01
9899_06_2004_X
479
FC Test Vendors
Leaders in dedicated hardware tools:
Finisar (www.finisar.com)
Xyratex (www.xyratex.com)
Aglient (www.agilent.com)
I-Tech (www.I-tech.com)
Ancot (http://www.ancot.com/)
Spirent/Netcom systems
(www.netcomsystems.com)
OPT-2T01
9899_06_2004_X
480
I/O zone
http://www.iozone.org/
SCSI tools
http://scsitools.com/
481
Windows Tools
iSCSI Driver debug helpers
Windows debug utilities
http://www.osr.com/resources_downloads.shtml
http://www.sysinternals.com/
OPT-2T01
9899_06_2004_X
482
IP: GiGE
GiGE testers $$$
Agilent
Sniffer
Fluke
Finisar/Shomiti
iSCSI decodes just becoming available on most tools
483
iSCSI Decoding
Software only analyzers like Ethereal
(www.ethereal.com)
Hardware analyzers
Can use monitor command on Cisco switches to
span the iSCSI GiGE port to a 10/100
OPT-2T01
9899_06_2004_X
484
Available Certifications
485
ARCHITECTURAL DESIGN OF
STORAGE AREA NETWORKS
OPT-2T01
9899_06_2004_X
486
Section Agenda
Introduction
Hierarchy
Modularity
Architecture Examples
OPT-2T01
9899_06_2004_X
487
INTRODUCTION
OPT-2T01
9899_06_2004_X
488
489
OPT-2T01
9899_06_2004_X
490
OPT-2T01
9899_06_2004_X
To Meet Mission-Critical
Business Objectives,
Applications Need to
Be Consistently Up,
Available, and
High-Performing
491
Architecture:
Hierarchy, Modularity, and Domains
Hierarchy
Modularity
Domains
OPT-2T01
9899_06_2004_X
492
Access
Focus of This
Discussion
Distribution
Building Blocks
Core
Backbone
Application
Servers
Enterprise
Storage
OPT-2T01
9899_06_2004_X
WAN
Internet
PSTN
493
Hierarchy
Predictable performance
Scaleable design
Fault isolation
Modularity
Cost-effective
Repeatable
Domain
Unified
Storage
Mgmt
Reliability
Security
OPT-2T01
9899_06_2004_X
Shared Storage
2004 Cisco Systems, Inc. All rights reserved.
494
HIERARCHY
OPT-2T01
9899_06_2004_X
495
Logical hierarchy
Virtual SANs
Zoning
Enhances physical
hierarchy
Physical Architecture
Virtual SAN A
Zone 1
Zone 2
H1
H2
H1
Zone 3
D1
H3
H2
Zone 4
D2
Logical Architecture
OPT-2T01
9899_06_2004_X
Virtual SAN B
Zone 1
Zone 2
D1
H7
D7
Logical Architecture
496
Hierarchy: Physical
Consolidated Storage Network
Cost-effective solution
Benefits of consolidation
iSCSI
iSCSI
Limited scalability
Small to medium business
Expansion can be
disruptive
OPT-2T01
9899_06_2004_X
497
Hierarchy: Physical
Collapsed Core Architecture
Collapsed core
iSCSI
iSCSI
High performance
Multiple unequal paths
Better scalability
Medium to large
enterprise
ISLs can limit scalability
Redundant
Mesh topology
Network survives some
double faults
OPT-2T01
9899_06_2004_X
498
Hierarchy: Physical
Core Edge Architecture
CoreEdge
iSCSI
iSCSI
High performance
Load balancing
Consistent hop count
Good scalability
Large to very large enterprise
Non-disruptive expansion
499
Hierarchy: Physical
Oversubscription
To be expected in storage networks
Typically lower factors than we see in LANs
Architecture should be flexible to accommodate
differing requirements for various hosts and
storage subsystems
Bandwidth can be modified non-disruptively by
using port channels between switches
Take into account any inherent over subscription
in networking hardware
Use actual anticipated throughput rather than link
speed for calculating bandwidth requirements
OPT-2T01
9899_06_2004_X
500
Hierarchy: Physical
Inter-Switch Links
Inter-Switch LinkISL
iSCSI
iSCSI
Port Channel
Multiple FC ISLs combined to
form a single aggregated trunk
All links in a Port Channel
must be directly connected
to the same two switches
Individual link state
changes do not cause ISL
trunk state changes
OPT-2T01
9899_06_2004_X
ISL
Port
Channel
501
Hierarchy: Physical
Scalability
Oversubscription
Higher OS acceptable 15:1
OS
for some hosts
Lower OS for High
performance hosts
and storage devices
Consider impact of multipath load balancing
Determine acceptable
worst case in various
failure scenarios
Can be non-disruptively
changed by adding/
removing links to port
channels
OPT-2T01
9899_06_2004_X
iSCSI
iSCSI
8:1
OS
1:1
OS
Core
4x2Gb
ISL
8x2Gb
ISL
3:1
OS
502
Hierarchy: Logical
Virtual SANs
VSANs provide a means to build a logical structure
on top of a physical SAN
Similar to how VLANs are used to scale ethernet
networks VSANs help scale Fibre Channel networks
Topology changes are isolated within the VSAN
therefore adds, moves, and changes are not
disruptive to other VSANs
VSANs can be utilized to establish administrative
domains
Zoning provides an additional access control
mechanism within each VSAN
OPT-2T01
9899_06_2004_X
503
Hierarchy: Logical
Logical Architecture
Virtual SANs
iSCSI
iSCSI
Services scalability
Independent Fibre Channel
services for each VSAN
Zoning is per VSAN
EISL
Port
Channel
Failure domain
Faults contained within
VSAN
OPT-2T01
9899_06_2004_X
504
Hierarchy: Logical
Maximizing VSAN Architecture
Isolate multiple paths
into separate VSANs
iSCSI
iSCSI
Independent FC services
per VSAN
Provides complete traffic
isolation between
redundant paths
Each VSAN converges
independently for faster
recovery and improved
fault isolation
OPT-2T01
9899_06_2004_X
505
OPT-2T01
9899_06_2004_X
iSCSI
506
MODULARITY
OPT-2T01
9899_06_2004_X
507
508
iSCSI
iSCSI
Fiber
Channel
Core
Functional Building Blocks
Provide Scalability with
Deterministic Performance
Storage
Modules
OPT-2T01
9899_06_2004_X
509
OPT-2T01
9899_06_2004_X
510
Department/
Customer A
Department/
Customer B
VSAN-Enabled
Fabric
Security
VSAN
Trunks
Scalability
Replicated fabric services
Thousands of VSANs per storage
network
Management
Roll Based Access ControlRBAC
Provides administrative boundaries
OPT-2T01
9899_06_2004_X
Shared Storage
511
Modularity:
Storage Intelligence and VSANs
Dept 1
VSAN
Dept 2
VSAN
Dept 3
VSAN
Virtualization
OPT-2T01
9899_06_2004_X
Virtualization allows
physical storage to be in
its own VSANs, separate
from the host VSANs.
VSANs provide
Secure isolation of physical storage
Easier configuration
Dynamic configuration of fabrics
Role-based access control
512
ARCHITECTURE EXAMPLES
OPT-2T01
9899_06_2004_X
513
Architecture: iSCSI
Scalability
Less expensive
alternative for host not
requiring 2Gbps
Applications
Host Services
Block Device
SCSI Generic
File System
Network File
System
TCP/IP
Stack
NIC
Driver
iSCSI
Driver
iSCSI
Driver
TCP/IP
Stack
NIC
Driver
TCP/IP
Stack
NIC
Driver
Adapter
Driver
SCSI Adapter
TOE
514
Redundant iSCSI to
Fibre Channel Connections
and Services
Application
Multipathing
iSCSI Driver
OPT-2T01
9899_06_2004_X
515
TACACS
+ Server
User1/pwd1
User2/pwd2
/
User1/pwd1
User2/pwd2
/
RADIUS TACACS+
CHAP
Local
iSCSI
Storage
(Targets)
FC Fabric
iSCSI Services
OPT-2T01
9899_06_2004_X
516
Clients
iSCSI
OPT-2T01
9899_06_2004_X
iSCSI
iSCSI
iSCSI-enabled
Hosts
Ethernet
Switches
IP
Storage
Network
iSCSI
iSCSI
Services
FC
Fabric
FC Attached
Hosts with
HBAs
Storage
Pool
517
FCIP
FC
DWDM
FC
SONET/SDH
OPT-2T01
9899_06_2004_X
518
Architecture:
High Availability for SAN Extension: FC
Utilize disparate paths and portchannel for high availability
Utilize VSANs to limit the failure domain in the event of lost
connectivity
FC
Fabric
A
Fabric
B
CWDM
FC
Fabric
A
Fabric
B
DWDM
PortChannel
SONET/SDH
Both fabrics remain connected if one of the paths fails
Use of portchannel prevents state change on link failure
OPT-2T01
9899_06_2004_X
519
Architecture:
High Availability for SAN Extension: FCIP
Utilize disparate paths and portchannel for high availability
Utilize VSANs to limit the failure domain in the event of lost
connectivity
Recommend not using etherchannels
FCIP
Fabric
A
Fabric
B
IP
WAN
IP
WAN
PortChannel
FCIP
Fabric
A
Fabric
B
PortChannel
520
Architecture:
Legacy Storage Implementation
Campus Clients
Remote Clients
Internet Clients
Storage is captive
behind applications
Inefficient
allocation of
storage resources
Multiple
administrative
domains
LAN Core
Backbone
Application
Servers
SAN
Islands
Captive Storage
Blocks
OPT-2T01
9899_06_2004_X
521
Architecture:
Factors for Determining Architecture
Current size and anticipated growth for both
application servers and storage elements
Baseline performance requirements for servers
and storage
Business continuance requirementsSAN
extension
Administrative domains
Migration plans
Interoperability considerations
Costs
OPT-2T01
9899_06_2004_X
522
Application
Servers
iSCSI
Some scalability
especially with iSCSI
Redundant paths
Unified
Achieves
Storage
Mgmt
economical
storage consolidation
VSANs can add
scalability and
management benefits
OPT-2T01
9899_06_2004_X
Shared
Storage
523
Application
Servers
iSCSI
Unified
Storage
Mgmt
Highly scalable
Highly redundant
Highly modular
Shared
Storage
2004 Cisco Systems, Inc. All rights reserved.
524
Cost efficiencies
Consolidated storage, central management, and leveraged resources
Security
Limited domains, RBAC management, and consistent architecture
525
iSCSI
iSCSI
iSCSI
iSCSI
FC
FC
FC
FC
FC
FC
FC
FC
iSCSI
FC
FC
FC
FC
FC
FC
iSCSI
iSCSI
iSCSIFCiSCSI
FC
FC
iSCSI
FC
iSCSIEnabled
Storage
Network
Ethernet
Switches
Multiprotocol/Multiservice
SONET Network
SONET Network
FC
FC
FC
FCIP
Remote
Storage
Access
Optical Network
Resilient Optical
Transport Networks
Synchronous ReplicationOptical (FCIP/FC)
FC
FC
OPT-2T01
9899_06_2004_X
FC
FC
FC
Intelligent Workgroup
Storage Networks
2004 Cisco Systems, Inc. All rights reserved.
FC
FC
FC
FC
FC
526
Q&A
OPT-2T01
9899_06_2004_X
527
WHY:
OPT-2T01
9899_06_2004_X
528
OPT-2T01
9899_06_2004_X
529
EXTRAS
OPT-2T01
9899_06_2004_X
530
FC LOOP OPERATIONS
OPT-2T01
9899_06_2004_X
531
Port
RX
1.
2.
3.
4.
AL_PA B2
TX
Port
IDLE
TX
Port
AL_PA 01
RX
RX
TX
IDLE
AL_PA EF
IDLE
IDLE
Port
RX
OPT-2T01
9899_06_2004_X
TX
532
Port
RX
1.
2.
3.
AL_PA B2
TX
Port
IDLE
TX
Port
AL_PA 01
RX
RX
TX
ARB(01)
AL_PA EF
IDLE
IDLE
Port
RX
OPT-2T01
9899_06_2004_X
TX
533
Port
ARB(01)
RX
AL_PA B2
TX
Port
2.
Port
AL_PA 01
ARB(01)
TX
RX
TX
1.
RX
AL_PA EF
IDLE
ARB(01)
Port
RX
TX
When a Port Discards Rx Fill Words and Transmits the CFW this Allows the Port to
Compensate for Clock Differences Between Rx Data Stream and Tx Data Stream
OPT-2T01
9899_06_2004_X
534
Port
ARB(F0)
RX
OPN
RX
Port
3.
AL_PA B2
TX
2.
Port
AL_PA 01
TX
1.
TX
AL_PA EF
ARB(01)
Port
RX
OPT-2T01
9899_06_2004_X
TX
535
Port
IDLE
RX
2.
3.
AL_PA B2
TX
Port
1.
Port
AL_PA 01
ARB(F0)
TX
RX
TX
RX
AL_PA EF
ARB(F0)
ARB(F0)
Port
RX
OPT-2T01
9899_06_2004_X
TX
536
Port
IDLE
1.
RX
AL_PA B2
TX
Port
3.
Port
AL_PA 01
IDLE
TX
RX
TX
2.
RX
AL_PA EF
ARB(F0)
IDLE
Port
RX
TX
537
Port
ARB(01)
RX
Port
AL_PA B2
TX
2.
Port
AL_PA 01
IDLE
TX
RX
TX
1.
RX
AL_PA EF
IDLE
ARB(B2)
Port
RX
OPT-2T01
9899_06_2004_X
TX
538
Port
ARB(01)
RX
Port
AL_PA B2
TX
2.
Port
AL_PA 01
ARB(01)
TX
RX
TX
1.
RX
AL_PA EF
ARB(B2)
ARB(B2)
Port
RX
OPT-2T01
9899_06_2004_X
TX
539
Port
ARB(01)
RX
ARB(01)
TX
RX
AL_PA B2
TX
Port
Port
AL_PA 01
RX
TX
1.
AL_PA EF
ARB(B2)
ARB(01)
Port
RX
TX
540
Port
ARB(F0)
and OPN
3.
RX
2.
ARB(01)
ARB(01)
Port
RX
OPT-2T01
9899_06_2004_X
AL_PA B2
TX
Port
AL_PA 01
ARB(01)
TX
RX
TX
1.
RX
TX
541
Port
ARB(F0)
ARB(F0)
TX
RX
2.
AL_PA B2
TX
Port
AL_PA 01
Port
RX
TX
1.
RX
AL_PA EF
ARB(01)
ARB(B2)
Port
RX
OPT-2T01
9899_06_2004_X
TX
542
Port
1.
2.
RX
3.
AL_PA B2
TX
Port
ARB(F0)
TX
Port
AL_PA 01
RX
RX
TX
ARB(F0)
AL_PA EF
ARB(B2)
ARB(B2)
Port
RX
TX
543
Port
IDLE
RX
IDLE
TX
RX
TX
2.
AL_PA B2
TX
Port
AL_PA 01
Port
RX
1.
AL_PA EF
IDLE
ARB(B2)
Port
RX
OPT-2T01
9899_06_2004_X
TX
544
Port
IDLE
IDLE
TX
RX
AL_PA B2
TX
Port
2.
Port
AL_PA 01
RX
TX
1.
RX
AL_PA EF
ARB(B2)
ARB(B2)
Port
RX
OPT-2T01
9899_06_2004_X
TX
545
Port
ARB(B2)
IDLE
TX
RX
AL_PA B2
TX
Port
2.
Port
AL_PA 01
RX
TX
1.
RX
AL_PA EF
ARB(B2)
ARB(B2)
Port
RX
OPT-2T01
9899_06_2004_X
TX
546
ARB(01)
Port
ARB(01)
1.
TX
RX
4.
AL_PA B2
TX
Port
3.
RX
Port
ARB(B2)
XX
TX
2.
AL_PA 01
RX
AL_PA EF
ARB(B2)
ARB(B2)
Port
RX
OPT-2T01
9899_06_2004_X
TX
547
Port
ARB(01)
AL_PA B2
RX
TX
Port
2.
ARB(B2)
Port
AL_PA 01
TX
RX
TX
1.
RX
ARB(01)
AL_PA EF
ARB(B2)
ARB(F0)
and OPN
Port
RX
OPT-2T01
9899_06_2004_X
TX
548
Port
RX
1.
2.
3.
4.
AL_PA B2
TX
Port
ARB(01)
TX
Port
AL_PA 01
RX
RX
TX
ARB(01)
AL_PA EF
ARB(F0)
ARB(F0)
Port
RX
OPT-2T01
9899_06_2004_X
TX
549