Professional Documents
Culture Documents
@CCIE21921
Session Objectives
• Focus on Data Center Networks and
Fabrics with Overlays
• Closer Look on Packet Encapsulation
(VXLAN)
• Encapsulation and Forwarding
• Underlay – the Transport for the Overlay
6
Introduction to Data
Center Fabrics
7
Data Center “Fabric” Journey (Standalone)
STP
VPC
FabricPath
VXLAN
MAN/WA
N
FabricPath VXLAN
/BGP /EVPN
MAN/WA MAN/WA
N N
Data Center Fabric Properties
Extended Namespace
Scalable Layer-2 Domains
Integrated Route and Bridge
Multi-Tenancy
Hybrid Overlays
Inter-Pod connectivity
Overlay Based Data Center Fabrics
RR RR • Desirable Attributes:
• Mobility
• Segmentation
• Scale
• Automated & Programmable
• Abstracted consumption models
• Full Cross Sectional Bandwidth
• Layer-2 + Layer-3 Connectivity
• Physical + Virtual
10
Overlay Based Data Center: Edge Devices
Network Overlays Host Overlays Hybrid Overlays
V
V
V
V
V
V
15
Agenda
16
Overview
Classic Ethernet IEEE 802.1Q
Classic Ethernet
Frame Format Frame
DMAC SMAC 802.1Q Etype Payload CRC
Data (Payload)
CRC/FCS
VLAN ID
12 bits
TPID = Tag Protocol Identifier, TCI = Tag Control Information, PCP = Priority Code Point,
CFI = Canonical Format Indicator, VID = VLAN Identifier
Overview
Introducing VXLAN
• Traditionally VLAN is expressed
over 12 bits (802.1Q tag)
• Limits the maximum number of
segments in a Data Center to 4096
VLANs
• VXLAN leverages the VNI field with Classical Ethernet Frame DMAC SMAC 802.1Q Etype Payload CRC
Cisco DFA
Frame
•
8 bits 24 bits 24 bits 8 bits
The VXLAN Network Identifier ags Reserved VNIVNI Reserved
Underlay
0x0800 Header
16 20 Bytes
Checksum
Source IP 32
50 (54) Bytes of Overhead
Overlay
VXLAN Flags
8
RRRRIRRR
Original Layer-2 Frame Allows for 16M
possible
Reserved 24
Segments
8 Bytes
VNI 24
Reserved 8
Data Center Fabric Properties
Extended Namespace
Scalable Layer-2 Domains
Integrated Route and Bridge
Multi-Tenancy
Understanding Overlay Technologies
Overlay Services
• Layer 2 Underlay Transport
Tunnel Encapsulation
• Layer 3 Network
• Layer 2 and Layer 3
22
VXLAN Taxonomy (1)
Edge Device
Local LAN
Edge Device Segment
Edge Device
VTEP
VTEP
Local LAN
VTEP Segment
VTEP
VTEP
Control-
EVPN MP-BGP - RFC 7432
Plane
ID Title Category
28
VXLAN Evolution
• Multi-Protocol BGP (MP-BGP) based Control-Plane using
Protocol Learning
EVPN NLRI (Network Layer Reachability Information)
• Workload MAC and IP • Make Forwarding decisions at VTEPs for Layer-2 (MAC)
Addresses learnt by VXLAN and Layer-3 (IP); Integrated Route/Bridge (IRB)
Edge Devices (NVEs)
• Advertises Layer-2 and • Reduce Flooding
Layer-3 Address-to-VTEP
Association (Overlay • Reduce impact of ARP on the Network
Control-Plane)
• Flood Prevention • Standards Based (IETF draft)
• Optimized ARP forwarding
VXLAN Evolution
• Forward based on MAC or IP address learnt via Control-
IP Services
Plane (MP-BGP EVPN)
• VXLAN Routing • Make routing decisions at VTEPs
• Distributed Anycast
Gateway (requires Overlay • Scale and Multipathing (ECMP)
Control-Plane)
• Multi-Tenancy
• Leverage Layer-3 Gateway capabilities along with
Protocol Information
• LISP-ish / LISP-like approach for Host/IP Mobility
• Location (VTEP), Identifier (MAC, IP of End-Host)
Getting the Puzzle Together!
Optimized Networks with VXLAN
Underlay
Overlay
Integrated (VXLAN)
Route/Bridge
BGP
(EVPN)
http://packetpushers.net/show-233-cisco-nexus-using-bgp-
as-a-vxlan-control-plane-sponsored/
http://www.slideshare.net/robboyd/techwisetv-
workshop-secrets-of-scalable-multitenancy
http://blogs.cisco.com/cin/network-like-its-1999-with-bgp-evpn http://blogs.cisco.com/datacenter/vxlanevpn-
standards-based-overlay-with-control-plane
32
Agenda
33
Deployment Considerations
• MTU and Overlays
• Unicast Routing Protocol and IP
Addressing
• Multicast for BUM* Traffic
Replication
Underlay
• Avoid Fragmentation by adjusting
the IP Networks MTU
50 (54) Bytes of Overhead
Outer IP Header
*Cisco Nexus 5600/6000 switches only support 9192 Byte for Layer-3 Traffic
Building your IP Network – Interface Principles (1)
• Know your IP addressing and IP Rendezvous-Point Loopback
10.254.254.1
scale requirements Routing Loopback
10.10.10.203/32
V
• Best to use individual Aggregates
for the Underlay V
• Unicast Routing p2p** Links p2p Links V
10.1.1.1/30
• Unicast Routing Loopbacks V
• VTEP (NVE) Loopback V p2p Agg: 10.1.1.0/24
• Multicast Routing Loopback (RP) RID Agg: 10.10.10.0/24
Routing Loopback V VTEP Agg: 10.200.200.0/24
10.10.10.101/32
RP Agg: 10.254.254.0/24
• IPv4 only (today) VTEP Loopback
10.200.200.101/32
*RP: Rendezvous-Point (Multicast)
**p2p: Point-to-Point
Building your IP Network – Interface Principles (2)
• Routed Ports/Interfaces
• Layer-3 Interfaces between Spine and
Leaf (no switchport)
• For each Point-2-Point (P2P)
connection, minimum /31 required
V
• Alternative, use IP Unnumbered (/32)
768 IP Addresses required == /22 Prefix *RID: Router ID; Unicast Routing Loopback
IP Unnumbered– Simplifying the Math
Multicast Mode IGMP v2/v3 PIM ASM PIM BiDir PIM ASM / PIM BiDir PIM ASM PIM BiDir PIM ASM / PIM BiDir
48
Agenda
49
Multiprotocol BGP (MP-BGP) Primer
• Multiprotocol BGP (MP-BGP)
• Extension to Border Gateway
RR RR
Protocol (BGP) - RFC 4760
• VPN Address-Family:
• Allows different types of address
families (e.g. VPNv4, VPNv6, L2VPN V2
EVPN, MVPN) V1
• Information transported across single
BGP peering
RR BGP Route-Reflector
V3 iBGP Peering*
*eBGP supported without BGP Route-Reflector
Multiprotocol BGP (MP-BGP) Primer
• VPN segmentation for tenant routing VRF Info
(Multi-Tenancy) VRF Info Name: VRF-A
Name: VRF-A RD: 15:10.0.0.2 (auto)
• Route Distinguisher (RD) RD: Imp
3:10.0.0.1 (auto) RRRoute-Target
RR 65500:50000 (auto)
Imp Route-Target 65500:50000 Exp Route-Target 65500:50000 (auto)
• 8-byte field of VRF parameters VRF Info (auto)
Exp Route-Target 65500:50000
Name: (auto)VRF-A
• value to make VPN prefix unique: RD: 62:10.0.0.3 (auto)
• RD + VPN prefix Imp Route-Target 65500:50000 (auto)
Exp Route-Target 65500:50000 (auto)
V1 V2
RR BGP Route-Reflector
V3 iBGP Peering
Multiprotocol BGP (MP-BGP) Primer
• Cisco’s VXLAN/EVPN does provide VRF Info
automated Route Distinguisher (RD) VRF Info Name: VRF-A
Name: VRF-A RD: 15:10.0.0.2 (auto)
• Automatic uses Type 1 format RD: Imp
3:10.0.0.1 (auto) RRRoute-Target
RR 65500:50000 (auto)
Imp Route-Target 65500:50000 Exp Route-Target 65500:50000 (auto)
• 4-byte IP Address (Router ID) VRF Info (auto)
Exp Route-Target 65500:50000
Name: (auto)VRF-A
• 4-byte Value (VRF ID) RD: 62:10.0.0.3 (auto)
Imp Route-Target 65500:50000 (auto)
vrf context VRF-A Exp Route-Target 65500:50000 (auto)
vni 50000 V2
rd auto V1
address-family ipv4 unicast
route-target both auto
route-target both auto evpn
address-family ipv6 unicast
route-target both auto RR BGP Route-Reflector
route-target both auto evpn V3 iBGP Peering
Multiprotocol BGP (MP-BGP) Primer BGP Advertisement
VPN-EVPN: RD:[MAC_A][IP_A]
• VPN Segmentation for tenant BGP Next-Hop: V1
Route Target: 65500:50000
routing (Multi-Tenancy) Label (L3VNI): 50000
RR RR
• Selective distribute VPN routes -
Route Target (RT)
• 8-byte field of VRF parameter
MAC_A / IP_A >>
• unique value to define the LOCAL
Route-Type2
import/export rules for VPN prefix V1 V2
MAC_A / IP_A >> V1
Route-Type2
RR BGP Route-Reflector
V3 iBGP Peering
Host A
MAC_A / IP_A
Multiprotocol BGP (MP-BGP) Primer BGP Advertisement
VPN-EVPN: RD:[MAC_A][IP_A]
• Cisco’s VXLAN/EVPN does provide BGP Next-Hop: V1
Route Target: 65500:50000
automated Route Target (RT) Label (L3VNI): 50000
• 8-byte Route Target (2 x 4-byte) RR RR
• ASN : VNI
Host A
MAC_A / IP_A
Overlay with Optimized Routing
EVPN Control Plane -- Host and Subnet Route Distribution
BGP Update
RR RR • Host-MAC
Spine • Host-IP
Border • Internal IP Subnet
• External Prefixes
V
V
V
V
V
V
BGP Adjacencies
Route-Reflectors deployed
RR
for scaling purposes (iBGP)
55
Overlay with Optimized Routing
EVPN Control Plane -- Host and Subnet Route Distribution
BGPMultiprotocol
Scalable Multi-Tenancy with Update BGP
RR RR • Host-MAC
Spine • Host-IP
EVPN Address-Family:
Border Host MAC+IP,• internal/external IP
Internal IP Subnet
Subnets
• External Prefixes
V
BGP enhanced for Fast Convergence at Large Scale
BGP routing table information for VRF default, address family L2VPN EVPN
Route Distinguisher: 10.0.0.1:32868
BGP routing table entry for
[2]:[0]:[0]:[48]:[0050.56a3.c2bb]:[32]:[192.168.1.73]/272,
version 4
Paths: (1 available, best #1)
Flags: (0x000202) on xmit-list, is not in l2rib/evpn, is locked
Advertised path-id 1
L3VNI Path type: internal, path is valid, is best path, no labeled nexthop
AS-Path: NONE, path sourced internal to AS
L2VNI 10.0.0.1 (metric 3) from 10.0.0.111 (10.0.0.111)
Origin IGP, MED not set, localpref 100, weight 0
Received label 30001 50001
Extcommunity: RT:65501:30001 RT:65501:50001 ENCAP:8 Router MAC:5087.89d4.5495
Originator: 10.0.0.1 Cluster list: 10.0.0.111
Remote VTEP Route Target: Route Target: Overlay Encapsulation: Router MAC of
IP Address L2VNI (VLAN) L3VNI (VRF) 8 - VXLAN Remote VTEP
58
Protocol Learning & Distribution
RR RR
MAC, IP L2VNI L3VNI NH MAC, IP L2VNI L3VNI NH
MAC_A, IP_A 30001 50001 local MAC_B, IP_B 30001 50002 local
1
1
1 V2
V1
Host C Host Y
MAC_C / IP_C MAC_Y / IP_Y
Protocol Learning & Distribution
RR RR
MAC, IP L2VNI L3VNI NH MAC, IP L2VNI L3VNI NH
MAC_A, IP_A 30001 50001 local MAC_B, IP_B 30001 50001 local
2
2
2 V2
V1
Host C Host Y
MAC_C / IP_C MAC_Y / IP_Y
Protocol Learning & Distribution
3 3
RR RR
MAC, IP L2VNI L3VNI NH MAC, IP L2VNI L3VNI NH
MAC_A, IP_A 30001 50001 local MAC_B, IP_B 30001 50001 local
MAC_B, IP_B 30001 50001 IP_V2 MAC_A, IP_A 30001 50001 IP_V1
MAC_C, IP_C 30001 50001 IP_V3 MAC_C, IP_C 30001 50001 IP_V3
2
VMAC_Y, IP_Y 30002 50001 IP_V3
MAC_Y, IP_Y 30002 50001
V1IP_V3
3
MAC, IP L2VNI L3VNI NH
Host C Host Y
MAC_C / IP_C MAC_Y / IP_Y
Subnet Route Advertisement
Route MAC, IP L3VNI NH Encap
(“VRF”)
• IP Prefix Redistribution Type
5 Subnet_A/24 50000
50001 IP_V1 8:VXLAN
Prefix, Equal Cost Multipath (ECMP)
5 Subnet_A/24 50001 IP_V2 8:VXLAN
will apply 5 Subnet_A/24 RR
50001 RR IP_V3 8:VXLAN
BGP routing table information for VRF default, address family L2VPN EVPN
Route Distinguisher: 10.0.0.1:3
BGP routing table entry for [5]:[0]:[0]:[24]:[192.168.2.0]:[0.0.0.0]/224, version 3
Paths: (1 available, best #1)
Flags: (0x000002) on xmit-list, is not in l2rib/evpn, is locked
Advertised path-id 1
Path type: internal, path is valid, is best path, no labeled nexthop
AS-Path: NONE, path sourced internal to AS
L3VNI 10.0.0.1 (metric 3) from 10.0.0.111 (10.0.0.111)
Origin incomplete, MED 0, localpref 100, weight 0
Received label 50001
Extcommunity: RT:65501:50001 ENCAP:8 Router MAC:5087.89d4.5495
Originator: 10.0.0.1 Cluster list: 10.0.0.111
65
ARP Suppression
VXLAN/EVPN
RR RR
MAC, IP VNI NH MAC, IP VNI NH
1 2
MAC, IP VNI NH
Host A
MAC_A / IP_A V3 Host B
MAC_B / IP_B
ARP Request for IP_B
Src MAC: MAC_A
Dst MAC: FF:FF:FF:FF:FF:FF
1 ARP Request sent for IP_B sent from Host A
Virtual Switch
4
3
MAC, IP VNI NH
Underlay
DMAC: hop-by-hop DMAC: MAC_V2
Underlay
SIP: IP_V1 SIP: IP_V1
DIP: IP_V2 DIP: IP_V2
Host A
MAC_A / IP_A
UDP V3 UDP Host B
MAC_B / IP_B
VXLAN VNID: 30001 VXLAN VNID: 30001
Overlay
Overlay
MAC_A, IP_A 30001 Local 50001 MAC_A, IP_A 30001 Local 50001
MAC_F, IP_F 30005 IP_V2 50001 MAC_F, IP_F 30005 E1/4 50001
SMAC: MAC_A SMAC: MAC_GW
DMAC: MAC_GW DMAC: MAC_F
SIP: IP_A
V1 V2 SIP: IP_A
DIP: IP_F DIP: IP_F
1
2 3 4
SMAC: MAC_V1 SMAC: hop-by-hop
Underlay
DMAC: hop-by-hop DMAC: MAC_V2
Underlay
SIP: IP_V1 SIP: IP_V1
DIP: IP_V2 DIP: IP_V2
Host A
MAC_A / IP_A
UDP V3 UDP Host F
MAC_F, IP_F
VXLAN VNID: 50001 VXLAN VNID: 50001
Overlay
Overlay
MAC_A, IP_A 30000 Local 50001 MAC_A, IP_A 30000 Local 50001
Underlay
DMAC: hop-by-hop DMAC: MAC_V2
Underlay
SIP: IP_V1 SIP: IP_V1
DIP: IP_V2 DIP: IP_V2
Host A
MAC_A / IP_A
UDP V3 UDP Host F
MAC_F, IP_F
VXLAN VNID: 50001 VXLAN VNID: 50001
Overlay
Overlay
routing methodology
• datagrams sent from a single
sender to the topologically
✖
nearest node
✔
• group of potential receivers, ✖
all identified by the same
destination address
✔
73
Distributed IP Anycast Gateway
• Distributed Inter-VXLAN Routing at
Access Layer (Leaf) RR RR
SVI 200
SVI 100
• Distributed state - Smaller ARP SVI 200
SVI 100
tables SVI 200
SVI 100
• Only local attached End-Points SVI 200
(Servers)
SVI 100, Gateway IP: 192.168.1.1, Gateway MAC: AG:AG:AG:AG:AG:AG
SVI 200, Gateway IP: 10.10.10.1, Gateway MAC: AG:AG:AG:AG:AG:AG
Distributed IP Anycast Gateway
RR RR
Spine
bridge SVI 100, Gateway IP: 192.168.1.1
SVI 200, Gateway IP: 10.10.10.1
route V
route
SVI 100
Host3
V MAC: CC:CC:CC:CC:CC:CC
V SVI 200 IP: 192.168.1.33
VLAN 100
V VXLAN VNI 30001
V Host2
MAC: BB:BB:BB:BB:BB:BB
V IP: 10.10.10.22
SVI 100 VLAN 200
Host1 VXLAN VNI 30002
MAC: AA:AA:AA:AA:AA:AA
IP: 192.168.1.11
VLAN 100
VXLAN VNI 30001
Distributed IP Anycast Gateway
RR RR
Any Subnet Routed Anywhere – Any VTEP can serve any Subnet
Spine
bridge SVI 100, Gateway IP: 192.168.1.1
Integrated Route & Bridge (IRB) - Route whenever you can,
SVI 200, Gateway IP: Bridge when needed
10.10.10.1
route V
route
– Optimized
No Hairpinning SVI 100 East/West and North/South Routing
Host3
V MAC: CC:CC:CC:CC:CC:CC
V SVI 200Seamless IP: 192.168.1.33
Mobility - All Leaf share same Gateway MAC
VLAN 100
V VXLAN VNI 30001
Leafs
• Optimal for Consistency SVI 100
SVI 200
• Every VLAN/VNI Everywhere
SVI 300
SVI 100
• Sub-Optimal for Scale SVI 100
SVI 200
SVI 300
Scoped Configuration
• Logical Configuration (VLAN, VRF,
VNI) scoped to Leafs with respective RR RR
connected End-Points
• Optimal for Scale SVI 200
SVI 100
• Consistency with End-Points SVI 200
SVI 100
• Configuration Consistency depends SVI 300
SVI 200
on End-Points
SVI 100 SVI 300
SVI 200
Asymmetric IRB
• Similar to todays Inter-VLAN routing
RR RR
• Requires to follow a consistent
configuration of VLAN and L2VNI
across all Switches
• Post routed traffic will leverage
destination Layer 2 Segment
SVI 200
SVI 300
(L2VNI), same as for bridged traffic
SVI 300
SVI 200
SVI 300
✖
RR
SVI 200
RR
SVI 200
SVI 300
Asymmetric IRB
Asymmetric IRB
L2VNI 30001
L2VNI 30002
Leaf
V V
SVI 300 SVI 200 SVI 200 SVI 300
SVI 200
SVI 300
context
• Routed traffic uses transit VNI
(L3VNI), while bridged traffic uses
SVI 300
SVI 200
L2VNI
SVI 300
RR
SVI 200
RR
SVI 200
SVI 300
Symmetric IRB
Symmetric IRB
L3VNI 50001 (VRF)
Leaf
V V
SVI 300 SVI 200 SVI 200 SVI 300
87
What is Multi-Tenancy
88
Where can we apply Multi-Tenancy
Multi-Tenancy at Layer-2 Multi-Tenancy at Layer-3
• Per-Switch VLAN-to-VNI mapping • VRF-to-VNI mapping
RR RR
Spine
bridge
V
VLAN 100
Host3
V MAC: CC:CC:CC:CC:CC:CC
V IP: 192.168.1.33
VLAN 100
V VXLAN VNI 30001
V
V
VLAN 100
Host1
MAC: AA:AA:AA:AA:AA:AA
IP: 192.168.1.11
VLAN 100
VXLAN VNI 30001
Layer-2 Multi-Tenancy – Bridge Domains
VXLAN Overlay
(VNI 30001)
Leaf
V
VLAN 100
Bridge Domain V
VLAN 100
Host1 Host3
MAC: AA:AA:AA:AA:AA:AA MAC: CC:CC:CC:CC:CC:CC
IP: 192.168.1.11 IP: 192.168.1.33
VLAN 100 VLAN 100
VXLAN VNI 30001 VXLAN VNI 30001
Layer-2 Multi-Tenancy – Bridge Domains
VXLAN Overlay
The Bridge Domain
(VNI is the Layer-2 Segment from Host to Host
30001)
V
VLAN 100
Bridge Domain
1) The Ethernet Segment (VLAN), between Host
V and Switch
VLAN 100
2) The Hardware Resources (Bridge Domain) within the Switch
Host1 Host3
MAC: AA:AA:AA:AA:AA:AA MAC: CC:CC:CC:CC:CC:CC
IP: 192.168.1.11 IP: 192.168.1.33
VLAN 100 VLAN 100
VXLAN VNI 30001 VXLAN VNI 30001
VLAN-to-VNI mapping
VXLAN Overlay
(VNI 30001)
Leaf
V V
VLAN 100 VLAN 100
Leaf
V V
VLAN 100 VLAN 200
Leaf
V V
VLAN 100 VLAN 200 VLAN 300
V
V Host2
IP: 10.10.10.22 (VRF-B)
V VLAN 200
SVI 100
Host1
IP: 192.168.1.11 (VRF-A)
VLAN 100
Layer-3 Multi-Tenancy – VRF-VNI or L3VNI
VRF-A VRF-B
(VNI 50001) (VNI 50002)
Leaf
V V V
SVI 100 SVI 200 SVI 300
Leaf
V V V
SVI 100 SVI 200 SVI 300
Leaf
Routing Routing
DomainV
SVI 100
V
SVI 200
Domain V
SVI 300
VRF-A VRF-B
Host1 Host2 Host3
IP: 192.168.1.11 (VRF-A) IP: 10.10.10.22 (VRF-B) IP: 172.16.1.33 (VRF-B)
VLAN 100 VLAN 200 VLAN 300
Layer-3 Multi-Tenancy – VRF-VNI or L3VNI
VRF-A VRF-B
The Routing Domain
(VNI 50001) is the VRF owning multiple
(VNI 50002)across multiple Switches
Subnets
Leaf
Routing Routing
In VXLAN EVPN, the Routing Domain consists of three Components
DomainV
SVI 100
1) The
SVI 200
Domain
V Routing Domains (VRF), local
V to the Switch
SVI 300
2) The Routing Domain (L3VNI) between the Switches
VRF-A VRF-B
3) Multi-Protocol BGP with EVPN Address-Family
Leaf
V V
SVI 100 SVI 200 SVI 300 SVI 400
Leaf
V V
SVI 100 SVI 200 SVI 300 SVI 400
interface eth1/10.1002
VLAN 1001 interface eth1/10.1002
encapsulation dot1q 1002
vrf member VRF-B
Ethernet
VLAN 1002
encapsulation dot1q 1002
vrf member VRF-B
ip address 10.2.2.1/24 ip address 10.2.2.2/24
ip router ospf 100 area 0.0.0.0 ip router ospf 100 area 0.0.0.0
Leaf
V V
SVI 100 SVI 200 SVI 300 SVI 400
router SVI
bgp100
65500 SVI 200 routerSVI
bgp30065500SVI 400
address-family ipv4 unicast address-family ipv4 unicast
neighbor 1.1.1.2 remote-as 65500 neighbor 1.1.1.1 remote-as 65500
address-family vpnv4 unicast address-family vpnv4 unicast
send-community extended send-community extended
vrf VRF-A vrf VRF-A
address-family ipv4 unicast address-family ipv4 unicast
vrf VRF-B vrf VRF-B
address-family ipv4 unicast address-family ipv4 unicast
Subnet1 Subnet2 Subnet3 Host4
IP: 192.168.1.0/24 (VRF-A) IP: 10.10.10.0/24 (VRF-B) IP: 172.16.1.0/24 (VRF-B) IP: 10.44.44.0/24 (VRF-A)
VLAN 100 VLAN 200 VLAN 300 VLAN 400
109
Layer-3 Multi-Tenancy – VXLAN EVPN
L3VNI 50001
VXLAN
L3VNI 50002
Leaf
V V
SVI 100 SVI 200 SVI 300 SVI 400
V Host2
MAC: BB:BB:BB:BB:BB:BB
V IP: 10.10.10.22
SVI 100 VLAN 200
Host1 VXLAN VNI 30002
MAC: AA:AA:AA:AA:AA:AA
IP: 192.168.1.11
VLAN 100
112
VXLAN VNI 30001
Integrated Route & Bridge + Multi-Tenancy
VRF-A (VNI 50001)
RR RR
Spine
SVI 100, Gateway IP: 192.168.1.1 (VRF-A)
SVI 200, Gateway IP: 10.10.10.1 (VRF-A)
V
SVI 100
Host3
V MAC: CC:CC:CC:CC:CC:CC
V SVI 200 IP: 192.168.1.33 (VRF-A)
VLAN 100
V VXLAN VNI 30001
V Host2
MAC: BB:BB:BB:BB:BB:BB
V IP: 10.10.10.22 (VRF-A)
SVI 100 VLAN 200
Host1 VXLAN VNI 30002
MAC: AA:AA:AA:AA:AA:AA
IP: 192.168.1.11 (VRF-A)
VLAN 100
VXLAN VNI 30001 113
Data Center Fabric Properties
Extended Namespace
Scalable Layer-2 Domains
Integrated Route and Bridge
Multi-Tenancy
Agenda
115
“Stories” and Use-Cases
116
• Yesterday: VXLAN, yet another
VXLAN Overlay
• Data-Plane only (Multicast based
applicability •
Flood & Learn)
Today: VXLAN for the creation of
evolves as the scalable DC Fabrics – Intra-DC
• Control-Plane, active VTEP discovery,
Multicast and Unicast (Head-End
Control Plane Replication)
evolves!
Story #1: Scalable Data Center Fabric
118
Story #1: Scalable Data Center Fabric (1)
Spine
Border Leaf
Leaf
Leaf
Leaf
Leaf
119
Story #1: Scalable Data Center Fabric (2)
interface loopback0 interface loopback0
ip address 10.10.10.101/32 ip address 10.10.10.201/32
ip router ospf UNDERLAY area 0.0.0.0 ip router ospf UNDERLAY area 0.0.0.0
interface loopback0
router ospf UNDERLAY
ip address 10.10.10.102/32 router ospf UNDERLAY
router-id
ip router 10.10.10.101
ospf UNDERLAY area 0.0.0.0 router-id 10.10.10.201
interface loopback0
interface
router Ethernet1/1
ospf
ip addressUNDERLAY
10.10.10.103/32 interface Ethernet1/1
mturouter-id
9192
ip router10.10.10.102
ospf UNDERLAY area 0.0.0.0 mtu 9192
ip address 10.1.1.1/30 ip address 10.1.1.2/30
ipinterface
ospf
router network point-to-point
Ethernet1/1
ospf UNDERLAY ip ospf network point-to-point
ip mtu
router
9192ospf 10.10.10.103
router-id UNDERLAY area 0.0.0.0 ip router ospf UNDERLAY area 0.0.0.0
ip ip
pimaddress
sparse-mode
10.1.1.5/30 ip pim sparse-mode
… ipinterface
ospf network point-to-point
Ethernet1/1
ip mtu
router
9192ospf UNDERLAY area 0.0.0.0 interface Ethernet1/2
ip ip
pimaddress
sparse-mode
10.1.1.9/30 mtu 9192
… ip ospf network point-to-point ip address 10.1.1.6/30
ip router ospf UNDERLAY area 0.0.0.0 ip ospf network point-to-point
ip pim sparse-mode ip router ospf UNDERLAY area 0.0.0.0
… ip pim sparse-mode
interface loopback0
ip address 10.10.10.103/32
ip router ospf UNDERLAY area 0.0.0.0
interface loopback0
ip address 10.10.10.102/32
p2p Agg: 10.1.1.0/24 ip pim rp-address 10.254.254.1
ip router ospf UNDERLAY area 0.0.0.0
RID Agg: 10.10.10.0/24 interface loopback0 RP Rendezvous-Point
VTEP Agg: 10.200.200.0/24 ip address 10.10.10.101/32
ip pim rp-address 10.254.254.1
RP Agg: 10.254.254.0/24 ip router ospf UNDERLAY area 0.0.0.0
VTEP
VTEP
VTEP
VTEP
interface loopback1
VTEP ip address 10.200.200.103/32
interface loopback1
ip router ospf UNDERLAY area 0.0.0.0
ip address 10.200.200.102/32
p2p Agg: 10.1.1.0/24 VTEPinterface loopback1
ip routerinterface
ospf UNDERLAY
nve1 area 0.0.0.0
ip address 10.200.200.101/32
RID Agg: 10.10.10.0/24 source-interface loopback1
ip routerinterface
ospf UNDERLAY
nve1 area 0.0.0.0
VTEP Agg: 10.200.200.0/24 host-reachability protocol bgp
RP Agg: 10.254.254.0/24 source-interface loopback1
interface nve1
host-reachability protocol bgp
source-interface loopback1
host-reachability protocol bgp
122
Story #1: Scalable Data Center Fabric (5)
router bgp 65500 router bgp 65500
router-id 10.10.10.202 router-id 10.10.10.203
RR RR
neighbor 10.10.10.0/24 remote-as 65500 neighbor 10.10.10.0/24 remote-as 65500
update-source loopback0 update-source loopback0
address-family l2vpn evpn address-family l2vpn evpn
send-community both send-community both
route-reflector-client VTEP
route-reflector-client
VTEP
VTEP
router bgp 65500
bgp router-id
router VTEP 65500 10.10.10.103
router-id neighbor 10.10.10.202 remote-as 65500
10.10.10.102
router bgp 65500
update-source
neighbor 10.10.10.202 loopback0
remote-as 65500
router-id
VTEP 10.10.10.101address-family
update-source loopback0 l2vpn evpn
neighbor 10.10.10.202 remote-as 65500
send-community both
address-family
update-source loopback0 l2vpn evpn
neighbor
send-community 10.10.10.203 remote-as 65500
p2p Agg: 10.1.1.0/24 VTEP address-family l2vpn evpn both
neighbor update-source
10.10.10.203 loopback0
remote-as 65500
RID Agg: 10.10.10.0/24 send-community both
address-family l2vpn evpn RR BGP Route-Reflector
update-source loopback0
neighbor 10.10.10.203 remote-as 65500
VTEP Agg: 10.200.200.0/24 send-community both
address-family
update-source loopback0 l2vpn evpn
RP Agg: 10.254.254.0/24
address-family l2vpn evpn both
send-community
send-community both
123
Story #1: Scalable Data Center Fabric (6)
vlan 100
vn-segment 30001
name Blue
vlan 200
vn-segment 30002
name Green
VTEP
evpn
vni 30001
rd auto
route-target both auto
VTEP vni 30002
rd auto
VTEP route-target both auto
interface nve1
VTEP source-interface loopback1
host-reachability protocol bgp
VTEP member vni 30001
mcast-group 239.239.239.1
member vni 30002
p2p Agg: 10.1.1.0/24 VTEP mcast-group 239.239.239.2
RID Agg: 10.10.10.0/24
VTEP Agg: 10.200.200.0/24
RP Agg: 10.254.254.0/24
124
Story #1: Scalable Data Center Fabric (7)
interface Vlan100 vrf context VRF-A
mtu 9192 vni 50001
vrf member VRF-A rd auto
ip address 192.168.1.1/24 tag 21921 address-family ipv4 unicast
fabric forwarding mode anycast-gateway route-target both auto
route-target both auto evpn
interface Vlan200 VTEP
address-family ipv6 unicast
mtu 9192 route-target both auto
vrf member VRF-A route-target both auto evpn
ip address 10.10.10.1/24 tag 21921
fabric forwarding mode anycast-gateway interface nve1
VTEP source-interface loopback1
host-reachability protocol bgp
VTEP member vni 50001 associate-vrf
125
Story #1: Scalable Data Center Fabric (8)
interface Ethernet 2/1.10
vrf member VRF-A
ip address 172.16.0.1/30
encapsulation dot1q 5
127
Story #2: Scalable Data Center Fabric (1)
Spine AS65500
AS65555
AS65504
AS65503
AS65503
AS65502
128
Story #2: Scalable Data Center Fabric (2)
interface loopback0 interface loopback0
ip address 10.10.10.101/32 tag 12345 ip address 10.10.10.201/32 tag 12345
interface
interface Ethernet1/1
loopback0 interface Ethernet1/1
mtuip9192
address 10.10.10.102/32 tag 12345 mtu 9192
ip address 10.1.1.1/30 ip address 10.1.1.2/30
interface
interface Ethernet1/1
loopback0
router
mtuipbgp
9192 65501
address 10.10.10.103/32 tag 12345 interface Ethernet1/2
router-id
ip address10.10.10.101
10.1.1.5/30 mtu 9192
address-family ipv4 unicast
interface Ethernet1/1 ip address 10.1.1.6/30
redistribute
router bgp
mtu 9192 direct
65502 route-map UL-TAG
template peer10.10.10.102
router-id
ip address SPINE-UNDERLAY
10.1.1.9/30 interface Ethernet1/3
remote-as 65500 ipv4 unicast
address-family mtu 9192
address-family
redistribute
router bgp 65503ipv4
directunicast
route-map UL-TAG ip address 10.1.1.10/3
neighbor
template10.10.10.2
peer10.10.10.103
router-id SPINE-UNDERLAY
inherit peer 65500
remote-as SPINE-UNDERLAY
address-family ipv4 unicast router bgp 65500
neighbor 10.10.10.L1-S2
address-family
redistribute ipv4 unicast
direct route-map UL-TAG router-id 10.10.10.201
inherit
neighborpeer SPINE-UNDERLAY
10.10.10.6
template peer SPINE-UNDERLAY address-family ipv4 unicast
neighbor
inherit10.10.10.L1-S3
peer 65500
remote-as SPINE-UNDERLAY redistribute direct route-map UL-TAG
inherit
neighborpeer SPINE-UNDERLAY
10.10.10.L1-S2
address-family ipv4 unicast neighbor 10.10.10.1 remote-as 65501
neighbor
inherit10.10.10.L1-S4
neighborpeer SPINE-UNDERLAY
10.10.10.10 address-family ipv4 unicast
p2pinherit
Agg: 10.1.1.0/24
neighborpeer
inherit SPINE-UNDERLAY
10.10.10.L1-S3
peer SPINE-UNDERLAY neighbor 10.10.10.5 remote-as 65502
RID
… Agg: 10.10.10.0/24
inherit
neighborpeer SPINE-UNDERLAY
10.10.10.L1-S2 address-family ipv4 unicast
VTEP Agg:
neighbor10.200.200.0/24
inherit10.10.10.L1-S4
peer SPINE-UNDERLAY neighbor 10.10.10.9 remote-as 65503
RP Agg: 10.254.254.0/24
inherit
neighborpeer SPINE-UNDERLAY
10.10.10.L1-S3 address-family ipv4 unicast
… inherit peer SPINE-UNDERLAY …
neighbor 10.10.10.L1-S4 route-map TAG-UL permit 10
129
inherit peer SPINE-UNDERLAY match tag 12345
…
Story #2: Scalable Data Center Fabric (3)
VTEP
VTEP
VTEP
VTEP
interface loopback1
VTEP ip address 10.200.200.103/32 tag 12345
interface loopback1
ip address 10.200.200.102/32
interface nve1 tag 12345
p2p Agg: 10.1.1.0/24 VTEPinterface loopback1 source-interface loopback1
ip address 10.200.200.101/32
interface nve1 tag 12345
RID Agg: 10.10.10.0/24 host-reachability protocol bgp
VTEP Agg: 10.200.200.0/24 source-interface loopback1
interface nve1
host-reachability protocol bgp
RP Agg: 10.254.254.0/24 source-interface loopback1
host-reachability protocol bgp
130
Story #2: Scalable Data Center Fabric (4)
router bgp 65500 router bgp 65500
router-id 10.10.10.202 router-id 10.10.10.203
address-family l2vpn evpn address-family l2vpn evpn
nexthop route-map NHUNCH nexthop route-map NHUNCH
retain route-target all retain route-target all
neighbor 10.10.10.101 remote-as 65501 neighbor 10.10.10.101 remote-as 65501
update-source loopback0
VTEP
update-source loopback0
address-family l2vpn evpn address-family l2vpn evpn
send-community both send-community both
disable-connected-check disable-connected-check
route-map NHUNCH out route-map NHUNCH out
neighbor 10.10.10.102 remote-as 65502 VTEP 10.10.10.102 remote-as 65502
neighbor
update-source loopback0 update-source loopback0
address-family l2vpn evpn VTEP address-family l2vpn evpn
send-community both send-community both
disable-connected-check disable-connected-check
route-map NHUNCH out VTEP route-map NHUNCH out
neighbor 10.10.10.103 remote-as 65503 neighbor 10.10.10.103 remote-as 65503
update-source loopback0 VTEP update-source loopback0
address-family l2vpn evpn address-family l2vpn evpn
send-community both send-community both
disable-connected-check
p2p Agg: 10.1.1.0/24 VTEP disable-connected-check
route-map
RID Agg: 10.10.10.0/24 NHUNCH out route-map NHUNCH out
VTEP Agg: 10.200.200.0/24
RP Agg: 10.254.254.0/24
route-map NHUNCH permit 10
set ip next-hop unchanged
131
Story #2: Scalable Data Center Fabric (5)
132
Story #2: Scalable Data Center Fabric (6)
vlan 100
vn-segment 30001
name Blue
vlan 200
vn-segment 30002
name Green
VTEP
evpn
vni 30001
rd auto
route-target both 65500:30001
VTEP vni 30002
rd auto
VTEP route-target both 65500:30002
interface nve1
VTEP source-interface loopback1
host-reachability protocol bgp
VTEP member vni 30001
ingress-replication protocol bgp
member vni 30002
p2p Agg: 10.1.1.0/24 VTEP ingress-replication protocol bgp
RID Agg: 10.10.10.0/24
VTEP Agg: 10.200.200.0/24
RP Agg: 10.254.254.0/24
133
Story #2: Scalable Data Center Fabric (7)
interface Vlan100 vrf context VRF-A
mtu 9192 vni 50001
vrf member VRF-A rd auto
ip address 192.168.1.1/24 tag 21921 address-family ipv4 unicast
fabric forwarding mode anycast-gateway route-target both 65500:50001
route-target both 65500:50001 evpn
interface Vlan200 VTEP
address-family ipv6 unicast
mtu 9192 route-target both 65500:50001
vrf member VRF-A route-target both 65500:50001 evpn
ip address 10.10.10.1/24 tag 21921
fabric forwarding mode anycast-gateway interface nve1
VTEP source-interface loopback1
host-reachability protocol bgp
VTEP member vni 50001 associate-vrf
134
Story #2: Scalable Data Center Fabric (8)
interface Ethernet 2/1.10
vrf member VRF-A
ip address 172.16.0.1/30
encapsulation dot1q 5
applicability •
Flood & Learn)
Today: VXLAN for the creation of
evolves as the scalable DC Fabrics – Intra-DC
• Control-Plane, active VTEP discovery,
Multicast and Unicast (Head-End
Control Plane Replication)
• Future: VXLAN for DCI – Inter-DC
evolves! • DCI Enhancements (ARP
caching/suppress, Multi-Homing,
Failure Domain isolation, Loop
Protection etc.)
What is the Elephant in the Room?
137
Note sure if it is a Elephant
VXLAN for Interconnecting Networks
Story #3: Inter-Fabric Connectivity
139
Inter-Fabric Connectivity (Option 1)
• Multiple BGP-EVPN Control-Plane
EVPN Control-Plane Domain 1 Domains
• End-to-End reachability for VTEP
V • End-to-End reachability for BUM
V
Replication
V • Multicast / Ingress Replication
V
V
V
• End-to-End Data-Plane
encapsulation
V
V
VXLAN Encapsulation
EVPN Control-Plane Domain 2
Inter-Fabric Connectivity (Option 2)
• Multiple BGP-EVPN Control-Plane
EVPN Control-Plane Domain 1 Domains
• Normalization via Ethernet (MPLS,
VRF-lite & IEEE 802.1Q Trunk) at
V
V the Border
V • Separate Data-Plane (DP)
V
DCI
V
encapsulation per Domain
V
• Multicast / Ingress Replication
V
V
VXLAN Encapsulation
EVPN Control-Plane Domain 2 DCI Encapsulation
Inter-Fabric Connectivity (Option 3 / Option 4)
• Multiple BGP-EVPN Control-Plane
EVPN Control-Plane Domain 1 Domains
• Integrated Hand-Off with Data-Plane
separation
V
V • Option 3 – L3 DCI
• L3-LISP, MPLS, EVPN
V
V
• Option 4 – L2 DCI
V • OTV, L2-LISP, EVPN
V
• Separate Data-Plane (DP)
V encapsulation per Domain
V • Multicast / Ingress Replication
VXLAN Encapsulation
EVPN Control-Plane Domain 2 DCI Encapsulation
Inter-Fabric Connectivity
Option 1 Option 2 Option 3/4
Underlay Control Plane Unified Underlay Domain Separated Underlay Domains Separated Underlay Domains
Broadcast
no yes yes
Suppression/Limit (DCI)
Layer-2 Loop Prevention Loop mitigation (Edge Protection) VPC at Border Loop mitigation (At DCI)
Fabric Management &
Automation
144
How to achieve Data Center Automation
• Simplify
• Do not start with the most difficult task (low hanging Fruits)
• Standardize
• Find common Denominators and create Templates
• Automate repetitive Tasks
• Use Templates for Simple Tasks and use Automation (e.g. create VLAN, SVI, VRF)
• Abstract
• Take a step back and look at the WHOLE
• Cisco ACI
145
Anatomy of Data Center Automation
VMM Chef Openstack
Puppet NX-API Ansible
API
IP Fabric
Network Infrastructure
Fabric Management & Operations
Element Day-0: Day- 1: Day-2:
management:
Configuration Configuration and Visibility,
Hardware (POAP) Configuration Configuration
Management, Management increments,
Health Status, and Underlay compare changes.
Inventory Management Automated
Configuration
Compute
Integration
Troubleshooting
Simplifying Management & Fabric Visibility
• Device Auto-Configuration
• Cabling Plan Consistency Check
• Automated Network Provisioning
• Common point of fabric access
• Tenant, Virtual Fabric & Host
Visibility
Device Auto-Configuration: Day 0
• Underlay Configuration:
• Physical interface IP configuration
• Loopback interface IP configuration
• Multicast Configuration for the Underlay (BUM)
• Routing protocol for the underlay configuration
• vPC domain
• BGP EVPN + RR configuration
• VTEP configuration
149
Device Auto-Configuration: Day 0.5
150
Device Auto-Configuration: Day 1
151
Device Auto-Configuration (POAP)
Day 0, Day 0.5 and Day 1
1. Easy way to unbox, rack the device, and not enter any base CLI
configuration. Just rack, power, and plug into the management
network.
2. Provides a standard and consistent configuration across of the data
center network devices.
3. Provides a standard and consistent images to deploy to all of the
data center devices.
152
Q&A
153
Recommended Reading
Using TRILL, FabricPath, and VXLAN:
Designing Massively Scalable Data
Centers (MSDC) with Overlays
• Sanjay K. Hooda
• Shyam Kapadia
• Padmanabhan Krishnan
ISBN-10: 1-58714-393-3
ISBN-13: 978-1-58714-393-9
Recommended Viewing
Cisco Programmable Fabric Using
VXLAN with BGP EVPN LiveLessons
• David Jansen
• Lukas Krattiger
ISBN-10: 0-13-427229-3
ISBN-13: 978-0-13-427229-0
Thank you
156