TCP Congestion

Acknowledgments
S. Athuraliya, D. Lapsley, V. Li, Q. Yin

(UMelb)
J. Doyle (Caltech), K.B. Kim

(SNU/Caltech), F. Paganini (UCLA), J.
Wang (Caltech), Z. Wang (UCLA)
L. Peterson, L. Wang (Princeton)
Matthew Roughan (AT&T Labs)

Copyright, 1996 Dale Carnegie &
Associates, In
TCP Congestion Control:
Algorithms & Models
Steven Low
netlab.caltech.edu
IPAM, 2002

Outline
Introduction
Algorithms
Window flow control
TCP algorithm: Tahoe, Reno, Vegas
AQM algorithm: RED, REM, variants
Models
Duality model
Linear dynamic model

Part 0
Introduction

TCP/IP Protocol Stack
Applications (e.g. Telnet, HTTP)
TCP UDP
ICMP
ARP IP
Link Layer (e.g. Ethernet, ATM)
Physical Layer (e.g. Ethernet, SONET)

Packet Terminology
Application Message
TCP data TCP
hdr
MSS
TCP
Segment
IP data IP hdr
IP Packet
Ethernet data Ethern
et
Ethernet Frame
20 bytes
20 bytes
14 bytes
4 bytes
MTU 1500 bytes

IP
Packet switched
Datagram service
Unreliable (best effort)
Simple, robust
Heterogeneous
Dumb network, intelligent terminals

Compare with PSTN

TCP
Packet switched
End-to-end
Reliable, in order delivery of a packet

stream
Reliability through ACKs
Flow control: use bandwidth

efficiently

Success of IP
Applications
IP
Transmission
Simple/Robust
Robustness against failure
Robustness against technological

evolutions
WWW, Email, Napster, FTP,
Ethernet, ATM, POS, WDM,

TCP versions
TCP is not perfectly homogenous

(200+)
4.2 BSD first widely available release of TCP/IP (1983)
4.3 BSD (1986)
4.3 BSD Tahoe (1988)
4.3 BSD Reno (1990)
Vegas (1994)
Windows 95
SunOS 4.1.3,4.1.4
4.4 BSD (1993)
HP/UX
Solaris 2.3,2.4
Digital OSF
Windows NT
Linux 1.0
IRIX
.....
NewReno (1999)

Part I
Algorithms

TCP & AQM
x
i
(t)
p
l
(t)
Example congestion measure p
l
(t)
Loss (Reno)
Queueing delay (Vegas)

TCP & AQM
x
i
(t)
p
l
(t)
TCP:
Reno
Vegas
AQM:
DropTail
RED
REM,PI,AVQ

Outline
Introduction
Algorithms
Window flow control
Models
Duality model

Early TCP
Pre-1988
Go-back-N ARQ
Detects loss from timeout
Retransmits from lost packet onward
Receiver window flow control
Prevent overflows at receive buffer
Flow control: self-clocking

Why Flow Control?
October 1986, Internet had its first

congestion collapse
Link LBL to UC Berkeley
400 yards, 3 hops, 32 Kbps
throughput dropped to 40 bps
factor of ~1000 drop!
1988, Van Jacobson proposed TCP

flow control

Effect of Congestion
Packet loss
Retransmission
Reduced throughput
Congestion collapse due to
Unnecessarily retransmitted packets
Undelivered or unusable packets
Congestion may continue after the overload!

throughput
load

Window Flow Control
~ W packets per RTT
Lost packet detected by missing

ACK
RTT
time
time
Source
Destination
1 2 W
1 2 W
1 2 W
data
ACKs
1 2 W

Window flow control
Limit the number of packets in the network

to window W
Source rate = bps
If W too small then rate capacity

If W too big then rate > capacity
=> congestion
Adapt W to network (and conditions)

W = BW x RTT
RTT
MSS W

TCP Window Flow Controls
Receiver flow control
Avoid overloading receiver
Set by receiver
awnd: receiver (advertised) window
Network flow control
Avoid overloading network
Set by sender
Infer available network capacity
cwnd: congestion window
Set W = min (cwnd, awnd)

Receiver Flow Control
Receiver advertises awnd with each

ACK
Window awnd
closed when data is received and ackd
opened when data is read
Size of awnd can be the performance

limit (e.g. on a LAN)
sensible default ~16kB

Network Flow Control
Source calculates cwnd from

indication of network congestion
Congestion indications
Losses
Delay
Marks
Algorithms to calculate cwnd
Tahoe, Reno, Vegas, RED, REM

Outline
Introduction
Algorithms
Window flow control
Models
Duality model

TCP Congestion Controls
Tahoe (Jacobson 1988)
Slow Start
Congestion Avoidance
Fast Retransmit
Reno (Jacobson 1990)
Fast Recovery
Vegas (Brakmo & Peterson 1994)
New Congestion Avoidance
RED (Floyd & Jacobson 1993)
Probabilistic marking
REM (Athuraliya & Low 2000)
Clear buffer, match rate

Variants
Tahoe & Reno
NewReno
SACK
Rate-halving
Mod.s for high performance
AQM
RED, ARED, FRED, SRED
BLUE, SFB
REM, PI, AVQ

TCP Tahoe (Jacobson 1988)
SS
time
window
CA
SS: Slow Start
CA: Congestion Avoidance

Slow Start
data
packet
ACK
receiver sender
1 RTT
cwnd
1
2
3
4
5
6
7
8
cwnd cwnd + 1 (for each ACK)

Congestion Avoidance
cwnd
1
2
3
1 RTT
4
data
packet
ACK
cwnd cwnd + 1 (for each cwnd ACKS)
receiver sender

Packet Loss
Assumption: loss indicates congestion
Packet loss detected by
Retransmission TimeOuts (RTO timer)
Duplicate ACKs (at least 3)

1
2 3 4 5 6
1 2 3
Packets
Acknowledgements
3 3
7
3

Fast Retransmit
Wait for a timeout is quite long
Immediately retransmits after 3

dupACKs without waiting for timeout
Adjusts ssthresh
flightsize = min(awnd, cwnd)
ssthresh max(flightsize/2, 2)
Enter Slow Start (cwnd = 1)

Summary: Tahoe
Basic ideas
Gently probe network for spare capacity
Drastically reduce rate on congestion
Windowing: self-clocking
Other functions: round trip time estimation,

error recovery
for every ACK {
if (W < ssthresh) then W++ (SS)
else W += 1/W (CA)
}
for every loss {
ssthresh = W/2
W = 1
}

Fast recovery
Motivation: prevent `pipe from emptying

after fast retransmit
Idea: each dupACK represents a packet

having left the pipe (successfully received)
Enter FR/FR after 3 dupACKs
Set ssthresh max(flightsize/2, 2)
Retransmit lost packet
Set cwnd ssthresh + ndup (window inflation)
Wait till W=min(awnd, cwnd) is large enough;

transmit new packet(s)
On non-dup ACK (1 RTT later), set cwnd

ssthresh (window deflation)
Enter CA

9
9
4
0 0
Example: FR/FR
Fast retransmit
Retransmit on 3 dupACKs
Fast recovery
Inflate window while repairing loss to fill

pipe
time
S
time
R
1 2 3 4 5 6 8 7
8
cwnd 8
ssthresh
1
7
4
0 0 0
Exit FR/FR
4
4
4
11
0 0
101
1

Summary: Reno
Basic ideas
Fast recovery avoids slow start
dupACKs: fast retransmit + fast recovery
Timeout: fast retransmit + slow start

slow start
retransmit
congestion
avoidance
FR/FR
dupACKs
timeout

NewReno: Motivation
On 3 dupACKs, receiver has packets 2, 4, 6, 8,

cwnd=8, retransmits pkt 1, enter FR/FR
Next dupACK increment cwnd to 9
After a RTT, ACK arrives for pkts 1 & 2, exit FR/FR,

cwnd=5, 8 unacked pkts
No more ACK, sender must wait for timeout

1 2
time
S
time
D
3 4 5 6 8 7 1
8
FR/FR
0 9
0 0 0 0 0
9
8 unackd pkts
2
5
3
timeout

NewReno Fall & Floyd 96, (RFC 2583)
Motivation: multiple losses within a window
Partial ACK acknowledges some but not all

packets outstanding at start of FR
Partial ACK takes Reno out of FR, deflates window
Sender may have to wait for timeout before

proceeding
Idea: partial ACK indicates lost packets
Stays in FR/FR and retransmits immediately
Retransmits 1 lost packet per RTT until all lost

packets from that window are retransmitted
Eliminates timeout

SACK Mathis, Mahdavi, Floyd, Romanow 96 (RFC 2018, RFC 2883)
Motivation: Reno & NewReno retransmit at

most 1 lost packet per RTT
Pipe can be emptied during FR/FR with multiple

losses
Idea: SACK provides better estimate of

packets in pipe
SACK TCP option describes received packets
On 3 dupACKs: retransmits, halves window,

enters FR
Updates pipe = packets in pipe
Increment when lost or new packets sent
Decrement when dupACK received
Transmits a (lost or new) packet when pipe <

cwnd
Exit FR when all packets outstanding when FR

was entered are acknowledged

Outline
Introduction
Algorithms
Window flow control
Models
Duality model

TCP Vegas (Brakmo & Peterson 1994)
Reno with a new congestion avoidance

algorithm
Converges (provided buffer is large) !

SS
time
window
CA

for every RTT
{
if W/RTT
min
W/RTT < then W

++
if W/RTT
min
W/RTT > then W

--
}
for every loss
W := W/2
Congestion avoidance
Each source estimates number of its

own packets in pipe from RTT
Adjusts window to maintain estimate

between d and d

Implications
Congestion measure = end-to-end

queueing delay
At equilibrium
Zero loss
Stable window at full utilization
Approximately weighted proportional

fairness
Nonzero queue, larger for more sources
Convergence to equilibrium
Converges if sufficient network buffer
Oscillates like Reno otherwise

Outline
Introduction
Algorithms
Window flow control
Models
Duality model

Active queue management
Idea: provide congestion information

by probabilistically marking packets
Issues
How to measure congestion?
How to embed congestion measure?
How to feed back congestion info?

Congestion measure: average queue length

b
l
(t+1) = [b
l
(t) + y
l
(t) - c
l
]
+
Embedding: p-linear probability function
Feedback: dropping or ECN marking

Avg queue
marking
1

Variant: ARED (Feng, Kandlur, Saha, Shin 1999)
Motivation: RED extremely sensitive

to #sources
Idea: adapt max
p
to load
If avg. queue < min
th
, decrease max
p
If avg. queue > max
th
, increase max
p
No per-flow information needed

Variant: FRED (Ling & Morris 1997)
Motivation: marking packets in proportion to

flow rate is unfair (e.g., adaptive vs
unadaptive flows)
Idea
A flow can buffer up to min
q
packets without being
marked
A flow that frequently buffers more than max
q

packets gets penalized
All flows with backlogs in between are marked

according to RED
No flow can buffer more than avgcq packets

persistently
Need per-active-flow accounting

Variant: SRED (Ott, Lakshman & Wong 1999)
Motivation: wild oscillation of queue in

RED when load changes
Idea:
Estimate number N of active flows
An arrival packet is compared with a

randomly chosen active flows
N ~ prob(Hit)
-1
cwnd~p
-1/2
and Np
-1/2
= Q
0
implies p = (N/Q
0
)
2
Marking prob = m(q) min(1, p)
No per-flow information needed

Variant: BLUE (Feng, Kandlur, Saha, Shin 1999)
Motivation: wild oscillation of RED

leads to cyclic overflow &
underutilization
Algorithm
On buffer overflow, increment marking

prob
On link idle, decrement marking prob

Congestion measure: average queue length

b
l
(t+1) = [b
l
(t) + y
l
(t) - c
l
]
+
Embedding: p-linear probability function

Avg queue
marking
1

Congestion measure: price

p
l
(t+1) = [p
l
(t) + (
l
b
l
(t)+ x
l

(t) - c
l
)]
+
Embedding:

Congestion measure: price

p
l
(t+1) = [p
l
(t) + (
l
b
l
(t)+ x
l

(t) - c
l
)]
+
Embedding: exponential probability

function

0 2 4 6 8 10 12 14 16 18 20
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Link congestionmeasure
L
i
n
k

m
a
r
k
i
n
g

p
r
o
b
a
b
i
l
i
t
y

Match rate
Clear buffer and match rate

Clear buffer
Key features
+
+ + + )] ) ( ) ( ( ) ( [ ) 1 (
l
l
l l l l
c t x t b t p t p
) ( ) (
1 1
t p t p
s
l

Sum prices
Theorem (Paganini 2000)
Global asymptotic stability for general utility
function (in the absence of delay)

Summary
x
i
(t)
p
l
(t)
TCP:
Reno
Vegas
AQM:
DropTail
RED
REM,PI,AVQ

Application: Wireless TCP
Reno uses loss as congestion

measure
In wireless, significant losses due to
Fading
Interference
Handover
Not buffer overflow (congestion)
Halving window too drastic
Small throughput, low utilization

Proposed solutions
Ideas
Hide from source noncongestion losses
Inform source of noncongestion losses
Approaches
Link layer error control
Split TCP
Snoop agent
SACK+ELN (Explicit Loss Notification)

Third approach
Problem
Reno uses loss as congestion measure
Two types of losses
Congestion loss: retransmit + reduce window
Noncongestion loss: retransmit
Previous approaches
Hide noncongestion losses
Indicate noncongestion losses
Our approach
Eliminates congestion losses (buffer

overflows)

Third approach
Idea
REM clears buffer
Only noncongestion losses
Retransmits lost packets without reducing

window
Router
REM capable
Host
Do not use loss as congestion measure

Vegas
REM

Performance
Goodput

Performance
Goodput

Part II
Models

Congestion Control
Heavy tail Mice-elephants
Elephant
Internet
Mice
Congestion control
efficient & fair
sharing
small
delay
queueing +
propagation
CDN

TCP
x
i
(t)

TCP & AQM
x
i
(t)
p
l
(t)
Example congestion measure p
l
(t)
Loss (Reno)

Outline
Protocol (Reno, Vegas, RED,
REM/PI)
Equilibrium
Performance
Throughput, loss,
delay
Fairness
Utility
Dynamics
Local stability
Cost of stabilization
)) ( ), ( ( ) 1 (
)) ( ), ( ( ) 1 (
t x t p G t p
t x t p F t x
+
+

Outline
Introduction
Algorithms
Window flow control
Models
Duality model

p 1

Flow control
Interaction of source rates x
s
(t) and
congestion measures p
l
(t)
Duality theory
They are primal and dual variables
Flow control is optimization process
Example congestion measure
Loss (Reno)

Model
c
1
c
2
Sources s
L(s) - links used by source s
U
s
(x
s
) - utility if source rate = x
s
Network
Links l of capacities c
l
x
1
x
2
x
3
1 2 1
c x x +
2 3 1
c x x +

Primal problem
L l c x
x U
l
l
s
s s
x
s

, subject to
) (
max
0
Assumptions
Strictly concave increasing U
s
Unique optimal rates x
s
exist
Direct solution impractical

History
Formulation
Kelly 1997
Penalty function approach
Kelly, Maulloo and Tan 1998
Kunniyur and Srikant 2000
Duality approach
Low and Lapsley 1999
Athuraliya and Low 2000, Low 2000
Many extensions

History
Formulation
Kelly 1997
Penalty function approach
Kelly, Maulloo and Tan 1998
Kunniyur and Srikant 2000
Duality approach
Low and Lapsley 1999
Athuraliya and Low 2000, Low 2000
Many extensions

Duality Approach
)) ( ), ( ( ) 1 (
)) ( ), ( ( ) 1 (
t x t p G t p
t x t p F t x
+
+
Primal-dual algorithm:
,
_
) ( ) (
max
) ( min
, subject to ) (
max

0
0
0
: Dual
: Primal
l
l
l
l
s
s s
x
p
l
l
s
s s
x
x c p x U p D
L l c x x U
s
s

Duality Model of TCP/AQM
)) ( ), ( ( ) 1 (
)) ( ), ( ( ) 1 (
t x t p G t p
t x t p F t x
+
+
Reno, Vegas
DropTail, RED, REM
Source algorithm iterates on rates
Link algorithm iterates on prices
With different utility functions

Example
Basic algorithm
)) ( ( ) 1 ( : source
1 '
t p U t x
s
s s
+
+
+ + )] ) ( ( ) ( [ ) 1 ( : link
l
l
l l
c t x t p t p
Theorem (ToN99)
Converge to optimal rates in asynchronous
environment
TCP schemes are smoothed versions of
source
algorithm

Summary
L l c x
x U
l
l
s
s s
x
s

, subject to
) (
max
0
Flow control problem
Major TCP schemes
Maximize aggregate source utility
With different utility functions
Primal-dual algorithm
)) ( ), ( ( ) 1 (
)) ( ), ( ( ) 1 (
t x t p G t p
t x t p F t x
+
+

Outline
REM/PI)
Equilibrium
Performance
Throughput, loss,
delay
Fairness
Utility
Dynamics
Local stability
)) ( ), ( ( ) 1 (
)) ( ), ( ( ) 1 (
t x t p G t p
t x t p F t x
+
+

( ) ) ( ) (
2
) (

)) ( 1 )( (
t p t x
t w
w
t p t x
t w
s
s
s
s
s

x(t+1) = F( p(t), x(t) )
p(t+1) = G( p(t), x(t) )
Reno: F
Reno, Vegas
DropTail, RED, REM
for every ack (ca)
{ W += 1/W }
for every loss
{ W := W/2 }

( ) ) ( ) (
2
) (

)) ( 1 )( (
t p t x
t w
w
t p t x
t w
s
s
s
s
s

x(t+1) = F( p(t), x(t) )
p(t+1) = G( p(t), x(t) )
Reno: F
Reno, Vegas
DropTail, RED, REM
for every ack (ca)
{ W += 1/W }
for every loss
{ W := W/2 }

( ) ) ( ) (
2
) (

)) ( 1 )( (
t p t x
t w
w
t p t x
t w
s
s
s
s
s

x(t+1) = F( p(t), x(t) )
p(t+1) = G( p(t), x(t) )
Reno: F
Reno, Vegas
DropTail, RED, REM
for every ack (ca)
{ W += 1/W }
for every loss
{ W := W/2 }
( ) ) (
2
) (

)) ( 1 (
) ( ) ( ), (
2
2
t p
t x
D
t p
t x t x t p F
s
s
s s

+

Reno: Utility Function
p
D x
s s

2
2
2 2

+
( ) ) (
2
) (

)) ( 1 (
) ( ) ( ), (
2
2
t p
t x
D
t p
t x t x t p F
s
s
s s

+
( ) x p F x
s s
,
,
_

2
tan
2
) (
1
s s
s
s
reno
s
D x
D
x U
p
x
D
p
s
s
2

) 1 (
2
2

Reno: summary
Equilibrium characterization
Duality
Congestion measure p = loss
Implications
Reno equalizes window w = D
s
x
s
inversely proportional to delay D
s
dependence for small p

p
D x
s s

2
2
2 2

+
) (
s
reno
s
x U
p 1

2

p D
x
s
s

Validation - Reno
30 sources, 3 groups with RTT = 3, 5, 7ms + 6ms (queueing

delay) Link capacity = 64 Mbps, buffer = 50 kB
Measured windows equalized, match well with theory (black

line)

Reno & gradient algorithm
Gradient algorithm
)) ( ( ) 1 ( : source
1 '
t q U t x
i i i

+
+
+ + )] ) ( ( ) ( [ ) 1 ( : link
l l l l
c t y t p t p
( ) ) (
2
) (

)) ( 1 (
) ( ) ( ), (
2
2
t q
t x t q
t x t x t q F
i
i
i
i
i i i i

TCP approximate version of gradient

algorithm

Reno & gradient algorithm
Gradient algorithm
)) ( ( ) 1 ( : source
1 '
t q U t x
i i i

+
+
+ + )] ) ( ( ) ( [ ) 1 ( : link
l l l l
c t y t p t p
TCP approximate version of gradient

algorithm
( )
+
1
]
1
+ + )) ( ) ( (
2
) (
) ( 1
2 2
t x t x
t q
t x t x
i i
i
i i
)) ( (
1 '
t q U
i i

Active Queue Management
p
l
(t) G(p(t), x(t))
DropTail loss [1 - c
l
/x
l

(t)]
+
(?)
DropTail delay [p
l
(t) + x
l

(t)/c
l
- 1]
+
RED queue [p
l
(t) + x
l
(t) - c
l
]
+
REM price [p
l
(t) + (
l
b
l
(t)+ x
l

(t) - c
l
)]
+
x(t+1) = F( p(t), x(t) )
p(t+1) = G( p(t), x(t) )
Reno, Vegas
DropTail, RED, REM

Queue
DropTail
queue = 94%
RED
min_th = 10 pkts
max_th = 40 pkts
max_p = 0.1
p = Lagrange multiplier!
p increasing in queue!
REM
queue = 1.5 pkts
utilization = 92%
= 0.05, = 0.4, = 1.15
p decoupled from queue

queue size
for every RTT
{ if W/RTT
min
W/RTT < then W

++
if W/RTT
min
W/RTT > then W

-- }
for every loss
W := W/2
Vegas
( )
s s s s s
s
s s
d t x d t w
D
t x t x
'
< + + ) ( ) ( if
1
) ( 1
2
( ) else ) ( 1 t x t x
s s
+
( )
s s s s s
s
s s
d t x d t w
D
t x t x
'
> + ) ( ) ( if
1
) ( 1
2
F:
p
l
(t+1) = [p
l
(t) + x
l

(t)/c
l
- 1]
+
G:

Performance
Delay
Congestion measures: end to end

queueing delay
Sets rate
Equilibrium condition: Littles Law
Loss
No loss if converge (with sufficient buffer)
Otherwise: revert to Reno (loss

unavoidable)
) (
) (
t q
d
t x
s
s
s s

Vegas Utility
s s s s
reno
s
x d x U log ) (
Equilibrium (x, p) = (F, G)

Proportional fairness

Vegas & Basic Algorithm
Basic algorithm
)) ( ( ) 1 ( : source
1 '
t p U t x
s s

+
TCP smoothed version of Basic Algorithm

Vegas & Gradient Algorithm
Basic algorithm
)) ( ( ) 1 ( : source
1 '
t p U t x
s s

+
TCP smoothed version of Basic Algorithm
Vegas
)) ( (
1 '
t p U
s
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
< + +
( ) else ) ( 1 t x t x
s s
+
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
> +

Validation
Source 1 Source 3 Source 5

RTT (ms) 17.1 (17) 21.9 (22) 41.9 (42)
Rate (pkts/s) 1205 (1200) 1228 (1200) 1161
(1200)
Window (pkts) 20.5 (20.4) 27 (26.4) 49.8
(50.4)
Avg backlog (pkts) 9.8 (10)
Single link, capacity = 6 pkts/ms
5 sources with different propagation delays,
s
= 2
pkts/RTT
meausred theory

Persistent congestion
Vegas exploits buffer process to compute

prices (queueing delays)
Persistent congestion due to
Coupling of buffer & price
Error in propagation delay estimation
Consequences
Excessive backlog
Unfairness to older sources

Theorem
A relative error of
s
in propagation delay
estimation distorts the utility function to
s s s s s s s s s
x d x d x U + + log ) 1 ( ) (

Evidence
Single link, capacity = 6 pkt/ms,
s
= 2 pkts/ms, d
s
= 10 ms
With finite buffer: Vegas reverts to Reno

Without estimation error With estimation error

Evidence
Source rates (pkts/ms)
# src1 src2 src3 src4 src5
1 5.98 (6)
2 2.05 (2) 3.92 (4)
3 0.96 (0.94) 1.46 (1.49) 3.54 (3.57)
4 0.51 (0.50) 0.72 (0.73) 1.34 (1.35) 3.38 (3.39)
5 0.29 (0.29) 0.40 (0.40) 0.68 (0.67) 1.30 (1.30) 3.28
(3.34)
# queue (pkts) baseRTT (ms)
1 19.8 (20) 10.18 (10.18)
2 59.0 (60) 13.36 (13.51)
3 127.3 (127) 20.17 (20.28)
4 237.5 (238) 31.50 (31.50)
5 416.3 (416) 49.86 (49.80)

end2end queueing delay
Vegas/REM
To preserve Vegas utility function & rates
REM
Clear buffer : estimate of d
s
Sum prices : estimate of p

s
Vegas/REM
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
< + +
( ) else ) ( 1 t x t x
s s
+
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
> +
s
s
q
d
s s
x

end2end queueing delay
Vegas/REM
REM
s

s
Vegas/REM
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
< + +
( ) else ) ( 1 t x t x
s s
+
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
> +
s
s
q
d
s s
x
s s
s
d D
d
s s
x

Vegas/REM
REM
s

s
Vegas/REM
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
< + +
( ) else ) ( 1 t x t x
s s
+
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
> +
end2end price
s
s
p
d
s s
x

Vegas/REM
REM
s

s
Vegas/REM
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
< + +
( ) else ) ( 1 t x t x
s s
+
( ) ) ( ) ( if
1
) ( 1
2
t x t x
D
t x t x
s s
s
s s
'
> +
s
s
p
d
s s
x
end2end price

Performance
Vegas/REM Vegas
peak = 43 pkts
utilization : 90% - 96%

Conclusion
)) ( ), ( ( ) 1 (
)) ( ), ( ( ) 1 (
t x t p G t p
t x t p F t x
+
+
Duality model of TCP: (F, G, U)
Reno, Vegas
Maximize aggregate
utility
With different utility

functions
DropTail, RED, REM
Decouple congestion &

performance
Match rate, clear buffer
Sum prices

Outline
REM/PI)
Equilibrium
Performance
Throughput, loss,
delay
Fairness
Utility
Dynamics
Local stability
)) ( ), ( ( ) 1 (
)) ( ), ( ( ) 1 (
t x t p G t p
t x t p F t x
+
+

Model structure
F
1
F
N
G
1
G
L
R
f
(s)
R
b
(s)
TCP Network
AQM
Multi-link multi-source network

AI MD
2
) (
) ( ) (
) (
1
)) ( 1 )( (
2
t x
t q t x
t x
t q t x x
i
i i i
i i
i i i i

Duality model - AIMD

source rate e2e prob

l
b
li l i
t m t q ) ( ) (

AI MD
2
) (
) ( ) (
) (
1
)) ( 1 )( (
2
t x
t q t x
t x
t q t x x
i
i i i
i i
i i i i

Linear model - AIMD

) (
1

2
t q
q
(t) x q x x
i
i i
i i i i

Linearize around equilibrium

) (
1 1
) (
2
s q
q x s q
s x
i
i i i i
i
+

In Laplace domain

)) ( ( ) (
) (
t p m t m
c t y p
l l l
l l l
Duality model - AIMD

congestion
measure
marking
prob

)) ( ( ) (
) (
t p m t m
c t y p
l l l
l l l
Linear model - AIMD

Aggregate rate
(t) y p m m
(t) y p
l l l l
i l
) (
'
Linearize around equilibrium

i
f
i i l
t x t y ) ( ) (
(s) y p m s m
l l l l
) ( ) (
'
In Laplace domain

Closed loop system is stable if and
only if
det (I + L(s)) = 0
for no s in closed RHP
Linear Model - Network
Multi-link multi-source model
x
q
p
y
TCP Network
AQM
F
1
F
N
G
1
G
L
R
f
(s)
R
b
T
(s)
) ( ) ( ) ( ) ( ) ( s G s R s F s R s L
T
b f

Linearized system
F
1
F
N
G
1
G
L
R
f
(s)
R
b
(s)
x y
q
p
s
n
s
n
n
i
i i
i
n
e
e
x
c
s
c s
c
w p s p
s L

1
1
) (
1
) (
* * *
TCP RED queue delay

Papers
Scalable laws for stable network

congestion control (CDC2001)
A duality model of TCP flow controls (ITC, Sept

2000)
Optimization flow control, I: basic

algorithm & convergence (ToN, 7(6), Dec 1999)
Understanding Vegas: a duality model (J.

ACM, 2002)
REM: active queue management (Network,

May/June 2001)
netlab.caltech.edu

TCP Congestion

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

TCP Congestion

Uploaded by

Copyright:

Available Formats

Acknowledgments

S. Athuraliya, D. Lapsley, V. Li, Q. Yin

J. Doyle (Caltech), K.B. Kim

L. Peterson, L. Wang (Princeton)

Matthew Roughan (AT&T Labs)

Window flow control

TCP algorithm: Tahoe, Reno, Vegas

AQM algorithm: RED, REM, variants

Linear dynamic model

Unreliable (best effort)

Dumb network, intelligent terminals

Reliable, in order delivery of a packet

Reliability through ACKs

Flow control: use bandwidth

Robustness against failure

Robustness against technological

TCP is not perfectly homogenous

Queueing delay (Vegas)

Window flow control

TCP algorithm: Tahoe, Reno, Vegas

AQM algorithm: RED, REM, variants

Linear dynamic model

Detects loss from timeout

Retransmits from lost packet onward

Receiver window flow control

Prevent overflows at receive buffer

Flow control: self-clocking

October 1986, Internet had its first

Link LBL to UC Berkeley

400 yards, 3 hops, 32 Kbps

throughput dropped to 40 bps

factor of ~1000 drop!

1988, Van Jacobson proposed TCP

Congestion collapse due to

Unnecessarily retransmitted packets

Undelivered or unusable packets

Congestion may continue after the overload!

~ W packets per RTT

Lost packet detected by missing

Limit the number of packets in the network

Source rate = bps

If W too small then rate capacity

Adapt W to network (and conditions)

Receiver flow control

Avoid overloading receiver

awnd: receiver (advertised) window

Network flow control

Avoid overloading network

Infer available network capacity

cwnd: congestion window

Set W = min (cwnd, awnd)

Receiver advertises awnd with each

closed when data is received and ackd

opened when data is read

Size of awnd can be the performance

sensible default ~16kB

Source calculates cwnd from

Algorithms to calculate cwnd

Tahoe, Reno, Vegas, RED, REM

Window flow control

TCP algorithm: Tahoe, Reno, Vegas

AQM algorithm: RED, REM, variants

Linear dynamic model

Tahoe (Jacobson 1988)