Professional Documents
Culture Documents
Venkataraman - Towards A Video QoE Definition in Converged Networks - IEEE - 2007 PDF
Venkataraman - Towards A Video QoE Definition in Converged Networks - IEEE - 2007 PDF
Abstract rate for voice or video service, but instead, being able to
guarantee good mean opinion scores (MOS) will certainly
Most resource allocation decisions in converged multi- bring in new subscribers.
service networks are presently driven by QoS parameters Video QoE is influenced by both video MOS and zap-
and service level agreements. Network operators are real- ping irregularities. Video MOS captures the subjective as-
izing that such policy decisions ought to be based on Qual- sessment of video quality, while poor zapping indicates sig-
ity of Experience (QoE), which is a better measure of sub- naling anomalies. Signaling anomalies are well understood,
jective video perception. Video QoE has two components: while video MOS lacks well accepted semantic definition
(a) Zapping and (b) Video Quality or Video Mean Opin- and is a much harder problem to deal with. Before one can
ion Score (MOS). Zapping anomalies are well understood gather video quality degradations in the network, there is a
and easily characterized, but there is no consensus on what need to agree on how to measure this.
the definition of a Video-MOS should be. This is largely be- Currently used video evaluation schemes are either
cause there is little understanding of qualitative and quanti- housed on the end systems or in the network core. Exam-
tative subjective degradations caused by networking events ples of end system schemes are the peak signal to noise ra-
like loss, delay, jitter, and error. In this work we conduct tio (PSNR), MPQM [2] and VQM [3]. PSNR is a simple
extensive experiments on a simulation testbed in a search frame-to-frame calculation, VQM a subjective evaluation,
for a good Video-MOS definition, with specific emphasis and MPQM considers human eye perception and subjec-
on quality degradations due to network transmission. We tivity. Differing in levels of complexity of operation and
deploy “collector” nodes that can gather various statistics the need for specialized hardware, such end system imple-
of a given flow. We also identify key parameters that such mentations which characterize a stream at playout (after
collector nodes must maintain such that an overlay of such the network transmission is complete) can do little to iso-
nodes can identity impairment points. Lightweight Video- late or correct any network induced impairment. In fact,
MOS definition evolving out this work can be used in col- their primary purpose is to evaluate the effectiveness of dif-
lector overlay networks to solve problems like root-cause ferent transcoding schemes only. The media delivery in-
analysis, capacity-planning and various network optimiza- dex (MDI) [1], on the other hand, is housed inside the
tion problems. network and provides statistics on delay factor and media
loss rate on a given stream. The scheme is too simple and
1 Introduction
there is an extremely loose correlation between network-
Network service providers are actively deploying triple ing events and a subjective perception of a playout at des-
and quadruple play networks that deliver voice, video and tination. Broadly speaking, these processes can be divided
data services over converged infrastructure. There is al- into two segments: processes in active network elements of
ready a need to deliver high quality real time streaming ap- the end-to-end journey (transcoding points), and intermedi-
plications like Video on Demand (VOD) and IPTV. Quality- ate events that potentially happen throughout the network.
of-service (QoS) parameters like bandwidth, delay, and jit- These two processes need to evolve independently, with
ter are typically used to guarantee services. These met- each process complimenting the other to reach the common
rics however fail to capture the subjectiveness associated goal of high quality user perceived streaming content.
with human perception and understanding. Network oper- We focus on the networking part and seek to understand
ators instead need to mutate policies based on quality-of- the effect of various network events on the final quality of
experience (QoE). It is not just sufficient to guarantee peak content at the end user. Our aim is to identify properties of
4
40
3.5
35
3
30 2.5
2
25
1.5
20
1
15
0.5
10 0
0 50 100 150 200 250 300 0 50 100 150 200 250 300
Video frames Video frames
(a) (b)
Figure 5. (a) PSNR fluctuations with a first I-
Frame loss; (b) Analogous case for VQM.
PER = 0.02
(a) (b)
2 Figure 6. (a) Original playout of the first GOP
at source; (b) Playout at destination.
1.5
35
3 42
0.7
30 2.5 41
2
25 40 0.6
1.5
39
20
0.5
1
38
15
0.5
37 0.4
0.4 0.42 0.44 0.46 0.48 0.5 0.52 0.54 0.56 0.4 0.42 0.44 0.46 0.48 0.5 0.52 0.54 0.56
10
0 50 100 150 200 250 300
0
0 50 100 150 200 250 300
Average queing delay Average queing delay
Video frames Video frames
(a) (b)
(a) (b)
Figure 7. (a) PSNR fluctuations with the third I- Figure 9. (a) PSNR with uniformly increasing
Frame packets in error; (b) The case for VQM. average delay; (b) The case for VQM.
46 2.4
2
44
1.8
43
1.6
42 1.4
1.2
41
1
40
0.8
39
0.6
38 0.4
0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5
Average jitter Average jitter