“ POLQA is the next-generation mobile voice quality testing standard, P.863, the successor of PESQ
“ POLQA stands for 'Perceptual Objective Listening Quality Assessment'
“ Standardised as Draft ITU-T P.863, following the history of P.861 'PSQM' and P.862 'PESQ'
“ Specially developed for HD Voice, 3G and 4G/LTE, VoIP
“ Offers a new level of benchmarking accuracy
“ A joint development of the POLQA consortium in the ITU-T
“ Network Testing
“ Network Testing and Optimisation
“ Drive testing and Benchmarking
“ IP
“ HD Voice
“ etc.
[Figure: standards timeline leading to POLQA, P.863 (draft), ??/2010. Other recoverable labels: E2E Network, WB Quality, Application Guide, Wide-band Extension (11/2005), Variable Delay and Time Scaling, Characterization phase.]
“ POLQA includes enhancement of performance for latest technologies within networks and handsets
” Suitable for new types of speech codecs as used in 3G/4G/LTE and also audio codecs, e.g. AAC, MP3
” Suitable for Voice Enhancement (VQE/VED) systems using non-linear processing to increase intelligibility
” Suitable for codecs that change or extend the audio bandwidth (e.g. using SBR)
” Allows for measurements with very high background noise
” Correct modelling of effects caused by variable sound presentation levels
” Offers narrowband and super-wideband (50Hz to 14000Hz) modes
” Can handle time-scaling and time-warping as seen in VoIP and 3G
” Can be used for signals recorded at acoustic interfaces
” Uses correct weighting of reverberation, linear and non-linear filtering
” Allows for direct comparison between AMR (GSM/UMTS) and EVRC (CDMA) coded transmissions
narrow-band       PESQ (P.862.1)   POLQA    Improvm.
Averaged rmse*    0.1857           0.1363   27%

wideband          PESQ (P.862.2)   POLQA    Improvm.
Averaged rmse*    0.3450           0.1506   56%
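The improvement figures can be reproduced from the averaged rmse* values. The sketch below assumes that "Improvm." means the relative reduction of the averaged rmse* when going from PESQ to POLQA; the slides do not state the formula, and the function name is hypothetical.

```python
def relative_improvement(rmse_pesq: float, rmse_polqa: float) -> float:
    """Relative rmse* reduction in percent (assumed meaning of 'Improvm.')."""
    return (rmse_pesq - rmse_polqa) / rmse_pesq * 100.0


# Averaged rmse* values taken from the slide.
results = [
    ("narrow-band", 0.1857, 0.1363),
    ("wideband", 0.3450, 0.1506),
]

for mode, pesq, polqa in results:
    print(f"{mode}: {relative_improvement(pesq, polqa):.0f}%")
# narrow-band: 27%
# wideband: 56%
```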
The root mean square error (RMSE) measures the differences between the values predicted by a model and
the subjective values obtained. It is a better measure of precision than the correlation factor. The rmse* is
similar to the rmse, but also takes the accuracy of the subjective experiment into account through its 95% confidence interval (ci95).
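\[
\mathrm{rmse}^{*} = \sqrt{\frac{1}{N-d} \sum_{i=1}^{N} P_{\mathrm{error}}(i)^{2}}
\]
Where….
\[
P_{\mathrm{error}}(i) = \max\bigl(0,\ \lvert \mathrm{MOSLQS}(i) - \mathrm{MOSLQO}(i) \rvert - ci_{95}(i)\bigr)
\]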
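As a minimal sketch, the rmse* can be computed as follows. It assumes the reading used in ITU-T statistical evaluation practice: the absolute prediction error of each condition is reduced by that condition's ci95 and floored at zero, and the squared errors are summed and divided by N - d, where N is the number of conditions and d the number of degrees of freedom of the mapping function. The function name and the sample values are illustrative only and do not come from the slides.

```python
import math


def rmse_star(mos_lqs, mos_lqo, ci95, d):
    """Epsilon-insensitive RMSE between subjective and objective MOS values.

    mos_lqs: subjective scores per condition
    mos_lqo: objective (predicted) scores per condition
    ci95:    95% confidence interval of each subjective score
    d:       degrees of freedom of the mapping function (caller's choice)
    """
    n = len(mos_lqs)
    if not (n == len(mos_lqo) == len(ci95)):
        raise ValueError("all inputs must have the same length")
    if n <= d:
        raise ValueError("need more conditions than degrees of freedom")

    total = 0.0
    for s, o, c in zip(mos_lqs, mos_lqo, ci95):
        # Errors inside the confidence interval do not count.
        p_error = max(0.0, abs(s - o) - c)
        total += p_error ** 2
    return math.sqrt(total / (n - d))


# Illustrative values (not from the slides); d=1 is an arbitrary choice here.
print(rmse_star([4.0, 3.0, 2.0], [3.5, 3.1, 2.6], [0.2, 0.2, 0.2], d=1))
```

In this example the second condition contributes nothing, because its error of 0.1 lies inside the confidence interval of 0.2.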
Moscow, 27-29 April 2011
ZNIIS / ITU Workshop POLQA Introduction - (c) OPTICOM GmbH 2010 13
Performance: Narrowband
[Figure: scatter plots of MOS-LQO Cond. versus MOS-LQS Cond., both axes 1 to 5]
PESQ (P.862) Performance - NB_8kHz504_SWISSQUAL: r = 0.82, rmse* = 0.4204
POLQA (P.863) Performance - NB_8kHz504_SWISSQUAL: r = 0.93, rmse* = 0.2311
27% improvement*
*Narrowband average rmse* improvement observed for all ITU tests
[Figure: scatter plots of MOS-LQO Cond. versus MOS-LQS Cond., both axes 1 to 5]
PESQ (P.862), panel title not recovered: r = 0.84, rmse* = 0.42
POLQA (P.863) Performance - WB_16kHz204_FTDT: r = 0.93, rmse* = 0.2319
56% average improvement*
*Wideband average improvement observed for all ITU tests
[Figure: scatter plots of MOS-LQO Cond. versus MOS-LQS Cond., both axes 1 to 5]
PESQ (P.862), panel title not recovered: r = 0.90, rmse* = 0.32
POLQA (P.863) Performance - WB_PSY_402_POLQA: r = 0.96, rmse* = 0.1839
56% average improvement*
*Wideband average rmse* improvement observed for all ITU tests
Two MOS Scales for All:
“ Fs = 8kHz: MOS NB
“ Fs = 48kHz: MOS SWB
[Figure: chart residue; the only recoverable label is GSM HR]
Roadmap
“ POLQA Development
“ POLQA Performance
“ Will POLQA Substitute PESQ?
“ Model overview
“ Who needs POLQA?
“ ... More Details
[Figure: model overview diagram; recoverable labels: Loops = 0, MOS LQO]
Core Model Block Diagram
[Figure: block diagram of the core model. Recoverable labels, not necessarily in processing order:]
“ Signals and delay information
“ Correct for Severe Amounts of: Level Variation, Frame Repeats (label appears twice)
“ Frequency Dewarping
“ Transfer to Bark Scale (label appears twice)
“ Transform to Excitation (label appears twice)
“ Calculate Frequency, Noise and Reverberation Distortion Indicators
“ Big Distortion Detection
“ Disturbance Density and Added Disturbance Density
“ Frequency, Noise and Reverberation Distortion Indicators
“ Disturbance Density (A Measure for the Audibility of Distortions)
What we Perceive …
In a subjective ACR experiment, POLQA, PESQ and human beings perceive the
following distortions (this list is far from complete):
Consequently, the hardware used for recording must support this as well!
                 SWB                      NB
Sample Rate      48kHz                    8, 16, 48kHz
Ref. Bandwidth   50..14000Hz              300..3400Hz
Ref. Level       -26dBov (73/79dBSPL)     -26dBov (79dBSPL)
Deg. Level       -21..-46dBov             -26dBov
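The table can be turned into a simple pre-flight check on a recording. The sketch below only transcribes the sample rates and degraded-level ranges from the table; the class, the function name and the level tolerance of 0.5 dB are assumptions made for illustration and are not part of the slides or of P.863.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModeRequirements:
    sample_rates_hz: tuple       # allowed sample rates
    deg_level_dbov: tuple        # (lowest, highest) allowed degraded level


# Values transcribed from the table; the structure itself is hypothetical.
REQUIREMENTS = {
    "SWB": ModeRequirements((48000,), (-46.0, -21.0)),
    "NB": ModeRequirements((8000, 16000, 48000), (-26.0, -26.0)),
}


def check_recording(mode, sample_rate_hz, deg_level_dbov, tolerance_db=0.5):
    """Return a list of problems; an empty list means the recording fits."""
    req = REQUIREMENTS[mode]
    problems = []
    if sample_rate_hz not in req.sample_rates_hz:
        problems.append(f"sample rate {sample_rate_hz} Hz not allowed in {mode}")
    low, high = req.deg_level_dbov
    # tolerance_db is an assumed allowance, not a value from the slides.
    if not (low - tolerance_db <= deg_level_dbov <= high + tolerance_db):
        problems.append(f"degraded level {deg_level_dbov} dBov outside {mode} range")
    return problems


print(check_recording("SWB", 48000, -30.0))   # []
print(check_recording("NB", 44100, -40.0))    # two problems reported
```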
[Figure: map; recoverable labels: OPTICOM Headquarters, Erlangen, GERMANY; China; Taiwan; Korea]
POLQA Summary
“ POLQA is an evolution of PESQ for current and new network technologies
“ Compared to PESQ, POLQA has higher correlation with subjective listening quality tests
“ It will be required by 3G, 4G/LTE and NGN operators optimising HD-Voice services
“ Test, measurement and DTT system vendors should prepare now for POLQA migration.
“ OPTICOM offers licensed solutions with both PESQ and POLQA
“ OPTICOM does not compete in the OEM T&M marketplace
” Vendors/OEMs are assured of commercial confidentiality
OPTICOM is the leading vendor for perceptual voice, audio and video quality testing.
[Figure: VED test setup; recoverable labels: (Clean) Reference Signal, Add Distortions, D, VED, Enhanced Signal, E, POLQA MOSD, POLQA MOSE]
The difference between MOSE and MOSD is a measure of the improvement caused by the
Voice Enhancement Device (VED).
[Figure: alignment flowchart. Recoverable labels:]
“ Is simple alignment problem?
“ Thorough Prealignment
“ Good solution found?
“ Allocate ref and deg sections
“ Search range
“ Coarse Alignment (multidimensional correlation-based search with iterative reduction of the down-sampling step)
“ Fine Alignment
“ Sample accurate delay per frame
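The idea behind the coarse and fine alignment labels, a correlation search that starts on a heavily down-sampled signal and repeats with a smaller down-sampling step, can be illustrated with a toy example. This is not the POLQA algorithm: it estimates one global delay from a one-dimensional correlation, whereas the flowchart mentions a multidimensional search and a delay per frame. All names, the block-averaging decimator and the halving schedule are assumptions.

```python
def decimate(signal, step):
    """Average consecutive blocks of `step` samples (crude low-pass + downsample)."""
    return [sum(signal[i:i + step]) / step
            for i in range(0, len(signal) - step + 1, step)]


def correlation(ref, deg, lag):
    """Sum of ref[n] * deg[n + lag] over all n where both samples exist."""
    return sum(ref[n] * deg[n + lag]
               for n in range(len(ref)) if 0 <= n + lag < len(deg))


def estimate_delay(ref, deg, max_lag, start_step=8):
    """Coarse-to-fine delay search; start_step must be a power of two."""
    step = start_step
    lo, hi = -max_lag, max_lag
    while True:
        ref_d, deg_d = decimate(ref, step), decimate(deg, step)
        # Candidate lags, expressed in down-sampled samples.
        candidates = range(lo // step, hi // step + 1)
        best = max(candidates, key=lambda k: correlation(ref_d, deg_d, k)) * step
        if step == 1:
            return best  # sample-accurate delay
        # Narrow the search range around the estimate and halve the step.
        lo, hi = best - 2 * step, best + 2 * step
        step //= 2


# Toy signals: a triangular pulse and the same pulse delayed by 37 samples.
ref = [max(0, 32 - abs(n - 100)) for n in range(256)]
deg = [max(0, 32 - abs(n - 137)) for n in range(256)]
print(estimate_delay(ref, deg, max_lag=64))  # 37
```

The coarse passes only have to land close to the true delay; the final pass at step 1 then searches a few samples around that estimate, which keeps the number of full-rate correlations small.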
POLQA Sharpened Loudness
“ In POLQA the smeared spectrum is only used as a factor in the sharpening of the spectrum
[Figure: spectra plotted over the Bark axis in dB and in Sone; recoverable labels: Completely Masked, Partially Masked, "Smearing", (Sharpening) Suppression, Convert to Sone, Loudness]
“ Advantage 1: High resolution in the pitch domain remains, analysis of the spectral fine structures is possible
“ Advantage 2: Masked threshold is not a 'hard clipper'. A small range above the threshold may remain.