Speech Quality Assessment

Uploaded by

chinku kumar

SPEECH QUALITY ASSESSMENT

PRESENTED BY: CHINKU KUMAR VEHERA (CRF202575)

CONTENTS

• Introduction
• Model description
• Databases
• Model training and results
• Conclusion
• References
INTRODUCTION

• The speech quality of voice communication services has improved rapidly over the last decades.
• One reason for the improved quality is the extension of the transmission bandwidth from
narrowband (NB), with a bandwidth of 300 - 3400 Hz, to wideband (WB), with 100 -
7000 Hz.
• More recently, the quality was further improved by the introduction of super-wideband
(SWB) transmission in speech communication networks, with a bandwidth of 50 - 14000
Hz.
• Here we present NISQA, a non-intrusive speech quality assessment model which, in
contrast to the current state of the art, can predict the quality of super-wideband speech
transmission.
• Signal-based models can be divided into two groups:
  a. Intrusive models
  b. Non-intrusive models
• Intrusive models require both the degraded output signal of the transmission system and
the clean original input signal.
• Non-intrusive (single-ended) models rely only on the degraded output signal of the
transmission system.
• The long-standing standards for NB and WB speech quality assessment from the
Telecommunication Standardization Sector of the International Telecommunication Union
(ITU-T) have been PESQ and WB-PESQ.
• They have now been replaced by POLQA (ITU-T Rec. P.863), the current ITU-T
recommendation, which also covers SWB transmission.
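
The bandwidths quoted above translate directly into sampling-rate requirements. The sketch below applies the Nyquist criterion (sampling rate greater than twice the upper band edge) to each band. The list of common sampling rates is an assumption of this sketch and is not taken from the slides.

```python
# Band edges in Hz as quoted in the introduction.
BANDS_HZ = {
    "NB": (300, 3400),
    "WB": (100, 7000),
    "SWB": (50, 14000),
}

# Common audio sampling rates in Hz (an assumption of this sketch).
COMMON_RATES_HZ = (8000, 16000, 32000, 48000)


def min_sampling_rate(band: str) -> int:
    """Return the smallest listed rate above twice the upper band edge."""
    _, upper_edge = BANDS_HZ[band]
    nyquist_rate = 2 * upper_edge
    for rate in COMMON_RATES_HZ:
        if rate > nyquist_rate:
            return rate
    raise ValueError(f"no listed rate covers band {band!r}")


for name, edges in BANDS_HZ.items():
    print(name, edges, min_sampling_rate(name))
```

Under these assumptions, NB fits into 8 kHz sampling, WB into 16 kHz, and SWB needs 32 kHz.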
MODEL DESCRIPTION
• The SWB speech quality estimator NISQA is based on a convolutional neural network
(CNN) that estimates the speech quality for each frame of the input signal.
• The estimated per-frame quality values are then aggregated over time by a recurrent
neural network (RNN).
• An advantage of RNNs is that they accept time sequences of different lengths as input.
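
To illustrate why a recurrent model copes with inputs of different lengths, here is a toy sketch of time aggregation. It uses a single scalar Elman-style recurrence with hand-picked weights. This is not the trained NISQA network, and the function name, weights, and the input scaling of roughly -1 to 1 are all hypothetical.

```python
import math


def aggregate_quality(frame_scores, w_in=0.5, w_rec=0.5, bias=0.0):
    """Fold a variable-length list of per-frame scores into one value.

    Toy recurrence with a scalar state. The weights are hypothetical
    constants, not trained model parameters.
    """
    state = 0.0
    for score in frame_scores:
        # The same update is applied at every step, so any length works.
        state = math.tanh(w_in * score + w_rec * state + bias)
    # Map the final state from (-1, 1) to a MOS-like range of (1, 5).
    return 3.0 + 2.0 * state


short_file = [0.8, 0.7, 0.9]
long_file = [0.8, 0.7, -0.5, -0.6, 0.2, 0.9, 0.8, 0.7]
print(aggregate_quality(short_file))
print(aggregate_quality(long_file))
```

The loop reuses one set of weights per time step, which is what lets the same model process a 6 s file and a 12 s file without padding or cropping.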
NISQA MODEL
• The CNN-LSTM approach has two advantages. First, the per-frame quality gives some
insight into the cause of a quality degradation.
• Second, the approach helps to regularize the training of the RNN.
• On a per-frame basis a large amount of training data is available, but on a per-file basis
the data is limited, so it is important to keep the input feature size of the RNN small.
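
One way per-frame quality can hint at the cause of a degradation is by showing where in the file the quality drops. The following sketch finds contiguous runs of frames below a threshold. The function name, the threshold, and the example values are made up for illustration and do not come from the model.

```python
def low_quality_regions(frame_quality, threshold):
    """Return (start, end) index pairs of runs where quality < threshold.

    The end index is exclusive.
    """
    regions = []
    start = None
    for i, quality in enumerate(frame_quality):
        if quality < threshold and start is None:
            start = i  # a low-quality run begins
        elif quality >= threshold and start is not None:
            regions.append((start, i))  # the run ended at the previous frame
            start = None
    if start is not None:
        regions.append((start, len(frame_quality)))
    return regions


# Hypothetical per-frame quality values on a 1 to 5 scale.
per_frame = [4.2, 4.1, 1.5, 1.2, 4.0, 4.3, 2.0, 4.1]
print(low_quality_regions(per_frame, threshold=3.0))  # [(2, 4), (6, 7)]
```

Short, isolated dips like these would be consistent with burst-like impairments, whereas a uniformly low curve would point to a degradation that affects the whole file.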
DATABASES

• Overall, 29 databases with typical P.800 double sentences with a duration of 6 -
12 s were available.
• All SWB test set databases from the POLQA pool were used as our test set, and all
SWB training databases were included in our training set.
• Many of the databases use the same reference signals.
MODEL TRAINING AND RESULTS

• To train the model, we first calculated the per-frame similarity between the
degraded and the original signal with POLQA v2 in the SQuadAnalyzer
implementation.
• We then aligned the per-frame similarity with the spectrogram segments
using nearest-neighbor interpolation.
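
The alignment step can be sketched as follows: each spectrogram segment takes the similarity value of the frame whose time stamp is closest to it. The frame spacings of 20 ms and 15 ms and all values are hypothetical; the slides do not state the actual frame rates.

```python
from bisect import bisect_left


def nearest_neighbor_align(src_times, src_values, target_times):
    """Sample src_values at target_times using the nearest source time.

    src_times must be non-empty and sorted in ascending order.
    Ties go to the earlier source frame.
    """
    aligned = []
    for t in target_times:
        pos = bisect_left(src_times, t)
        if pos == 0:
            idx = 0  # target lies before the first source frame
        elif pos == len(src_times):
            idx = len(src_times) - 1  # target lies after the last one
        else:
            before, after = src_times[pos - 1], src_times[pos]
            idx = pos - 1 if (t - before) <= (after - t) else pos
        aligned.append(src_values[idx])
    return aligned


# Hypothetical timings in milliseconds.
sim_times = [0, 20, 40, 60]        # per-frame similarity time stamps
sim_values = [0.9, 0.8, 0.3, 0.7]  # per-frame similarity values
seg_times = [0, 15, 30, 45, 60]    # spectrogram segment time stamps
print(nearest_neighbor_align(sim_times, sim_values, seg_times))
# [0.9, 0.8, 0.8, 0.3, 0.7]
```

Nearest-neighbor interpolation only copies existing values, so the aligned targets never contain similarity values that the reference measurement did not produce.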
[Figure: correlation diagram of the best- and worst-case results]
CONCLUSION
• We presented NISQA, a new non-intrusive speech quality assessment model for SWB
transmission.
• The proposed model gives good prediction results on the same test set that was used for
the POLQA validation, with an average RMSE*3rd (epsilon-insensitive RMSE after
third-order mapping) of 0.29 and a worst-case RMSE*3rd of 0.37.
• NISQA is able to predict the speech quality of packet loss concealment conditions of
modern speech codecs.
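
For readers unfamiliar with the RMSE* figure, the sketch below shows an epsilon-insensitive RMSE, assuming the definition in ITU-T Rec. P.1401: prediction errors that fall inside the 95% confidence interval of the subjective score count as zero, and the sum is normalized by N - d, where d is the number of mapping coefficients (4 for a third-order polynomial). The third-order mapping itself is left out, so the predictions are assumed to be already mapped. All numbers are invented for illustration.

```python
import math


def rmse_star(mos_subjective, mos_predicted, ci95, dof=4):
    """Epsilon-insensitive RMSE over per-condition values.

    Errors inside the 95% confidence interval count as zero. `dof` is
    the number of mapping coefficients (4 for a third-order polynomial).
    """
    n = len(mos_subjective)
    if n <= dof:
        raise ValueError("need more conditions than degrees of freedom")
    total = 0.0
    for subj, pred, ci in zip(mos_subjective, mos_predicted, ci95):
        err = max(0.0, abs(subj - pred) - ci)
        total += err * err
    return math.sqrt(total / (n - dof))


# Invented per-condition values, not results from the presentation.
subjective = [1.5, 2.2, 3.0, 3.8, 4.4, 4.6]
predicted = [1.7, 2.1, 3.6, 3.7, 4.3, 4.0]
ci95 = [0.2, 0.2, 0.2, 0.2, 0.2, 0.2]
print(round(rmse_star(subjective, predicted, ci95), 3))
```

Only the third and sixth conditions miss their confidence interval here, so only they contribute to the result.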
REFERENCES

• ITU-T Rec. P.863, “Perceptual objective listening quality assessment.”
• D. Kim and A. Tarraf, “ANIQUE+: A new American national standard for non-intrusive
estimation of narrowband speech quality,” Bell Labs Technical Journal, vol. 12, no. 1, pp.
221–236, Spring 2007.
• S.-W. Fu, Y. Tsao, H.-T. Hwang, and H.-M. Wang, “Quality-Net: An end-to-end
non-intrusive speech quality assessment model based on BLSTM,” in Proc. Interspeech
2018, 2018, pp. 1873–1877.
