David Crawford Epson

Digital Signal Processing:
An Introduction and Some Examples of its Everyday Use Dr D. H. Crawford EPSON Scotland Design Centre
Contents
What is DSP? What is DSP used for?
Speech & Audio processing Image & Video processing Adaptive filtering
DSP Devices and Architectures DSP at EPSON Scotland Design Centre Summary & Conclusions
Slide 2
What is DSP?
Digital Signal Processing the processing or manipulation of signals using digital techniques
Digital Signal Processor
Input Signal
ADC
Analogue to Digital Converter
DAC
Digital to Analogue Converter
Output Signal
Slide 3
What is DSP Used For?
And much more!

Slide 4
Speech Processing
Speech coding/compression Speech synthesis Speech recognition

Slide 5
Some Properties of Speech
The blue--- s---p--o---------t i-s--on--the-- k--ey a---g--ai----n------
oo ininblue e in again eein spot o in key s k key
Slide 6
Some Properties of Speech

Vowels
oo in blue
o in spot
ee in key
e in again
Quasi-periodic Relatively high signal power Consonants

s in spot k in key
Non-periodic (random) Relatively low signal power

Slide 7
Speech Coding
TRAU
MSC
64 kbits/s BSC 13 kbits/s BTS 22.8 kbits/s
Slide 8
Speech Coding Linear Prediction

Try to predict the current sample value; Transmit the prediction error.
s(n) A(z) se(n) +
d(n)
d(n) +
+ se(n) A(z)
sr(n)
Slide 9
Speech Coding Vocoder

Encoder
Original Speech Analysis: Voiced/Unvoiced decision Pitch Period (voiced only) Signal power (Gain)
Pitch Period
Decoder
Pulse Train Signal Power V/U G
Vocal Tract Model
Random Noise
LPC-10:
Slide 10
Synthesized Speech
Text-to-Speech Synthesis
Input text
To be or not to be that is the question
Tu bee awr nawt tu bee dhat iz dhe kwestchun Parsing semantic & syntactic parts of speech analysis of text
phonetic form
Text normalization expands abbreviations dates, times, money..etc
Pronunciation phonetic description of each word, dictionary with letter-to-sound rules as a back up
Prosody rules Apply word stress, duration and pitch
Waveform generation Phonetic-toacoustic transformation
Synthesized speech
Text-to-speech synthesis sounds very natural these days.

Slide 11
Speech Synthesis Applications

Speaking clocks Spoken (variable) announcements Talking emails + talking heads for mobile Synthesis of location-based information (e.g. traffic information) Interactive systems (e.g. catalogue ordering, Yellow Pages, ...)
Slide 12
Speech/Speaker Recognition
Speech Recognition What has been spoken?
Speaker dependent Recognition system trained for a particular persons voice. Speaker independent Recognition system expected to deal with a wide variety of speakers.
Speaker Recognition Who has spoken? Not easy

Sometimestherearenogapsbetweenwords. Sometim esthereareg aps inthe mid dleofwords.
Accents, dialects and Stress eggsist.

Slide 13
Speech Recognition System
Phoneme models
Word pronunciation
Semantic knowledge
speech
Feature extraction
Phoneme recognition
Word recognition
Sentence recognition
decision
Syntactic knowledge
Dialogue knowledge
Slide 14
Digital Audio
Standard music CD:
Sampling Rate: 44.1 kHz 16-bit samples 2-channel stereo Data transfer rate = 21644,100 = 1.4 Mbits/s 1 hour of music = 1.43,600 = 635 MB
Slide 15
Audio Coding (Contd)

Key standards:
MPEG: Layers I, II, and III (MP3); AAC.
used in DAB, DVD
Dolby AC3, Dolby Digital, Dolby Surround.
Typical bit rates for 2-channel stereo:

64kbits/s to 384 kbits/s.
Subband- or transform-based, making use of perceptual masking properties.

Slide 16
Audio Coding (Contd)

Typical 3/2 multichannel stereo configuration:
Right Centre Left Surround Left Surround Right
5.1 channels (3/2) with LFE channel:

Left, Right, Centre, Left Surround, Right Surround, Low Frequency Effects (LFE) (Reduced Bandwidth).
LFE loudspeaker can, in general, be placed anywhere in the listening room.
Slide 17
Audio Coding Masking

Auditory Masking:
Spectral: Strong frequency components mask weaker neighbouring frequency components. Temporal: Strong temporal events mask recent and future events.
Spectral Masking
SPL/dB SPL/dB
Temporal Masking
freq/kHz
10ms
160ms
time
Slide 18
Masking Example
60
50
dB
40
30
20
10 200 300 400 500 Hz 600 700 800
Slide 19
Image/Video
Still Image Coding:
JPEG (Joint Photographic Experts Group):
Discrete Cosine Transform (DCT) based
JPEG2000: Wavelet Transform based
Video Coding:
MPEG (Moving Pictures Experts Group): DCT-based,
Interframe and intraframe prediction, Motion estimation.
Applications: Digital TV, DVD, etc.

Slide 20
JPEG Example
Original
JPEG (4:1)
JPEG (100:1)
Slide 21
Adaptive Filtering
Self-learning: Filter coefficients adapt in response to training signal.
d(n)
x(n)
W(z)
y(n)
e(n)
Filter update: Least Mean Squares (LMS) algorithm

w(n + 1) = w(n) + 2e(n)x(n)
Slide 22
Adaptive Filtering Applications

Echo cancellation (telephone lines)
Used in modems (making Internet access possible!!)
Acoustic echo cancellation

Hands-free telephony
Adaptive equalization Active noise control Medical signal processing

e.g. foetal heart beat monitoring
Slide 23
Some Other Application Areas

Image analysis, e.g:
Face recognition, Optical Character Recognition (OCR);
Restoration of old image, video, and audio signals; Analysis of RADAR data; Analysis of SONAR data; Data transmission (modems, radio, echo cancellation, channel equalization, etc.); Storage and archiving; Control of electric motors.
Slide 24
DSP Devices & Architectures

Selecting a DSP several choices:
Fixed-point; Floating point; Application-specific devices (e.g. FFT processors, speech recognizers,etc.).
Main DSP Manufacturers:

Texas Instruments (http://www.ti.com) Motorola (http://www.motorola.com) Analog Devices (http://www.analog.com)
Slide 25
Typical DSP Operations

Filtering Energy of Signal Frequency transforms
y ( n) =
L 1 i =0
ai x(n i)
Pseudo C code for (n=0; n<N; n++) { s=0; for (i=0; i<L; i++) { s += a[i] * x[n-i]; } y[n] = s; }
Slide 26
Traditional DSP Architecture

X RAM x(n-i) ai Y RAM
Multiply/Accumulate
Accumulator
y(n)
N.B. Most modern DSPs have more advanced features.

Slide 27
DSP at EPSON
Energy-saving Firmware
EPSON Scotland Design Centre develops a broad range of technologies to minimize power consumption and maximize cost effectiveness in mobile DSP applications.
Slide 28
SDC Core Skills

DSP
System modelling Firmware design System Integration CPU (Oak, ARM) H/w & S/w Co-design
Speech
Speech compression Speech Recognition Speech synthesis Speech enhancement Speech Testing
Audio
MP3 Other digital audio Performance Assessment
Mobile
Baseband processing Channel coding AMR Coding
Services
Administration CAD Tools Computer & Networking
System on Chip (SoC)
Slide 29
SDC Firmware Development

Algorithm Definition Floating-point and Fixed-point Co-Simulation Co-Design Implementation COSSAP Matlab ... Behavioural, RTL, Logic ...
Co-Verification MCU, DSP ...
Product Development With Barcelona and Tokyo Design Centres

Slide 30
Summary & Conclusions

DSP used in a wide range of everyday applications Looked at:
Speech coding; Speech synthesis & recognition; Image/Video; Adaptive filtering.
Other areas include:

Image analysis (e.g. face recognition, OCR, etc.); RADAR/SONAR; Data transmission and reception; And many more..!!
Slide 31

David Crawford Epson

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

David Crawford Epson

Uploaded by

Copyright:

Available Formats

Digital Signal Processing:

What is DSP Used For?

And much more!

Speech coding/compression Speech synthesis Speech recognition

Some Properties of Speech

The blue--- s---p--o---------t i-s--on--the-- k--ey a---g--ai----n------

oo ininblue e in again eein spot o in key s k key

Some Properties of Speech

Quasi-periodic Relatively high signal power Consonants

Non-periodic (random) Relatively low signal power

64 kbits/s BSC 13 kbits/s BTS 22.8 kbits/s

Speech Coding Linear Prediction

Speech Coding Vocoder

Text normalization expands abbreviations dates, times, money..etc

Prosody rules Apply word stress, duration and pitch

Waveform generation Phonetic-toacoustic transformation

Text-to-speech synthesis sounds very natural these days.

Speech Synthesis Applications

Speaker Recognition Who has spoken? Not easy

Accents, dialects and Stress eggsist.

Speech Recognition System

Audio Coding (Contd)

Dolby AC3, Dolby Digital, Dolby Surround.

Typical bit rates for 2-channel stereo:

Subband- or transform-based, making use of perceptual masking properties.

Audio Coding (Contd)

5.1 channels (3/2) with LFE channel:

Audio Coding Masking

10 200 300 400 500 Hz 600 700 800

JPEG2000: Wavelet Transform based

Applications: Digital TV, DVD, etc.

Filter update: Least Mean Squares (LMS) algorithm

Adaptive Filtering Applications

Acoustic echo cancellation

Adaptive equalization Active noise control Medical signal processing

Some Other Application Areas

DSP Devices & Architectures

Main DSP Manufacturers:

Typical DSP Operations

Traditional DSP Architecture

N.B. Most modern DSPs have more advanced features.

SDC Core Skills

System on Chip (SoC)

SDC Firmware Development

Co-Verification MCU, DSP ...

Product Development With Barcelona and Tokyo Design Centres

Summary & Conclusions

Other areas include:

You might also like