You are on page 1of 31

Digital Signal Processing:

An Introduction and Some Examples of its Everyday Use Dr D. H. Crawford EPSON Scotland Design Centre

Contents
What is DSP? What is DSP used for?
Speech & Audio processing Image & Video processing Adaptive filtering

DSP Devices and Architectures DSP at EPSON Scotland Design Centre Summary & Conclusions
Slide 2

What is DSP?
Digital Signal Processing the processing or manipulation of signals using digital techniques
Digital Signal Processor

Input Signal

ADC
Analogue to Digital Converter

DAC
Digital to Analogue Converter

Output Signal

Slide 3

What is DSP Used For?

And much more!


Slide 4

Speech Processing

Speech coding/compression Speech synthesis Speech recognition


Slide 5

Some Properties of Speech

The blue--- s---p--o---------t i-s--on--the-- k--ey a---g--ai----n------

oo ininblue e in again eein spot o in key s k key

Slide 6

Some Properties of Speech


Vowels

oo in blue

o in spot

ee in key

e in again

Quasi-periodic Relatively high signal power Consonants


s in spot k in key

Non-periodic (random) Relatively low signal power


Slide 7

Speech Coding
TRAU

MSC

64 kbits/s BSC 13 kbits/s BTS 22.8 kbits/s

Slide 8

Speech Coding Linear Prediction


Try to predict the current sample value; Transmit the prediction error.
s(n) A(z) se(n) +

d(n)

d(n) +

+ se(n) A(z)

sr(n)

Slide 9

Speech Coding Vocoder


Encoder
Original Speech Analysis: Voiced/Unvoiced decision Pitch Period (voiced only) Signal power (Gain)

Pitch Period

Decoder
Pulse Train Signal Power V/U G
Vocal Tract Model

Random Noise

LPC-10:
Slide 10

Synthesized Speech

Text-to-Speech Synthesis
Input text
To be or not to be that is the question

Tu bee awr nawt tu bee dhat iz dhe kwestchun Parsing semantic & syntactic parts of speech analysis of text

phonetic form

Text normalization expands abbreviations dates, times, money..etc

Pronunciation phonetic description of each word, dictionary with letter-to-sound rules as a back up

Prosody rules Apply word stress, duration and pitch

Waveform generation Phonetic-toacoustic transformation

Synthesized speech

Text-to-speech synthesis sounds very natural these days.


Slide 11

Speech Synthesis Applications


Speaking clocks Spoken (variable) announcements Talking emails + talking heads for mobile Synthesis of location-based information (e.g. traffic information) Interactive systems (e.g. catalogue ordering, Yellow Pages, ...)
Slide 12

Speech/Speaker Recognition
Speech Recognition What has been spoken?
Speaker dependent Recognition system trained for a particular persons voice. Speaker independent Recognition system expected to deal with a wide variety of speakers.

Speaker Recognition Who has spoken? Not easy


Sometimestherearenogapsbetweenwords. Sometim esthereareg aps inthe mid dleofwords.

Accents, dialects and Stress eggsist.


Slide 13

Speech Recognition System

Phoneme models

Word pronunciation

Semantic knowledge

speech

Feature extraction

Phoneme recognition

Word recognition

Sentence recognition

decision

Syntactic knowledge

Dialogue knowledge

Slide 14

Digital Audio
Standard music CD:
Sampling Rate: 44.1 kHz 16-bit samples 2-channel stereo Data transfer rate = 21644,100 = 1.4 Mbits/s 1 hour of music = 1.43,600 = 635 MB

Slide 15

Audio Coding (Contd)


Key standards:
MPEG: Layers I, II, and III (MP3); AAC.
used in DAB, DVD

Dolby AC3, Dolby Digital, Dolby Surround.

Typical bit rates for 2-channel stereo:


64kbits/s to 384 kbits/s.

Subband- or transform-based, making use of perceptual masking properties.


Slide 16

Audio Coding (Contd)


Typical 3/2 multichannel stereo configuration:
Right Centre Left Surround Left Surround Right

5.1 channels (3/2) with LFE channel:


Left, Right, Centre, Left Surround, Right Surround, Low Frequency Effects (LFE) (Reduced Bandwidth).
LFE loudspeaker can, in general, be placed anywhere in the listening room.
Slide 17

Audio Coding Masking


Auditory Masking:
Spectral: Strong frequency components mask weaker neighbouring frequency components. Temporal: Strong temporal events mask recent and future events.
Spectral Masking
SPL/dB SPL/dB

Temporal Masking

freq/kHz

10ms

160ms

time

Slide 18

Masking Example

60

50

dB

40

30

20

10 200 300 400 500 Hz 600 700 800

Slide 19

Image/Video
Still Image Coding:
JPEG (Joint Photographic Experts Group):
Discrete Cosine Transform (DCT) based

JPEG2000: Wavelet Transform based

Video Coding:
MPEG (Moving Pictures Experts Group): DCT-based,
Interframe and intraframe prediction, Motion estimation.

Applications: Digital TV, DVD, etc.


Slide 20

JPEG Example
Original

JPEG (4:1)

JPEG (100:1)

Slide 21

Adaptive Filtering
Self-learning: Filter coefficients adapt in response to training signal.
d(n)

x(n)

W(z)

y(n)

e(n)

Filter update: Least Mean Squares (LMS) algorithm


w(n + 1) = w(n) + 2e(n)x(n)
Slide 22

Adaptive Filtering Applications


Echo cancellation (telephone lines)
Used in modems (making Internet access possible!!)

Acoustic echo cancellation


Hands-free telephony

Adaptive equalization Active noise control Medical signal processing


e.g. foetal heart beat monitoring

Slide 23

Some Other Application Areas


Image analysis, e.g:
Face recognition, Optical Character Recognition (OCR);

Restoration of old image, video, and audio signals; Analysis of RADAR data; Analysis of SONAR data; Data transmission (modems, radio, echo cancellation, channel equalization, etc.); Storage and archiving; Control of electric motors.
Slide 24

DSP Devices & Architectures


Selecting a DSP several choices:
Fixed-point; Floating point; Application-specific devices (e.g. FFT processors, speech recognizers,etc.).

Main DSP Manufacturers:


Texas Instruments (http://www.ti.com) Motorola (http://www.motorola.com) Analog Devices (http://www.analog.com)

Slide 25

Typical DSP Operations


Filtering Energy of Signal Frequency transforms

y ( n) =

L 1 i =0

ai x(n i)

Pseudo C code for (n=0; n<N; n++) { s=0; for (i=0; i<L; i++) { s += a[i] * x[n-i]; } y[n] = s; }

Slide 26

Traditional DSP Architecture


X RAM x(n-i) ai Y RAM

Multiply/Accumulate

Accumulator

y(n)

N.B. Most modern DSPs have more advanced features.


Slide 27

DSP at EPSON

Energy-saving Firmware
EPSON Scotland Design Centre develops a broad range of technologies to minimize power consumption and maximize cost effectiveness in mobile DSP applications.

Slide 28

SDC Core Skills


DSP
System modelling Firmware design System Integration CPU (Oak, ARM) H/w & S/w Co-design

Speech
Speech compression Speech Recognition Speech synthesis Speech enhancement Speech Testing

Audio
MP3 Other digital audio Performance Assessment

Mobile
Baseband processing Channel coding AMR Coding

Services
Administration CAD Tools Computer & Networking

System on Chip (SoC)

Slide 29

SDC Firmware Development


Algorithm Definition Floating-point and Fixed-point Co-Simulation Co-Design Implementation COSSAP Matlab ... Behavioural, RTL, Logic ...

Co-Verification MCU, DSP ...

Product Development With Barcelona and Tokyo Design Centres


Slide 30

Summary & Conclusions


DSP used in a wide range of everyday applications Looked at:
Speech coding; Speech synthesis & recognition; Image/Video; Adaptive filtering.

Other areas include:


Image analysis (e.g. face recognition, OCR, etc.); RADAR/SONAR; Data transmission and reception; And many more..!!
Slide 31

You might also like