You are on page 1of 47

Digital Signal Processing:

An Introduction and Some Examples of its


Everyday Use

Ardik Wijayanto
ardik@eepis-its.edu
Target
• Sampling
• FFT
• Filter

Slide 2
Materi Dalam 1 Semester
• Pendahuluan (1 TM)
• Dasar DSP : Sinyal dan Sistem (2 TM) + 1 Tugas
• ADC dan Segala Aspeknya (1 TM)
• Persamaan Beda Koef Linier Konstan (2 TM)
• Konvolusi (1 TM) + 1 Tugas
• Transformasi Laplace (1 TM)
• Transformasi Z (1 TM)
• Filter Digital dan Segala Aspeknya (2 TM) + 1 Tugas
• Transformasi Fourier (1 TM) + 1 Tugas

TM = Tatap Muka
DSP ???
Contents
• What is DSP?
• What is DSP used for?
– Speech & Audio processing
– Image & Video processing
– Adaptive filtering
• DSP Devices and Architectures
• Summary & Conclusions

Slide 5
What is DSP?
• Digital Signal Processing – the processing
or manipulation of signals using digital
techniques

Digital
Input Output
Signal
ADC Signal DAC Signal
Analogue Processor Digital to
to Digital Analogue
Converter Converter

Slide 6
PSD - Ardik / Bima
PSD - Ardik / Bima
PSD - Ardik / Bima
What is DSP Used For?

…And much more!


Slide 10
Slide 11
The world is filled with signals…
• 2-D signals:
– Photographs
• 1-D signals:
– Medical images
– Seismic vibrations
– Radar
– EEG and EKG
– Satellite data
– Speech
– Fax
– Sonar
– Fingerprints…
– Audio & music
– Dow-Jones averages

And of course there are 3-d signals (video, volumetric data sets…)
and beyond…
Slide 12
And we want to process them…
• Recognize what’s in a signal
– Target detection • Enhance a signal
– Speech recognition – Image contrast enhancement
– Image analysis
• Compress a signal
• Predict a future value of the – Faster transmission
signal – Less storage space
– Stock market prediction
• Synthesize a realistic example of
• Interpolate missing values a signal
of a signal – Speech synthesis
– Conceal lost video packets – Image texture generation
• Restore a signal that has • Choose specific input signals to
been degraded control a process
– Noise removal – Chemical process control
– Echo cancellation – Aerodynamic control

Slide 13
PSD - Ardik / Bima
Slide 15
Speech Processing

• Speech coding/compression
• Speech synthesis
• Speech recognition

Slide 16
Some Properties of Speech

The blue--- s---p--o---------t i-s--on--the-- k--ey a---g--ai----n------

“oo”
“e”
“ee”
“o”
“s” in
“k”in
in
in “blue”
in“again”
“spot”
“key”
“key”

Slide 17
Some Properties of Speech
Vowels

“oo” in “blue” “o” in “spot” “ee” in “key” “e” in “again”

•Quasi-periodic
•Relatively high signal power

Consonants

“s” in “spot” “k” in “key”

•Non-periodic (random)
•Relatively low signal power

Slide 18
Speech Coding

TRAU

MSC
64 kbits/s
22.8 kbits/s
BSC
13 kbits/s

BTS

Slide 19
Speech Coding – Linear Prediction
• Try to predict the current sample value;
• Transmit the prediction error.

s(n)
+ d(n) d(n)
– …
A(z)  
se(n) + sr(n)
+
A(z)
se(n)

Slide 20
Speech Coding – Vocoder
Encoder
Original Speech
Analysis:
• Voiced/Unvoiced decision
• Pitch Period (voiced only)
• Signal power (Gain)

Pitch Decoder
Period Signal Power
Pulse Train V/U
Vocal Tract
G Model

Synthesized Speech
LPC-10:
Random Noise

Slide 21
Text-to-Speech Synthesis
Input To be or
text not to be
that is the Tu bee awr phonetic form
question nawt tu bee
dhat iz dhe
kwestchun

Text
normalization Parsing Pronunciation
expands semantic & phonetic description
abbreviations syntactic ‘parts of each word, dictionary
dates, times, of speech’ with letter-to-sound
money..etc analysis of text rules as a back up

Prosody Waveform Synthesized


rules generation speech
Apply word Phonetic-to-
stress, duration acoustic
and pitch transformation

Text-to-speech synthesis sounds very natural these days.


Slide 22
Speech Synthesis Applications

• Speaking clocks
• Spoken (variable) announcements
• Talking emails + talking heads for mobile
• Synthesis of location-based information
(e.g. traffic information)
• Interactive systems (e.g. catalogue ordering,
Yellow Pages, ...)

Slide 23
Speech/Speaker Recognition
• Speech Recognition – What has been spoken?
– Speaker dependent – Recognition system trained
for a particular person’s voice.
– Speaker independent – Recognition system
expected to deal with a wide variety of speakers.
• Speaker Recognition – Who has spoken?
• Not easy…
Sometimestherearenogapsbetweenwords.
Sometim esthereareg aps inthe mid dleofwords.
Accents, dialects and Stress eggsist.
Slide 24
Speech Recognition System

Phoneme Word Semantic


models pronunciation knowledge

Feature Phoneme Word Sentence


speech extraction recognition recognition recognition decision

Syntactic Dialogue
knowledge knowledge

Slide 25
Digital Audio

• Standard music CD:


– Sampling Rate: 44.1 kHz
– 16-bit samples
– 2-channel stereo
– Data transfer rate = 21644,100 = 1.4 Mbits/s
– 1 hour of music = 1.43,600 = 635 MB

Slide 26
Audio Coding (Cont’d)
• Key standards:
– MPEG: Layers I, II, and III (MP3); AAC.
• used in DAB, DVD
– Dolby AC3, Dolby Digital, Dolby Surround.
• Typical bit rates for 2-channel stereo:
– 64kbits/s to 384 kbits/s.
• Subband- or transform-based, making use
of perceptual masking properties.

Slide 27
Audio Coding (Cont’d)
• Typical 3/2 multichannel stereo configuration:
Surround
Right
Right

Centre

Left Surround
Left

• 5.1 channels (3/2) with LFE channel:


– Left, Right, Centre,
– Left Surround, Right Surround,
– Low Frequency Effects (LFE) (Reduced Bandwidth).
• LFE loudspeaker can, in general, be placed anywhere in the
listening room.
Slide 28
Audio Coding – Masking
• Auditory Masking:
– Spectral: Strong frequency components mask weaker
neighbouring frequency components.
– Temporal: Strong temporal events mask recent and
future events.

Spectral Masking Temporal Masking


SPL/dB SPL/dB

1 freq/kHz 10ms 160ms time

Slide 29
Masking Example

60

50

40
dB

30

20

10
200 300 400 500 600 700 800

Hz

Slide 30
Image/Video
• Still Image Coding:
– JPEG (Joint Photographic Experts Group):
• Discrete Cosine Transform (DCT) based
– JPEG2000: Wavelet Transform based
• Video Coding:
– MPEG (Moving Pictures Experts Group):
• DCT-based,
• Interframe and intraframe prediction,
• Motion estimation.
– Applications: Digital TV, DVD, etc.

Slide 31
JPEG Example
Original

JPEG (4:1) JPEG (100:1)

Slide 32
Video compression: Example of a
packet loss & need to interpolate:

Packet loss means


a horizontal slice of
data is missing

The decoder holds


over the slice from
the previous frame
to conceal the loss.

Because of camera
pan, slice from
previous frame
doesn’t line up well

Interpolating the missing data in the current frame using the data from
above & below would likely provide Slide 33
better visual concealment of the loss
Example: contrast enhancement
Original magnetic resonance Contrast enhancement using
brain scan histogram equalization

Slide 34
ECE161C: DSP II

• Image processing and computer vision


• Topics include:
– image formation: cameras, radiometry,
and color
– 2D DSP, Discrete Cosine Transform
– Filtering, Edge detection
– Multiscale representations, texture
– Least squares model fitting, motion
– Statistics
– Principal components and face detection
– Video compression

Slide 35
Quantization of 24-bit true color
down to 8-bit color:
24-bit color original 8-bit color quantized version

Slide 36 the quantization banding on the right


A little random dithering would help mask
ECE 172A – Introduction to
Intelligent Systems

Main topics covered in the course


include:
1. Introduction to Intelligent Systems and
Sensor-based Robots
2. Model-Based approach in perception
3. Image segmentation
4. Edge Detection
5. Region growing
6. Texture analysis
7. Object recognition and image
understanding
8. Extraction of 3-dimensional cues: passive Image Classified as a
and active approaches Input Image Day Image

Project (about 5 weeks long):


1. Vehicle Detection & Re-identification
2. Person Detection and Tracking
3. Robust Image Classification
Slide 37
Adaptive Filtering
• Self-learning: Filter coefficients adapt in response
to training signal.
d(n)

+
– 
x(n) W(z) e(n)
y(n)

• Filter update: Least Mean Squares (LMS) algorithm

w(n  1)  w(n)  2e(n)x(n)


Slide 38
Adaptive Filtering Applications

• Echo cancellation (telephone lines)


– Used in modems (making Internet access possible!!)
• Acoustic echo cancellation
– Hands-free telephony
• Adaptive equalization
• Active noise control
• Medical signal processing
– e.g. foetal heart beat monitoring

Slide 39
Some Other Application Areas
• Image analysis, e.g:
– Face recognition,
– Optical Character Recognition (OCR);
• Restoration of old image, video, and audio signals;
• Analysis of RADAR data;
• Analysis of SONAR data;
• Data transmission (modems, radio, echo
cancellation, channel equalization, etc.);
• Storage and archiving;
• Control of electric motors.

Slide 40
DSP Devices & Architectures
• Selecting a DSP – several choices:
– Fixed-point;
– Floating point;
– Application-specific devices
(e.g. FFT processors, speech recognizers,etc.).
• Main DSP Manufacturers:
– Texas Instruments (http://www.ti.com)
– Motorola (http://www.motorola.com)
– Analog Devices (http://www.analog.com)

Slide 41
Typical DSP Operations
• Filtering L 1
• Energy of Signal y ( n)   ai x(n  i )
i 0
• Frequency transforms

Pseudo C code
for (n=0; n<N; n++)
{
s=0;
for (i=0; i<L; i++)
{
s += a[i] * x[n-i];
}
y[n] = s;
}

Slide 42
Traditional DSP Architecture

X RAM x(n-i) ai Y RAM

Multiply/Accumulate

Accumulator

y(n)

N.B. Most modern DSPs have more advanced features.


Slide 43
DSP at EPSON

“Energy-saving Firmware”
EPSON Scotland Design Centre develops a
broad range of technologies to minimize
power consumption and maximize cost
effectiveness in mobile DSP applications.

Slide 44
SDC Core Skills
DSP Speech Audio Mobile Services

System modelling Speech compression MP3 Baseband processing Administration

Firmware design Speech Recognition Other digital audio Channel coding CAD Tools

System Integration Speech synthesis Performance AMR Coding Computer


Assessment &
Networking
CPU (Oak, ARM) Speech enhancement

H/w & S/w Speech Testing


Co-design

System on Chip (SoC)

Slide 45
SDC Firmware Development
Algorithm
Definition

Floating-point
and COSSAP
Fixed-point Matlab ...
Co-Simulation

Co-Design Behavioural,
RTL, Logic ...

Implementation Co-Verification MCU, DSP ...

Product Development With Barcelona and Tokyo


Design Centres

Slide 46
Summary & Conclusions
• DSP used in a wide range of everyday applications
• Looked at:
– Speech coding; Speech synthesis & recognition;
– Image/Video;
– Adaptive filtering.
• Other areas include:
– Image analysis (e.g. face recognition, OCR, etc.);
– RADAR/SONAR;
– Data transmission and reception;
– And many more…..!!

Slide 47

You might also like