P01 - Pengantar

Digital Signal Processing:
An Introduction and Some Examples of its

Everyday Use
Ardik Wijayanto
ardik@eepis-its.edu
Target
• Sampling
• FFT
• Filter
Slide 2
Materi Dalam 1 Semester
• Pendahuluan (1 TM)
• Dasar DSP : Sinyal dan Sistem (2 TM) + 1 Tugas
• ADC dan Segala Aspeknya (1 TM)
• Persamaan Beda Koef Linier Konstan (2 TM)
• Konvolusi (1 TM) + 1 Tugas
• Transformasi Laplace (1 TM)
• Transformasi Z (1 TM)
• Filter Digital dan Segala Aspeknya (2 TM) + 1 Tugas
• Transformasi Fourier (1 TM) + 1 Tugas
TM = Tatap Muka
DSP ???
Contents
• What is DSP?
• What is DSP used for?
– Speech & Audio processing
– Image & Video processing
– Adaptive filtering
• DSP Devices and Architectures
• Summary & Conclusions
Slide 5
What is DSP?
• Digital Signal Processing – the processing
or manipulation of signals using digital
techniques
Digital
Input Output
Signal
ADC Signal DAC Signal
Analogue Processor Digital to
to Digital Analogue
Converter Converter
Slide 6
PSD - Ardik / Bima
PSD - Ardik / Bima
PSD - Ardik / Bima
What is DSP Used For?
…And much more!

Slide 10
Slide 11
The world is filled with signals…
• 2-D signals:
– Photographs
• 1-D signals:
– Medical images
– Seismic vibrations
– Radar
– EEG and EKG
– Satellite data
– Speech
– Fax
– Sonar
– Fingerprints…
– Audio & music
– Dow-Jones averages
And of course there are 3-d signals (video, volumetric data sets…)
and beyond…
Slide 12
And we want to process them…
• Recognize what’s in a signal
– Target detection • Enhance a signal
– Speech recognition – Image contrast enhancement
– Image analysis
• Compress a signal
• Predict a future value of the – Faster transmission
signal – Less storage space
– Stock market prediction
• Synthesize a realistic example of
• Interpolate missing values a signal
of a signal – Speech synthesis
– Conceal lost video packets – Image texture generation
• Restore a signal that has • Choose specific input signals to
been degraded control a process
– Noise removal – Chemical process control
– Echo cancellation – Aerodynamic control
Slide 13
PSD - Ardik / Bima
Slide 15
Speech Processing
• Speech coding/compression
• Speech synthesis
• Speech recognition
Slide 16
Some Properties of Speech
The blue--- s---p--o---------t i-s--on--the-- k--ey a---g--ai----n------
“oo”
“e”
“ee”
“o”
“s” in
“k”in
in
in “blue”
in“again”
“spot”
“key”
“key”
Slide 17
Some Properties of Speech
Vowels
“oo” in “blue” “o” in “spot” “ee” in “key” “e” in “again”
•Quasi-periodic
•Relatively high signal power
Consonants
“s” in “spot” “k” in “key”
•Non-periodic (random)
•Relatively low signal power
Slide 18
Speech Coding
TRAU
MSC
64 kbits/s
22.8 kbits/s
BSC
13 kbits/s
BTS
Slide 19
Speech Coding – Linear Prediction
• Try to predict the current sample value;
• Transmit the prediction error.
s(n)
+ d(n) d(n)
– …
A(z)  
se(n) + sr(n)
+
A(z)
se(n)
Slide 20
Speech Coding – Vocoder
Encoder
Original Speech
Analysis:
• Voiced/Unvoiced decision
• Pitch Period (voiced only)
• Signal power (Gain)
Pitch Decoder
Period Signal Power
Pulse Train V/U
Vocal Tract
G Model
Synthesized Speech
LPC-10:
Random Noise
Slide 21
Text-to-Speech Synthesis
Input To be or
text not to be
that is the Tu bee awr phonetic form
question nawt tu bee
dhat iz dhe
kwestchun
Text
normalization Parsing Pronunciation
expands semantic & phonetic description
abbreviations syntactic ‘parts of each word, dictionary
dates, times, of speech’ with letter-to-sound
money..etc analysis of text rules as a back up
Prosody Waveform Synthesized

rules generation speech
Apply word Phonetic-to-
stress, duration acoustic
and pitch transformation
Text-to-speech synthesis sounds very natural these days.

Slide 22
Speech Synthesis Applications
• Speaking clocks
• Spoken (variable) announcements
• Talking emails + talking heads for mobile
• Synthesis of location-based information
(e.g. traffic information)
• Interactive systems (e.g. catalogue ordering,
Yellow Pages, ...)
Slide 23
Speech/Speaker Recognition
• Speech Recognition – What has been spoken?
– Speaker dependent – Recognition system trained
for a particular person’s voice.
– Speaker independent – Recognition system
expected to deal with a wide variety of speakers.
• Speaker Recognition – Who has spoken?
• Not easy…
Sometimestherearenogapsbetweenwords.
Sometim esthereareg aps inthe mid dleofwords.
Accents, dialects and Stress eggsist.
Slide 24
Speech Recognition System
Phoneme Word Semantic

models pronunciation knowledge
Feature Phoneme Word Sentence

speech extraction recognition recognition recognition decision
Syntactic Dialogue
knowledge knowledge
Slide 25
Digital Audio
• Standard music CD:

– Sampling Rate: 44.1 kHz
– 16-bit samples
– 2-channel stereo
– Data transfer rate = 21644,100 = 1.4 Mbits/s
– 1 hour of music = 1.43,600 = 635 MB
Slide 26
Audio Coding (Cont’d)
• Key standards:
– MPEG: Layers I, II, and III (MP3); AAC.
• used in DAB, DVD
– Dolby AC3, Dolby Digital, Dolby Surround.
• Typical bit rates for 2-channel stereo:
– 64kbits/s to 384 kbits/s.
• Subband- or transform-based, making use
of perceptual masking properties.
Slide 27
Audio Coding (Cont’d)
• Typical 3/2 multichannel stereo configuration:
Surround
Right
Right
Centre
Left Surround
Left
• 5.1 channels (3/2) with LFE channel:

– Left, Right, Centre,
– Left Surround, Right Surround,
– Low Frequency Effects (LFE) (Reduced Bandwidth).
• LFE loudspeaker can, in general, be placed anywhere in the
listening room.
Slide 28
Audio Coding – Masking
• Auditory Masking:
– Spectral: Strong frequency components mask weaker
neighbouring frequency components.
– Temporal: Strong temporal events mask recent and
future events.
Spectral Masking Temporal Masking

SPL/dB SPL/dB
1 freq/kHz 10ms 160ms time
Slide 29
Masking Example
60
50
40
dB
30
20
10
200 300 400 500 600 700 800
Hz
Slide 30
Image/Video
• Still Image Coding:
– JPEG (Joint Photographic Experts Group):
• Discrete Cosine Transform (DCT) based
– JPEG2000: Wavelet Transform based
• Video Coding:
– MPEG (Moving Pictures Experts Group):
• DCT-based,
• Interframe and intraframe prediction,
• Motion estimation.
– Applications: Digital TV, DVD, etc.
Slide 31
JPEG Example
Original
JPEG (4:1) JPEG (100:1)
Slide 32
Video compression: Example of a
packet loss & need to interpolate:
Packet loss means

a horizontal slice of
data is missing
The decoder holds

over the slice from
the previous frame
to conceal the loss.
Because of camera
pan, slice from
previous frame
doesn’t line up well
Interpolating the missing data in the current frame using the data from
above & below would likely provide Slide 33
better visual concealment of the loss
Example: contrast enhancement
Original magnetic resonance Contrast enhancement using
brain scan histogram equalization
Slide 34
ECE161C: DSP II
• Image processing and computer vision

• Topics include:
– image formation: cameras, radiometry,
and color
– 2D DSP, Discrete Cosine Transform
– Filtering, Edge detection
– Multiscale representations, texture
– Least squares model fitting, motion
– Statistics
– Principal components and face detection
– Video compression
Slide 35
Quantization of 24-bit true color
down to 8-bit color:
24-bit color original 8-bit color quantized version
Slide 36 the quantization banding on the right

A little random dithering would help mask
ECE 172A – Introduction to
Intelligent Systems
Main topics covered in the course

include:
1. Introduction to Intelligent Systems and
Sensor-based Robots
2. Model-Based approach in perception
3. Image segmentation
4. Edge Detection
5. Region growing
6. Texture analysis
7. Object recognition and image
understanding
8. Extraction of 3-dimensional cues: passive Image Classified as a
and active approaches Input Image Day Image
Project (about 5 weeks long):

1. Vehicle Detection & Re-identification
2. Person Detection and Tracking
3. Robust Image Classification
Slide 37
Adaptive Filtering
• Self-learning: Filter coefficients adapt in response
to training signal.
d(n)
+
– 
x(n) W(z) e(n)
y(n)
• Filter update: Least Mean Squares (LMS) algorithm
w(n  1)  w(n)  2e(n)x(n)

Slide 38
Adaptive Filtering Applications
• Echo cancellation (telephone lines)

– Used in modems (making Internet access possible!!)
• Acoustic echo cancellation
– Hands-free telephony
• Adaptive equalization
• Active noise control
• Medical signal processing
– e.g. foetal heart beat monitoring
Slide 39
Some Other Application Areas
• Image analysis, e.g:
– Face recognition,
– Optical Character Recognition (OCR);
• Restoration of old image, video, and audio signals;
• Analysis of RADAR data;
• Analysis of SONAR data;
• Data transmission (modems, radio, echo
cancellation, channel equalization, etc.);
• Storage and archiving;
• Control of electric motors.
Slide 40
DSP Devices & Architectures
• Selecting a DSP – several choices:
– Fixed-point;
– Floating point;
– Application-specific devices
(e.g. FFT processors, speech recognizers,etc.).
• Main DSP Manufacturers:
– Texas Instruments (http://www.ti.com)
– Motorola (http://www.motorola.com)
– Analog Devices (http://www.analog.com)
Slide 41
Typical DSP Operations
• Filtering L 1
• Energy of Signal y ( n)   ai x(n  i )
i 0
• Frequency transforms
Pseudo C code
for (n=0; n<N; n++)
{
s=0;
for (i=0; i<L; i++)
{
s += a[i] * x[n-i];
}
y[n] = s;
}
Slide 42
Traditional DSP Architecture
X RAM x(n-i) ai Y RAM
Multiply/Accumulate
Accumulator
y(n)
N.B. Most modern DSPs have more advanced features.

Slide 43
DSP at EPSON
“Energy-saving Firmware”
EPSON Scotland Design Centre develops a
broad range of technologies to minimize
power consumption and maximize cost
effectiveness in mobile DSP applications.
Slide 44
SDC Core Skills
DSP Speech Audio Mobile Services
System modelling Speech compression MP3 Baseband processing Administration
Firmware design Speech Recognition Other digital audio Channel coding CAD Tools
System Integration Speech synthesis Performance AMR Coding Computer

Assessment &
Networking
CPU (Oak, ARM) Speech enhancement
H/w & S/w Speech Testing

Co-design
System on Chip (SoC)
Slide 45
SDC Firmware Development
Algorithm
Definition
Floating-point
and COSSAP
Fixed-point Matlab ...
Co-Simulation
Co-Design Behavioural,
RTL, Logic ...
Implementation Co-Verification MCU, DSP ...
Product Development With Barcelona and Tokyo

Design Centres
Slide 46
Summary & Conclusions
• DSP used in a wide range of everyday applications
• Looked at:
– Speech coding; Speech synthesis & recognition;
– Image/Video;
– Adaptive filtering.
• Other areas include:
– Image analysis (e.g. face recognition, OCR, etc.);
– RADAR/SONAR;
– Data transmission and reception;
– And many more…..!!
Slide 47

P01 - Pengantar

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

P01 - Pengantar

Uploaded by

Copyright:

Available Formats

Digital Signal Processing:

An Introduction and Some Examples of its

…And much more!

The blue--- s---p--o---------t i-s--on--the-- k--ey a---g--ai----n------

“oo” in “blue” “o” in “spot” “ee” in “key” “e” in “again”

“s” in “spot” “k” in “key”

Prosody Waveform Synthesized

Text-to-speech synthesis sounds very natural these days.

Phoneme Word Semantic

Feature Phoneme Word Sentence

• Standard music CD:

• 5.1 channels (3/2) with LFE channel:

Spectral Masking Temporal Masking

1 freq/kHz 10ms 160ms time

JPEG (4:1) JPEG (100:1)

Packet loss means

The decoder holds

• Image processing and computer vision

Slide 36 the quantization banding on the right

Main topics covered in the course

Project (about 5 weeks long):

• Filter update: Least Mean Squares (LMS) algorithm

w(n  1)  w(n)  2e(n)x(n)

• Echo cancellation (telephone lines)

X RAM x(n-i) ai Y RAM

N.B. Most modern DSPs have more advanced features.

System modelling Speech compression MP3 Baseband processing Administration

System Integration Speech synthesis Performance AMR Coding Computer

H/w & S/w Speech Testing

System on Chip (SoC)

Implementation Co-Verification MCU, DSP ...

Product Development With Barcelona and Tokyo

You might also like