Ardik Wijayanto
ardik@eepis-its.edu
Target
• Sampling
• FFT
• Filter
Slide 2
Course Material for One Semester
• Introduction (1 TM)
• DSP Basics: Signals and Systems (2 TM) + 1 assignment
• ADC and All Its Aspects (1 TM)
• Linear Constant-Coefficient Difference Equations (2 TM)
• Convolution (1 TM) + 1 assignment
• Laplace Transform (1 TM)
• Z-Transform (1 TM)
• Digital Filters and All Their Aspects (2 TM) + 1 assignment
• Fourier Transform (1 TM) + 1 assignment
TM = Tatap Muka (class session)
DSP ???
Contents
• What is DSP?
• What is DSP used for?
– Speech & Audio processing
– Image & Video processing
– Adaptive filtering
• DSP Devices and Architectures
• Summary & Conclusions
Slide 5
What is DSP?
• Digital Signal Processing – the processing
or manipulation of signals using digital
techniques
Input Signal → ADC (Analogue to Digital Converter) → Digital Signal Processor → DAC (Digital to Analogue Converter) → Output Signal
Slide 6
PSD - Ardik / Bima
What is DSP Used For?
And of course there are 3-d signals (video, volumetric data sets…)
and beyond…
Slide 12
And we want to process them…
• Recognize what’s in a signal
– Target detection
– Speech recognition
– Image analysis
• Predict a future value of the signal
– Stock market prediction
• Interpolate missing values of a signal
– Conceal lost video packets
• Restore a signal that has been degraded
– Noise removal
– Echo cancellation
• Enhance a signal
– Image contrast enhancement
• Compress a signal
– Faster transmission
– Less storage space
• Synthesize a realistic example of a signal
– Speech synthesis
– Image texture generation
• Choose specific input signals to control a process
– Chemical process control
– Aerodynamic control
Slide 13
Slide 15
Speech Processing
• Speech coding/compression
• Speech synthesis
• Speech recognition
Slide 16
Some Properties of Speech
[Figure: waveforms of example speech sounds “oo”, “e”, “ee”, “o”, “s”, and “k”, taken from words such as “blue”, “again”, “spot”, and “key”]
Slide 17
Some Properties of Speech
Vowels
• Quasi-periodic
• Relatively high signal power
Consonants
• Non-periodic (random)
• Relatively low signal power
Slide 18
Speech Coding
[Figure: speech path through the mobile network elements BTS, BSC, TRAU, and MSC, with labelled bit rates of 13 kbits/s, 22.8 kbits/s, and 64 kbits/s]
Slide 19
Speech Coding – Linear Prediction
• Try to predict the current sample value;
• Transmit the prediction error.
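Encoder: d(n) = s(n) - se(n), where se(n) is the prediction of s(n) formed by A(z)
Channel: the prediction error d(n) is transmitted
Decoder: sr(n) = d(n) + se(n), where se(n) is formed by the same predictor A(z)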
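The predict-and-transmit-the-error idea above can be sketched in a few lines of C. This is only an illustration under stated assumptions: the slide does not give A(z), so the fixed predictor se(n) = 2·s(n-1) - s(n-2) and the function names predict, lp_encode, and lp_decode are hypothetical, and no quantization of d(n) is modelled.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical fixed predictor (an assumption, not taken from the slides):
   se(n) = 2*s(n-1) - s(n-2), i.e. straight-line extrapolation.
   Samples before the start of the signal are treated as zero. */
static int predict(const int *past, size_t n)
{
    int s1 = (n >= 1) ? past[n - 1] : 0;
    int s2 = (n >= 2) ? past[n - 2] : 0;
    return 2 * s1 - s2;
}

/* Encoder: d(n) = s(n) - se(n). Only d(n) would be transmitted. */
void lp_encode(const int *s, int *d, size_t len)
{
    assert(s != NULL && d != NULL);
    for (size_t n = 0; n < len; n++) {
        d[n] = s[n] - predict(s, n);
    }
}

/* Decoder: sr(n) = d(n) + se(n), where se(n) is predicted from the
   samples that have already been reconstructed. */
void lp_decode(const int *d, int *sr, size_t len)
{
    assert(d != NULL && sr != NULL);
    for (size_t n = 0; n < len; n++) {
        sr[n] = d[n] + predict(sr, n);
    }
}
```

With integer samples and no quantization the decoder reproduces the input exactly, and wherever the signal follows a straight line the transmitted error is zero, which is why the error is cheaper to send than the samples themselves.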
Slide 20
Speech Coding – Vocoder
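Encoder
Original Speech → Analysis:
• Voiced/Unvoiced (V/U) decision
• Pitch Period (voiced only)
• Signal power (Gain)
Decoder
• Excitation: Pulse Train (spaced by the Pitch Period) or Random Noise, selected by V/U
• Gain G (from the Signal Power)
• Vocal Tract Model
→ Synthesized Speech
LPC-10 is an example of this kind of vocoder.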
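The decoder side of the vocoder above can be sketched as an excitation generator followed by an all-pole filter. This is a rough illustration only: the function names make_excitation and vocal_tract, the uniform noise source, and the coefficient layout are assumptions, and the actual LPC-10 parameters are not reproduced here.

```c
#include <assert.h>
#include <stdlib.h>

/* Build the excitation for one frame: a pulse train when voiced,
   uniform random noise when unvoiced, both scaled by the gain G. */
void make_excitation(float *e, int len, int voiced, int pitch_period, float gain)
{
    assert(e != NULL && len >= 0);
    assert(!voiced || pitch_period > 0);
    for (int n = 0; n < len; n++) {
        if (voiced) {
            /* one pulse every pitch_period samples */
            e[n] = (n % pitch_period == 0) ? gain : 0.0f;
        } else {
            /* uniform noise in [-gain, +gain] */
            float u = (float)rand() / (float)RAND_MAX;
            e[n] = gain * (2.0f * u - 1.0f);
        }
    }
}

/* Assumed all-pole vocal tract model:
   y(n) = e(n) + sum_{k=1..order} a[k-1] * y(n-k), with y(n) = 0 for n < 0. */
void vocal_tract(const float *a, int order, const float *e, float *y, int len)
{
    assert(a != NULL && e != NULL && y != NULL);
    for (int n = 0; n < len; n++) {
        float acc = e[n];
        for (int k = 1; k <= order && k <= n; k++) {
            acc += a[k - 1] * y[n - k];
        }
        y[n] = acc;
    }
}
```

Only the V/U flag, the pitch period, the gain, and the filter coefficients need to be sent per frame, which is where the low bit rate of this approach comes from.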
Slide 21
Text-to-Speech Synthesis
Input text: “To be or not to be that is the question”
Phonetic form: “Tu bee awr nawt tu bee dhat iz dhe kwestchun”
Processing stages:
• Text normalization: expands abbreviations, dates, times, money, etc.
• Parsing: semantic & syntactic ‘parts of speech’ analysis of text
• Pronunciation: phonetic description of each word; dictionary with letter-to-sound rules as a backup
• Speaking clocks
• Spoken (variable) announcements
• Talking emails + talking heads for mobile
• Synthesis of location-based information
(e.g. traffic information)
• Interactive systems (e.g. catalogue ordering,
Yellow Pages, ...)
Slide 23
Speech/Speaker Recognition
• Speech Recognition – What has been spoken?
– Speaker dependent – Recognition system trained
for a particular person’s voice.
– Speaker independent – Recognition system
expected to deal with a wide variety of speakers.
• Speaker Recognition – Who has spoken?
• Not easy…
Sometimestherearenogapsbetweenwords.
Sometim esthereareg aps inthe mid dleofwords.
Accents, dialects and Stress eggsist.
Slide 24
Speech Recognition System
[Figure: block diagram of a speech recognition system that draws on syntactic knowledge and dialogue knowledge]
Slide 25
Digital Audio
Slide 26
Audio Coding (Cont’d)
• Key standards:
– MPEG: Layers I, II, and III (MP3); AAC.
• used in DAB, DVD
– Dolby AC3, Dolby Digital, Dolby Surround.
• Typical bit rates for 2-channel stereo:
– 64 kbits/s to 384 kbits/s.
• Subband- or transform-based, making use
of perceptual masking properties.
Slide 27
Audio Coding (Cont’d)
• Typical 3/2 multichannel stereo configuration:
– Front: Left, Centre, Right
– Rear: Left Surround, Right Surround
Slide 29
Masking Example
[Figure: masking example; level in dB (10 to 60) plotted against frequency in Hz (200 to 800)]
Slide 30
Image/Video
• Still Image Coding:
– JPEG (Joint Photographic Experts Group):
• Discrete Cosine Transform (DCT) based
– JPEG2000: Wavelet Transform based
• Video Coding:
– MPEG (Moving Picture Experts Group):
• DCT-based,
• Interframe and intraframe prediction,
• Motion estimation.
– Applications: Digital TV, DVD, etc.
Slide 31
JPEG Example
Original
Slide 32
Video compression: Example of a
packet loss & need to interpolate:
Because of camera pan, the slice from the previous frame doesn’t line up well.
Interpolating the missing data in the current frame using the data from above & below would likely provide better visual concealment of the loss.
Slide 33
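The concealment idea described above, filling the lost slice from the rows above and below it in the same frame, can be sketched as follows. This is a minimal illustration assuming an 8-bit grayscale frame stored row by row; the function name conceal_rows and its parameters are hypothetical, and real decoders use more elaborate concealment.

```c
#include <assert.h>

/* Fill rows first_lost..last_lost of a grayscale frame by linear
   interpolation between the intact row just above the lost slice and
   the intact row just below it. */
void conceal_rows(unsigned char *frame, int width, int height,
                  int first_lost, int last_lost)
{
    assert(frame != 0 && width > 0);
    assert(first_lost >= 1 && last_lost < height - 1 && first_lost <= last_lost);

    const unsigned char *above = frame + (first_lost - 1) * width;
    const unsigned char *below = frame + (last_lost + 1) * width;
    int span = last_lost - first_lost + 2;   /* distance between intact rows */

    for (int r = first_lost; r <= last_lost; r++) {
        int wb = r - (first_lost - 1);       /* weight of the row below */
        int wa = span - wb;                  /* weight of the row above */
        for (int c = 0; c < width; c++) {
            /* weighted average, rounded to the nearest integer */
            frame[r * width + c] =
                (unsigned char)((wa * above[c] + wb * below[c] + span / 2) / span);
        }
    }
}
```

Because it uses only data from the current frame, this approach is unaffected by the camera pan that makes the previous frame a poor substitute.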
Example: contrast enhancement
[Figure: original magnetic resonance brain scan, shown beside the same scan after contrast enhancement using histogram equalization]
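Histogram equalization, as named in the example above, remaps grey levels so that their cumulative distribution becomes roughly linear. The sketch below is one common textbook formulation for 8-bit grayscale data; the function name hist_equalize is hypothetical and nothing here is taken from the software used to produce the slide.

```c
#include <assert.h>
#include <stddef.h>

/* Histogram equalization of an 8-bit grayscale image, in place. */
void hist_equalize(unsigned char *img, size_t npix)
{
    size_t hist[256] = {0};
    unsigned char map[256];
    assert(img != NULL && npix > 0);

    /* 1. Count how often each grey level occurs. */
    for (size_t i = 0; i < npix; i++) hist[img[i]]++;

    /* 2. Turn the cumulative histogram into a level mapping.
          cdf_min is the cumulative count at the darkest level present. */
    size_t cdf = 0, cdf_min = 0;
    for (int v = 0; v < 256; v++) {
        cdf += hist[v];
        if (cdf_min == 0) cdf_min = cdf;
        if (cdf_min == npix) {
            map[v] = (unsigned char)v;      /* single-level image: unchanged */
        } else {
            size_t den = npix - cdf_min;
            map[v] = (unsigned char)(((cdf - cdf_min) * 255 + den / 2) / den);
        }
    }

    /* 3. Apply the mapping to every pixel. */
    for (size_t i = 0; i < npix; i++) img[i] = map[img[i]];
}
```

Levels that were crowded into a narrow range are spread across 0 to 255, which is what makes low-contrast detail easier to see.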
Slide 34
ECE161C: DSP II
Slide 35
Quantization of 24-bit true color
down to 8-bit color:
[Figure: 24-bit color original beside the 8-bit color quantized version]
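One simple way to go from 24-bit to 8-bit color is uniform 3-3-2 quantization: keep the top 3 bits of red, the top 3 bits of green, and the top 2 bits of blue. The slide does not say which method produced its image (palette-based methods are also common), so the sketch below is an assumption chosen for brevity, and the names rgb_to_332 and rgb_from_332 are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Pack 24-bit RGB into 8 bits: RRRGGGBB. */
uint8_t rgb_to_332(uint8_t r, uint8_t g, uint8_t b)
{
    return (uint8_t)((r & 0xE0) | ((g & 0xE0) >> 3) | (b >> 6));
}

/* Expand an RRRGGGBB value back to 24 bits so it can be displayed.
   The kept bits are replicated so that full scale maps back to 255. */
void rgb_from_332(uint8_t q, uint8_t *r, uint8_t *g, uint8_t *b)
{
    assert(r != 0 && g != 0 && b != 0);
    uint8_t r3 = (uint8_t)(q >> 5);
    uint8_t g3 = (uint8_t)((q >> 2) & 0x07);
    uint8_t b2 = (uint8_t)(q & 0x03);
    *r = (uint8_t)((r3 << 5) | (r3 << 2) | (r3 >> 1));
    *g = (uint8_t)((g3 << 5) | (g3 << 2) | (g3 >> 1));
    *b = (uint8_t)(b2 * 85);    /* 0, 85, 170, 255 */
}
```

Blue gets the fewest bits because the eye is least sensitive to it; the visible cost of the scheme is banding in smooth color gradients.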
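[Figure: adaptive filter block diagram; the input x(n) passes through the filter W(z) to give y(n), and a summing junction (+/–) produces the error e(n)]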
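The signals x(n), y(n), and e(n) around W(z) suggest a filter whose coefficients are adjusted to reduce the error. The slide does not name the adaptation rule, so the sketch below uses the least-mean-squares (LMS) update purely as an assumed example; the desired-signal argument, the step size mu, the tap count LMS_TAPS, and the function name lms_step are all hypothetical.

```c
#include <assert.h>

#define LMS_TAPS 4   /* hypothetical filter length */

/* One LMS iteration.
   xbuf[0] is the newest input sample x(n), xbuf[1] is x(n-1), and so on.
   Computes y(n) = sum_i w[i] * x(n-i), forms the error
   e(n) = desired - y(n), then nudges each weight along e(n) * x(n-i).
   Returns e(n); the weights w are updated in place. */
float lms_step(float *w, const float *xbuf, float desired, float mu)
{
    assert(w != 0 && xbuf != 0);

    float y = 0.0f;
    for (int i = 0; i < LMS_TAPS; i++) {
        y += w[i] * xbuf[i];
    }

    float e = desired - y;
    for (int i = 0; i < LMS_TAPS; i++) {
        w[i] += mu * e * xbuf[i];
    }
    return e;
}
```

Calling lms_step once per sample lets the weights track a slowly changing environment, which is the property that applications such as echo cancellation and channel equalization rely on.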
Slide 39
Some Other Application Areas
• Image analysis, e.g.:
– Face recognition,
– Optical Character Recognition (OCR);
• Restoration of old image, video, and audio signals;
• Analysis of RADAR data;
• Analysis of SONAR data;
• Data transmission (modems, radio, echo
cancellation, channel equalization, etc.);
• Storage and archiving;
• Control of electric motors.
Slide 40
DSP Devices & Architectures
• Selecting a DSP – several choices:
– Fixed-point;
– Floating point;
– Application-specific devices
(e.g. FFT processors, speech recognizers, etc.).
• Main DSP Manufacturers:
– Texas Instruments (http://www.ti.com)
– Motorola (http://www.motorola.com)
– Analog Devices (http://www.analog.com)
Slide 41
Typical DSP Operations
• Filtering: $y(n) = \sum_{i=0}^{L-1} a_i \, x(n-i)$
• Energy of Signal
• Frequency transforms
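C code
/* FIR filter: y(n) = sum over i = 0..L-1 of a[i] * x(n-i),
   with x(n) taken as 0 for n < 0. */
void fir_filter(const float *a, int L, const float *x, float *y, int N)
{
    for (int n = 0; n < N; n++)
    {
        float s = 0.0f;                          /* accumulator */
        for (int i = 0; i < L && i <= n; i++)    /* i <= n keeps x[n-i] in range */
        {
            s += a[i] * x[n - i];                /* multiply/accumulate */
        }
        y[n] = s;
    }
}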
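The second operation in the list above, the energy of a signal, is the same multiply/accumulate pattern applied to a sample and itself. A minimal sketch, assuming the usual definition E = Σ x(n)² over a finite block of N samples; the function name signal_energy is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Energy of a finite-length signal: E = sum_{n=0}^{N-1} x(n)^2.
   The sum is kept in double precision to limit rounding error
   when many samples are accumulated. */
double signal_energy(const float *x, size_t N)
{
    double acc = 0.0;
    assert(x != NULL || N == 0);
    for (size_t n = 0; n < N; n++) {
        acc += (double)x[n] * (double)x[n];   /* multiply/accumulate */
    }
    return acc;
}
```

Like the filter loop, this is one multiply and one add per sample, which is the operation DSP hardware is built to execute quickly.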
Slide 42
Traditional DSP Architecture
[Figure: multiply/accumulate (MAC) unit feeding an accumulator that produces y(n)]
“Energy-saving Firmware”
EPSON Scotland Design Centre develops a
broad range of technologies to minimize
power consumption and maximize cost
effectiveness in mobile DSP applications.
Slide 44
SDC Core Skills
• DSP: Firmware design
• Speech: Speech Recognition
• Audio: Other digital audio
• Mobile: Channel coding
• Services: CAD Tools
Slide 45
SDC Firmware Development
[Figure: firmware development flow; Algorithm Definition; Floating-point and Fixed-point modelling (Matlab, COSSAP, ...); Co-Simulation and Co-Design (Behavioural, RTL, Logic, ...)]
Slide 46
Summary & Conclusions
• DSP is used in a wide range of everyday applications
• Looked at:
– Speech coding; Speech synthesis & recognition;
– Image/Video;
– Adaptive filtering.
• Other areas include:
– Image analysis (e.g. face recognition, OCR, etc.);
– RADAR/SONAR;
– Data transmission and reception;
– And many more…
Slide 47