Professional Documents
Culture Documents
Statistics
INTRODUCTION
Signal-to-Noise Ratio (SNR)
Histogram of Signals
2
INTRODUCTION
Digital Signal Processing is distinguished from other areas in computer science by the unique
type of data it uses: signals.
In most cases, these signals originate as sensory data from the real world: seismic vibrations,
visual images, sound waves, etc.
DSP is the mathematics, the algorithms, and the techniques used to manipulate these signals
after they have been converted into a digital form.
This includes a wide variety of goals, such as: enhancement of visual images, recognition and
generation of speech, compression of data for storage and transmission, etc.
3
Digital Signal Processing is one of the most powerful
technologies that shapes and will continue to shape
science and engineering in the twenty-first century.
5
An Audio Processing Application
6
STATISTICS
Statistics and probability are used in Digital Signal Processing to characterize signals and the
processes that generate them.
For example, a primary use of DSP is to reduce interference, noise, and other undesirable
components in acquired data.
These may be an inherent part of the signal being measured, arise from imperfections in the
data acquisition system, or be introduced as an unavoidable byproduct of some DSP
operation.
Statistics and probability allow these disruptive features to be measured and classified, the
first step in developing strategies to remove the offending components.
Figure shows two discrete signals, such as might be acquired with a digital data acquisition
system.
7
• The vertical axis may represent voltage, light intensity, sound pressure, or an infinite number
of other parameters.
• Since we don't know what it represents in this particular case, we will give it the generic label:
amplitude.
• This parameter is also called several other names: the y_axis, the dependent variable, the range,
and the ordinate.
• The horizontal axis represents the other parameter of the signal, going by such names as: the x-
axis, the independent variable, the domain, and the abscissa.
• Time is the most common parameter to appear on the horizontal axis of acquired signals;
however, other parameters are used in specific applications.
• For example, a geophysicist might acquire measurements of rock density at equally spaced
distances along the surface of the earth.
• To keep things general, we will simply label the horizontal axis: sample number.
• If this were a continuous signal, another label would have to be used, such as: time, distance, x,
etc. 8
Pay particular attention to the word: domain, a very widely used term in DSP.
For instance, a signal that uses time as the independent variable (i.e., the parameter on the
horizontal axis), is said to be in the time domain.
Another common signal in DSP uses frequency as the independent variable, resulting in the
term, frequency domain.
Likewise, signals that use distance as the independent parameter are said to be in the spatial
domain (distance is a measure of space).
9
Mean and Standard Deviation
Examples of two digitized signals with different means and standard deviations 10
The term, 𝝈𝝈𝟐𝟐 , occurs frequently in statistics and is given the name
variance.
The standard deviation is a measure of how far the signal fluctuates
from the mean.
The variance represents the power of this fluctuation.
In some situations, the mean describes what is being measured, while the standard deviation
represents noise and other interference.
In these cases, the standard deviation is not important in itself, but only in comparison to the
mean.
This gives rise to the term: signal-to-noise ratio (SNR), which is equal to the mean divided by
the standard deviation.
For example, a signal (or other group of measure values) with a CV of 2%, has an SNR of 50.
Better data means a higher value for the SNR and a lower value for the CV. 13
The Histogram
Suppose we attach an 8 bit analog-to-digital converter to a computer, and acquire 256,000 samples of some signal.
As an example, Figure (a) shows 128 samples that might be a part of this data set.
The value of each sample will be one of 256 possibilities, 0 through 255.
The histogram displays the number of samples there are in the signal that have each of these possible values.
Figure (b) shows the histogram for the 128 samples in (a).
For example, there are 2 samples that have a value of 110, 8 samples that have a value of 131, 0 samples that have
a value of 170, etc.
We will represent the histogram by Hi , where i is an index that runs from 0 to M-1, and M is the number of
possible values that each sample can take on.
For instance, H50 is the number of samples that have a value of 50.
Figure (c) shows the histogram of the signal using the full data set, all 256k points.
As can be seen, the larger number of samples results in a much smoother appearance. 14
15
The histogram can be used to efficiently calculate the mean and standard deviation of very
large data sets. This is especially important for images, which can contain millions of samples.
The histogram groups samples together that have the same value.
This allows the statistics to be calculated by working with a few groups, rather than a large
number of individual samples.
Using this approach, the mean and standard deviation are calculated from the histogram by
the equations:
16
17
Almost all oscilloscopes provide an averaging function, the most common statistical process applied to
waveforms.
Acquiring multiple waveforms and adding them point by point then dividing the sum by the number of
waveforms in the average yields the average or mean value of the waveform as in Figure.
The upper trace is the acquired waveform.
The lower trace is the average of one thousand acquisitions.
Averaging has suppressed the noise leaving a clean waveform.
18
Any value obtained by a measurement contains two components: one carries the information of
interest, the signal, the other consists of random errors, or noise, that is superimposed on the first
component.
These random errors are, of course, unwanted because they diminish the accuracy and precision of
the measurement.
The term "signal" is sometimes used for the pure, noise-free signal but sometimes also for the noisy
"raw" data.
The difficulty in reaching this goal depends both on the characteristics of the noise-free signal and the
noise.
The signal-to-noise-ratio (SNR) is the ratio of the strength of the signal and the strength of the noise.
The higher the ratio the easier it is to extract information and the more reliable are the results.
19
What is Signal to Noise Ratio?
20
21
100, 70, 1000, 10000 and 500
22
23
24
25
26
Attenuation is a reduction of signal strength during transmission, such as when
sending data collected through automated monitoring.
For example, an office wall (the specific medium) that changes the propagation
of an RF signal from a power level of 10 milliwatts (the input) to 5 milliwatts
(the output) represents 3 dB of attenuation.
27
28
29
30
31
Example 3
32
Example 4
33
34