Professional Documents
Culture Documents
1966
DIETRICH, W.
^881
CALCULATIONS OP THE EFFECTS OF PEAK
CLIPPING ON SPEECH-LIKE SIGNALS
Jmttmmii
I 1 ffi u188uffl H
H 18 ffl H fi I fin ifniul
, POSTGftfltJUATJSSCHOOL
SREY, CALIF. 93940
by
William Varner Dietrich
n
Lieutenant, United States Navy
B.S., University of Wisconsin, 1962
signals before clipping indicate that such variation may improve the
7. Results of Calculations 32
8. Repeaking 49
10. Conclusion 55
11. Bibliography 56
Appendix
A. Spectra of Clipped Signals 59
rms value
peaks. The average power in the speech signal being transmitted, then,
is only a small fraction of the transmitter's peak power capability. For
100 watts, a speech signal having a 14.5 db peak factor would have an
average power of only 3.65 watts. Work done in speech processing for
lungs through the trachea to the larynx. At the larynx the air stream
throat, tongue, and jaw, etc.) its acoustic transfer function changes
constricted openings such as between the tongue and the upper teeth.
NASAL
-NOSTRILS
CAVITIES
ORAL
LUNGS TRACHEA LARYNX THROAT TEETH LIPS
CAVITIES
During normal breathing the vocal cords, which are actually two
thick folds or bands, are widely separated at one end, forming a large
triangular opening. To produce a voiced sound the cords are drawn to-
gether, closing off the air passage except for a thin slit. The act of
exhaling then sets the vocal cords vibrating; the slit opens and closes
periodically. The fundamental frequency of vibration varies from about
90 Hz for a deep voiced man to a maximum of about 350 Hz for a high-
voiced woman, with typical values of 125 Hz and 250 Hz respectively [6] .
The vibration is by no means purely sinusoidal since the slit may remain
closed for as long as half of the cycle. The sound spectrum produced
therefore contains a fundamental and many harmonics. R. L. Miller
has shown that the relative amount of energy present in the higher
harmonics is related to the abruptness of closing the vocal cord slit.
the cords become firmer, and they are stretched to a greater length.
10
3. Vocal Tract Analog
O
1i
Tram 1
TT ^= •
)
'Z.
o —a
4
o tt>~
*Z£-~
T
11
The relations for the electrical parameters in terms of the dimensions
/>/ ^ = kAi
L = C
kA
fie
where:
c = speed of sound
= density of air
f>
HASAL
COUPLING »- feLjJ TTTTTTTT
MOUTH
VOCAL-TRACT ANALOG flAOIATION
IMPEOANCE
L. I L. L, L. L. t, * L,
——/*gfi
lc„ JL icL :
^-c „ ^c ,
:
\ ^-el
5l.c"a
* * » 4 6 4 4 1 4 4
OUTPUT
*., A.i A, A, A, »4 C,L, L, C,/C,
-
COMFICUrtATlO.S COSTROL S;G. 4ALS
the audio frequency range. The buzz filter attenuates the excitation
12
so that the spectrum decreases at six db per octaize with increasing
speech sounds. [3] The noise source may be inserted at various points
along the vocal tract analog according to the requirements of the sound
being represented.
13
• .
f.
r-
r
CO
* •*
i
V
/
> \
\
\
2 30 *
i \
ui '
\
2 j
i
-no Hi
UJ
V
1 N
>
—
h-
20 '
IS
!
**
-M"
h"
N
\
\
S
<r.
_i
|
|
1
X
1
>
—* \
\
ijj :
!
I
i \
CL 10 -
i
1 1 1 !
1 1
«
\
|
1 I
1
Xs
,
i
1
i |
I
i 1
;
.. i
i . -*-i. 1 -j— i
, 1
!
FReouewcv - kHz.
Unvoiced sounds have much less energy than voiced sounds due
to the absence of vocal chord vibration. Their energy is spread over
14
formant structure is usually not present.
Researchers have concluded that the intelligibility of speech
will remain good if the general shape of the power spectrum is pre-
analog discussed earlier. The filter signals are used to control the
15
VOCODER MULTIPLEXING VOCODER
ANALVZER AND SYNTHESIZER
TRANSVlSS'ON
771 J ?oo- !
,
300 HZ '\ j
300 HZ •
_J 300-
450 HZ
MICRO- LOUD-
PHONE SPEAKER
"ft
SPEECH REMADE
SPEECH
2800- !
3200 HZ I
-*t SWITCH
! I
16
5. Intelligibility of Clipped Speech
the words became audible at the same level. A closely related effect
speech was barely audible, the listener heard only the vowel sounds.
However, when severely clipped speech wasiheard near the threshold
of audibility, the consonants were as audible as the vowels. [13]
The Harvard report found that the quality of speech was reduced
"surprisingly little" by peak clipping. The experimenters observed
that:
17
2) 12 db
peak clipping is not at all objectionable, but,
on the contrary, sounds as though the speaker were
enunciating with special care;
The only characteristics retained from the original waveform are the
times of zero crossings.) Integration and differentiation were accom-
plished by the circuits shown in Figure 6. The circuits were driven
by low impedance sources into high impedance loads. Over the fre-
quency range of interest in speech, these circuits act as "spectrum
18
10 fcfl.
R
r
!wd
V,
R+
F-
R+ ^
UJ
= -J
S
I0 + 5«*>
a) differentiator b) integrator
it was found that the process following clipping had little effect on
The final group of curves is the pair for which integration pre-
ceded infinite clipping. The articulation scores for this group are so
19
100 . J__ 100
• ••
•
NO DISTORTION
90
3~* ••'•- •••
100 •
» • • -
. *
•
T -
• •
DIFFERENTIATION
90
..:«• ^jj—i^j
100
****** ; i. *
INTEGRATION
o 90
h- mo
< 00
^" •
_J
Z>
o
J-
£C
<
90
•
^S^
>^^ m
• •
90
3s^
•
•
•
• •
•
•
80 < •
80
•
••J
• •
Q •
•
nr
o fO
70
5 INFINITE CLIP. INT.
•
PEAK CLIPPING
H- 60 (•CUP.)
Z
UJ
o 50
•
30 •
30
•
20 •
•
•
•
•
10
•
INT. CUR INT. CLIP. DIFF.
n _i I i
5 10 15 20 25 5 10 15 20 25
on the new tests were about ten percentage points lower than the
corresponding scores using the familiar words. However, scores for
the new words were 15 points higher than the average of the first
modulation process and then mixed with gaussian noise. The signal-
to-noise ratio defined for these tests was:
X = 20 1og _|E_
10
n
21
15 Z\
22
discussed further in Section 7. Figure 9 shows the effect of differ-
hances intelligibility. -
The most significant results from the Montana State study are
those comparing pre-modulation and post-modulation clipping for
SSB filter.
23
iZ. & Z* 30 36
ssb Clipping lev^l (Ai)
24
>
-J
111
o A* \5 «ib
IZ \fc 24 30 3b 42.
25
.
y = ax
1
+ ax 3
v5
+ ax 5 + ...,
i
x * X
max
3 5
y = a .
1
(A cos u>
o
t) + a„
3
(A cos
.
to
o
t) + ar
5
(A cos wo t) + .
a A
y = a n
1
A cos to
o
t + —— 3:
4
(3 cos wo t + cos 3 a)
o
t) +
26
If the input is a two-tone signal of the form x A cos u) t +
2w, "i*2
i
2w 2 _a)
l
3u) - 2 u>
l 2
3 60 - 2 u)
2 1
of the form A nK cos 2Trnf t,n = l,2,... . (An even function clip-
o
ped symmetrically remains an even function.) In Figure 12 the
27
..
Table I
FREQUENCY AMPLITUDE
(rad/sec)
3 2 5
co a A + 3/4 a A + 3/2 a AB + 5/8 a A
3 2 4
+ 15/4 a A B + 15/8 a AB + . .
5 5
3 5
CO a B + 3/4 a B + 3/2 a A B + 5/8 a B
2 2 4
+ 15/i a A B + 15/8 a A B + . . .
3 5 3 2
3u> 1/4 a A + 5/16 a A + 5/4 a A B + ..
1 5
3 5 2 3
3«) 1/4 a B + 5/16 a B + 5/4 a A B + .
2
2 4 2 3
2 u> ±u> 3/4 a A B + 5/4 a A B + 15/8 a A B +
1 2
2 4 3 2
u) ±2 u> 3/4 a AB + 5/4 a AB + 15/8 a A B +
2 3
5
5u>'1 1/16
"' a cA + . . .
5
>2
1 / 16a 5 !
4
4(J, ±<0o 5/16
' a.A B + ...
1 2 5
3
3u),
1
±2 od
*>2n
V
5/8 a„A B + ...
2u>
12 ±3 u>_ 5/8
'
a cA B
5
2 3
4
+ ...
±4a)
u>_1**"»2 5/16aa 5"
5/16 c AB + ...
28
WrVr tV^v- —,4
m = 9
i ... ..i
1 ^i^—H^iV- **tt* 1
— ~riir
r
"nrr
11
1
m = 6
-
j
'-
i^ i'
i
J V "'i '—^"
;
,
n " ''
l
— rf
-
rJ~^r —
,
II
'
1 '
'' —^^—
m = 3
JL II..,, H. Y]—'
' '
t |
— * *•
—I '
—
l ' * *i » ' ' "i I ' '
i r i
*
»
m = 1
29
-I . , 1. 1 t | >
,
i
,
I, .
,
,>,,.,, .r-^o^yl
FT
j
|
m=26
i, ,i
T r-rr-r npr
J. i,
, .,
i.,. r , ,..i
r -j. ,...! , i.i. i ,.i i
m=12
1
11
^
m=4
HI
1,.,I.aJ.,,,
1 T ,il,, ,.ii,i r—prr —J-—r—r ^y-*-^.
m=l
30
originally equal amplitudes have been distorted so that the component
the same type of result for a three-tone input of the form cos 2Trmf +
o
cos 2 it (m + l)f + cos 2Tf (m + 2)f .
o o
J. M. Dukes [4] analyzed the clipping process from a statistical
point of view. He found that, for a totally random signal, infinite
clipping does not change the shape of the signal's power spectral
1
= if , i=l,2,3,....,
clipping will generate no new frequencies in the band of the original
will lie at frequencies higher than those present in the original sig-
the same manner as corrupting noise. But much of the distortion will
31
7. Results of Calculations
2760 Hz.
The relative phases of the components were computed by inferring
a phase characteristic from the amplitude characteristic. Considering
the vocal tract analog discussed in Section 3, a peak in the amplitude
V,
V
2 1
V
1 s LC + sRC + 1
(1 - U) L0 + jwRC
32
The phase of V~ with respect to V is:
=
-1 u>RC
<t>
- tan
1 - UJ LC
=
-1 U>RC
- tan
2
l-u.
2
L u)
O
-1 tO RC
= - tan
60 o u>
a) to
o J
but (J RC = tt
o Qo
l
then (fr
=tan
Q
ii*y
The phase characteristic corresponding to an amplitude function having,
for example, three formants is then the superposition of three phase
functions of the type shown above, one for each resonant frequency.
The Q factor for each phase function is dependent upon the formant
frequency and bandwidth according to the relation:
BW 3db Qo
33
characteristic inferred by the process previously discussed. The am-
plitude characteristic was normalized so that the corresponding time
function would have a maximum absolute value of 10. The curve labeled
"PWR" is, in effect, a power distribution curve: at any given frequency
it represents the total power (across a one ohm resistor) in the components
at that frequency and below. Figures 15, 16, and 17 show the spectra
resulting from clipping the time waveform 10 db, 20 db, and infinitely.
input between +10 and -10, the clipper output was identical to the in-
put, and for values exceeding that range the output was fixed at +10
the infinitely clipped wave, the total power reaches the maximum
possible value of 100.
not the audio waveform but the corresponding upper sideband signal.
N
W>" £,
n=l
C co.[2lKf +nf )t + 4> ]
n
that the number of computations required for fourier analysis would not
vowel sounds, the phonemes /a/, /£/ and /u/. The amplitude char-
acteristics for these models were taken from spectrum envelopes
34
-o
i
i
<
4 4 8 JO 1^
FR.£C?'J.£MCS> - kHz
Fig. 14 Amplitude and phase characteristics for the vowel sound in "bed".
35
s
PWR
1£DB_
21 4 6 e. to /2
FRCQUSNCV -kHz
Fig. 15 Spectrum of "bed" clipped 10 db
3fi
9
&
PWR
a:
L'j
y-
7}
ft
a*
<
01
4 6 10 /2
FfXQUtUCV - kUz
Fig. 16 Spectrum of "bed" clipped 20 db
37
£ 4 6 6 10
rRCQUENCV - kWz
Fig. 17 Spectrum of "bed" infinitely clipped
38
B
pU>!
a:
3
o
»0 5.
p. 3
t/1
BED
Z A 6 8 10 it
plane by tilting the spectrum. The spectra calculated for the differ-
the orignal spectra. In particular, the formant frequencies are not al-
peaks appear between the formants where the original spectra had
smooth "valleys." The spectra of the signals clipped at SSB are, as
1
N
CT.
1
(T
2
£-
i=l
li 2i
where C and C , i = 1 ,2, .... ,N, are the discrete values of the
2
07 - L£
i=l
c
J1
2
, j = i,2
For each computation the value of N was set so that only those com-
ponents at frequencies which were present in the original signal were
40
included. The higher frequency distortion components generated in the
clipping process did not enter into the calculation. In the cases where
differentiation preceded clipping, the coherence coefficient was cal-
culated using the spectrum of the differentiated signal as the original
TABLE II
Listed in Table III are the values of power calculated after various
its maximum absolute value was 10. The peak factor before clipping,
averaged over the four signals, is 10.9 db. For the differentiated
signals the average peak factor is 12.3 db. In every case the differ-
entiated signal has less power than the corresponding non-differentiated
wave. This result may explain why the Montana State study found
differentiated and clipped speech to be less intelligible in noise where-
41
TABLE III
42
presumably greater intelligibility, perhaps the difference in power is
ferentiated prior to clipping; the lower points are averaged over the
in total power of the four signals from this experiment. The other
points on the graph represent data replotted from Figure 8 by taking the
differentiated wave, but for /u/ the reverse is true. For the sake of
clarity, points were not plotted for the signals clipped at SSB, how-
ever these dafa have no consistent relationship to the curves shown.
more common abrupt clipper. It was thought that the gradual clipper
might give better results since its input-output characteristic did not
have a sharp bend. At modest clipping levels, the output waveform would
43
I
/o DIFFERENTIATED
BEFORE CUPPING
</>
a.
ID
d
o
2
-
5
cc
3
o
a.
\0 £0
CLIPPING - <U>
# eo
U /o ©
<
it)
a: EI
'O
o
2 . . CALCUOTeO FROM
Q
(B RftPLOTTeO FfcO* C»6- 8
u
© RESULTS OF THVS EXVeRl/AEMT
3
o
a.
6 I* (8 24
CUPPtNG - <ib
44
loo
© /u/
B/e/
.60
BED
A "BED" DIFFERENTIATED
Q /Q/
^ /a/ DIFFERENTIATED
COHEREMCE COEFFICIENT
45
have rounded rather than sharp corners and therefore should contain
less distortion. The intelligibility tests, however, showed no
significant difference between the two types of clipper when additive
noise was present.
46
TABLE IV
Comparison of Power for Gradual and Abrupt Clipping
Audio Clipping SSB Clipping
TABLE V
Comparison of Coherence Coefficients for Gradual
and Abrupt Clipping
Audio Clipping SSB Clipping
47
.
— <&
-
10
20 40
VA
i
10 30 <x>
GRADUAL CUPPIW6
© A&R.UPT
VA oo
40
A/ DIFFERENTIATED
• - — • gradual clipping
© Abrupt "
YA
30 40 oO
CUPPING - Jib
48
8. Repeaking
It is well known that when a clipped speech signal is filtered to
reduce its bandwidth the peak factor increases. One explanation for
finite SSB clipping, calculations were made using the sound models
"bed" and /e/ ' . The peak values of the time waveforms increased (from
10) to 16.85 for "bed" and 16.75 for /e/. The resulting peak factor
was 5.72 db in both cases. For infinite clipping at audio followed, by
ideal filtering, the average peak factor for the four sound models was
3.17 db
These results might seem at first glance to indicate a significant
appears. The narrowband signal which has the smallest possible peak
factor is a simple sinusoid of constant amplitude. If this sinusoid
signal can have is 3 db. The peak factor .computed for the clipped SSB
signals is only 2.72 db above this minimum.
49
It has been assumed in various studies [21 , 24] that the peak
factor of a signal changes significantly with frequency translation from
assumption. When the undipped model sounds "bed", /a/, and /e/
were translated to narrowband, the peak factors changed only +0.2 7 db,
+0.65 db, and -0.03 db respectively. This is a rather surprising result
50
o
CD
la?
is
PQ
CO
CO
id
Amplitude
51
9. Effects of phase characteristics on clipping
characteristic has on the clipping process. The four model sounds were
infinitely clipped at audio with: 1) their components all in phase,
The signals have the largest peak factors before clipping when
their components are all in phase. (PF = 20-10 log l0 (Power) db since
the signals were normalized so that their maximum absolute value was
10.) This result was expected since the expression
N
f(t) = 5""*
*--i
n=l
C
n
cos (2 1> nf
on
t + (b )
has an absolute maximum at t = when the (b 's are all zero. The
n
coherence coefficients for the in-phase signals are all lower than for
the same signals with other phase characteristics, probably because the
large positive peak causes much of the signal to be below the zero axis.
(See Figure 25) When the signal is clipped, information is lost because
<&} m <m -
'.-Wo rwrfseO
53
With the phase characteristic inferred from the spectral amplitude
distribution, the peak factors are smaller and the coherence coefficients
larger. With random phases the peak factors decrease even further.
Since the ear is quite insensitive to phase variations [6], this result
suggests that the peak factor of speech could be reduced, without im-
network which would not distort the spectrum appreciably. For any sig-
nal expressed in the form of Eq. 9.1, there is some phase characteris-
tic which minimizes its maximum absolute value. The questions of
research in this area. Notice also that the coherence coefficients are
generally larger for the random phase signals. This indicates that the
54
10. Conclusion
The analysis of the effects of peak clipping on speech-like signals
has produced three important findings. First, the coherence co-
efficients are large, indicating a high degree of similarity between the
power spectra before and after clipping. Thus it is not so surprising
the processed speech in noise, but the method of analysis used in this
study can give indications which point toward improved speech pro-
cessing schemes.
55
BIBLIOGRAPHY
12. Miller, R.L. "Nature of the Vocal Card Wave, " Journal of the
Acoustical Society of America v. 31, (June 1959), 667-677.
,
56
13. Office of Scientific Research and Development, The Effects of
Amplitude Distortion upon the Intelligibility of Speech OSRD ,
22. Stevens, K.N., and Fant, C.G.M. "An Electrical Analog of the
Vocal Tract, " Journal of the Acoustical Society of America v. 25, ,
December 1963
57
26. Weibel, E.S. "Vowel Synthesis by means of Resonant Circuits,
Journal of the Acoustical Society of America v. 27, (September
,
1955), 858-865.
58
APPENDIX A
59
PHJ
-CI
ORIGINAL SP£CT£UM
lor "BCD'
rs
<
Pklft
JBEH
4 4 e >p a
FR.6QUtK!CV - kMz
60
9
u
SPecTfcum of Beb"
DlFF6Rev)TlATgO 4 CtlTPfft <0«U
0£
3
o
io -'-
'
PUR
jam
4 6 8 10 /2
Frcouemcv - kWz
61
9
specTfcu* or "aco"
4 6 8 10 /Z
FrcQUEHCV - kHz
62
c
l
M
SPECTRUM OF B£D"
DiFFeRCNTiM^D 4 «UPPc& imfimitcu?
'o
*6
4 6 8 10 re
_
9
a /O /2
F 3£G.UE UCV - kHz
64
9
&
^PeCTfcUM OF "BCD"
PU/R
4 6 e 10 /2
FRCQUENCY - kU£
c
i
z
SPECTRUM op H 5€0"
GRADUALLY CuPP^O ZO Ab
7.
PvjR
•
/
/
3
6
i
5. i
IS
V |
1
! !
, i
<
!
-J !
(/1
V
I
Hi
\
z
^ V^"^—v—
4
^-V.
FrcQUEHCV
-f,
_+.
6
- kHz
8/0
1
/2
DflTlfc
66
o
SPECTfcMNV OF "BCD"
10 /2
- kHz
PUItf
SPecTfeu* op "Bea M
G>£M>UHJL*? CLlPP£0 so 4L
g
to
i
1 i
<
-J
2
£
!
fi
k I
I
A
A /\
68
G»J2-fcDUALXv> CClPPCD 40 M>
fl&DB
4 6 8 /o /E
Ffxquemcv - kHz
KQ
B PUIS
r~
%
^PeCTfcUM OP 'g€D"
'o
a;
<
I-
! I
BEO
8 /© /Z
Frcquemcv - kU<
70
9
I
PHI
ORIGINAL SPGCTR.UM
x £
?-
'3
c:
I-
PUIfi
., AH
4 6 a /o /2
FfcCQUfcHCV - kHz
71
^T>€CmUM *P /a./
p
P!-.'R
- IBDEl
10 re
o
t
PUR
^PecrKutA op /a/
CUPPE5 2oA\>
4 6 /o ft
Fpxqugmcv kU«
1A.
VJK
SPECTRUM OF /*/
4 6 8 10 it
Frcquemcv - kHz
9
PU/R
4
Frcqupucv
6a - kWz
/c /2
_
OftJGlUAl SPCCTfcUM
o
for /a/
or
3
o
PL
3
U
I
a 3
<
CL
V-
C/7
PhJR
iH
4 6 8 10 it
Ff&qushcv - kU;
76
'o
4 6 8 /<? /a
pRSQUQMCV -kHz
77
?m
SPECTRUM OP /* (
k
DlFFCRENTlATCD { CtiPPe* Zo\\>
4 6 8 10 /z
Frcquencv - kHz
78
r^tf-v
S?eCTfcUM OF /a./
4 6 8 10 /2
Fsxquemcy - kHz
79
s
Pu«
StfcCTfcUM OP
/«./ DlFFCfcCfoTlATFa
(MrifcHTCLtJ CUPP60 «T SSB
8 /0 /2
rPXQUENSV - k^
80
p.
o
;6
3
o
<r
V)
<
c>
01
B
PUR
SperreuM of (*> I
-
8 10
9
PwW
^P€CTfcU/w OP /a./
4 6 8
Vv A
/o
/A
It
Frcqu^hcv - kHz
Q
I
1
10 /2
- kUz
84
2<
<
w hi
3 .
a
I-
-J
<
^. t.
4 4> 8 >Q
k.H<
*'
'SPECTftU* or /€/
g
* (>
at
Ui
O
10
5
J-
<
C-i
i*
\ \
\v V
i
v ..
v
•aJw-«-
6 8 /0
Ffjcpuemcv - kHz
86
°>
*Spee.TfeUM. op /fc /
i
o clipped 2oJLb
* 6
S :
5
<r
4.
-J
I-
(/J
I)
" v
v -
V v 3\
2-l^JL
• -t
\ x*'-JlJO
4~ 6
?.
8 /0 tz
Ffxquemcv - kHz
87
K-,
B\
'o
*
JMFIJOITCL^ CLlPPCfc
£;
3
o
<r
<
'
a-
C/l
Hi VV
4*8
i
u.
\\\ \\ \ Xv
/o /2
Frcqusmcv -kHz
8ft
$
fUA
3
I
<
•c
a
S;
4 6s
Frcquencv -kHz
/0 It
89
s
SPCC.T*UM or /«/
4 6 8 /d /*
FrcquencV - kHz
an
Pwk
V 1
4 6 8 /0 tt
FrcquencV - kt^
91
$
Ptofc
4 6 8 /o /z
Frcquencv -kHz
^a2
J!
ORKilKiAu SPECTRUM
W /u/
UJ
PUR
< I
nn
4 4» 8 to IX.
93
s
PWR
J
'o
<
I-
Vli
T 8
8 10 It
Fkcquencv - kUz
94
PUR
OP /u/
CUPPED
p
a*
3
o
,,2flm
4 6 8 10 /z
FRCQUSMdY - kHz
95
J
'SpecTfeUM op /a /
'o
u
9
o
U)
<r
J
3
4 6 8 10 tz
Frcou&ncv - WWz
96
a
_PU)fc
SP€crfcu* of /u/
'o /WFIWITCLV CLiW^D AT
* 6
SSE
3
o
<
J
a 3
<
-J
c-
ML
4 6 8 /0 /£
97
t
I
Vi)
1/1
0-
OfciciMAc ^peeTfcuM
ViJ
I-
-J
6-
PUIR
aUCL
4 4 8 10 II.
FfcEQueMCV - fc.H*
98
s
^PecYfcUM OF /u/t>iFF.
PUR
4 6 8
um
10 /z
Fp.counNCV - kHz
99
4 6
FRequswcV - kHz
100
^ECTfcUM OF /tt f DJFF.
8 10 tz
APPENDIX B
Fourier Analysis
f(t) = -7T- +
l
n=l
/Z
—
'
f
(A
n
cos 2trnf
on
t + B sin 2-rrnf
o
t)
T o cos
2 *-, n n
n=l
where f = 1/T
o
C = A
o o
'
2 2
C
n
= J V
A
n
H B
n
_1 -B
=tan n
n
A
n .
1/2 £
n=l
C
n
1Q2
APPENDIX C
Digital Computer Programs
The calculations for this study were made by a Control Data 1604
digital computer with a Fortran 63 compiler. The major portion of the
computations involved transformations back and forth between the
time domain and the frequency domain. The transformations were
accomplished using the principles of Fourier analysis. The integrals
involved were evaluated by making the approximation
T N
f(x)dx - ? f (i A x) Ax where Ax = —
T
1O T~l
=1
N
1
did not significantly affect the results. Listed on the following pages
study.
103
)
COMMON/B1/S/B2/T
ARG2=2.*3. 141 59265 36*FREQ
FORM=2,/FLOATF(LIM)
29 DO 30 N=1.100
AN*0.
BN*=0.
ARG1*ARG2*FL0ATF(N
DO 35 I«ltLIM
ARG=ARG1*T( I)
A N »A N +S(I)*COSF(ARG)
35 B N =B N +S(I)*SINF(ARG)
A(N)«A N FORM
B(N)=B N •FORM
30 C(N) = ( A(N)**2+B(N)**2)
RETURN
END
104
SUBROUTINE AN (A.BtC)
C PERFORMS FOURIER ANALYSIS OF AN INFINITELY CLIPPED
C WAVEFORM IN ARRAY S BY SUPERPOSITION OF THE
C SPECTRA OF THE COMPONENT PULSES. A AND B ARE THE
C OUTPUT ARRAYS OF FOURIER COEFFICIENTS AND
C C = A**2 + B**2.
DIMENSION A(IOO) tB(100) tAHlOO) »BI (100) ,Z(100)»C( 100)
CALL ZEROS (900,Z)
DO 10 1=1,100
10 A( I )*B( I)=0.
DO 22 1=1.100,2
IF (Z(D) 21,23,21
21 TQ=Z(I+1)-Z( I)
Tl=Z(I)+T0/2.
CALL SPECTRUM -20 . , 1 ./120. ,T0,T1 ,AI ,BI
(
DO 22 N=l»100
A(N)=A(N)+AI (N)
22 B(N)=B(N)+BI (N)
23 DO 24 I«l,100
24 C( I )=A(I)**2+B( I)**2
RETURN
END
20 B(N)=-C*SINF(PHI)
RETURN
END
LOS
SUBROUTINE ZEROS <NUM»Z)
C STORES THE TIMES OBTAINED FROM ARRAY T OF ZERO
C CROSSING OF THE WAVEFORM IN ARRAY S (NUMBER OF
C POINTS « NUM) INTO THE ARRAY Z. AFTER THE LAST
C ZERO CROSSING HAS BEEN ENTEREDt SUCCEEDING VALUES
C OF Z ARE EQUAL TO ZERO.
DIMENSION S(900)»T(900)»Z(100)
COMMON/B1/S/B2/T
J=l
LAST»1
DO 10 I«ltl00
10 ZU)«0.
DO 25 I-1»NUM
IF (S(I)) 21,22,22
21 NOW*-l
GO TO 23
22 NOW-1
23 IF(NOW+LAST) 25,24t25
24 Z(J)=T( I)
J=J + 1
25 LAST«NOW
RETURN
END
106
SUBROUTINE SYN A.B.FREQ. TYPE »NUM)
( I
DO 20 N-l.NUM
ARG=ARG1*FL0ATF(N)
IF (ITYPE) 15.10.15
10 S( I)=S( I)+A(N)*COSF(ARG)+B(N)*SlNF(ARG)
GO TO 20
15 S( I )=S( I)+A(N)*COSF(ARG+B(N) )
20 CONTINUE
21 CONTINUE
RETURN
END
22 SMAX-S( I)
21 CONTINUE
GAIN«10./SMAX
DO 23 I»1.NUM
23 S( I )=S( I)*GAIN
RETURN
END
107
)
108
SUBROUTINE POWERF <C,PF)
C COMPUTES THE CUMULATIVE DISTRIBUTION FUNCTION
C OF THE POWER IN THE ARRAY C.
DIMENSION C(100) »PF( 100)
PF(l)=C(l)/2,
DO 10 I«2»100
10 PF(I)»PF( I-l>+C( I 1/2.
RETURN
END
COEF=SUM/ S0RTF(SIGSQ1*SIGSQ2)
RETURN
END
109
SUBROUTINE SMOOTH (NUM»SINP #LIM)
C INTERPOLATES A SMOOTH CURVE BETWEEN NUM INPUT
C POINTS IN ARRAY C BY MEANS OF SIN(X)/X FUNCTION.
C OUTPUT IS THE ARRAY S AND WILL HAVE LIM POINTS.
DIMENSION S(9O0)fT(900)»SINP(100)
COMMON/B1/S/B2/T
INT«900/(NUM-1)
LIM-INT*(NUM-1)
SAVG=0.
DO 10 I=1»NUM
10 SAVG=SAVG+SINP( I)
SAVG=SAVG/FLOATF(NUM)
DO 11 I»ltNUM
11 SINPU )=SINP( D-SAVG
DO 20 1=1,900
T( I )=FLOATF( I)/FLOATF(LIM)/120.
20 S( I)=0.
DO 30 J=1.NUM
DO 30 I»1,LIM
ARG=3.1415926536*FLOATF I - J-l *I NT /FLOATF INT)+1.0E-50
( ( ) ) (
110
SUBROUTINE PHASE (NUM. C. PHI)
C INFERS A PHASE CHARACTERISTIC BASED ON THE
C AMPLITUDE FUNCTION IN ARRAY C. PHI IS THE OUTPUT
C PHASE ARRAY IN RADIANS.
DIMENSION S 900 ).T( 900). PHI 1(600) .C(50) ,PHI( 50),MAX( 50) .X(300).
(
* YI300)
COMMON/B1/S/B2/T
ILAST»NUM*12
DO 70 I»1.NUM
X( >=120.*FLOATF( I
70 Y(I)«C( I)
DO 71 I=12»ILAST
XINT=10.*FLOATF( I)
CALL SPLINE X. Y»NUM ,X INT.
( INT )
71 Sm-YINT
IPOS«l
J=l
DO 50 I=12,ILAST
IF (S( I+l)-S( I )46.49.49
46 IF (IPOS) 50,50*47
47 MAX(J)=I
J»J+1
IPOS*0
GO TO 50
49 IPOS«l
50 CONTINUE
JLAST=J-1
DO 55 1=1.600
55 PHIK I)=0.
DO 60 J=1.JLAST
FMAXJ=MAX( J)
Q-FMAXJ/10,
DO 60 1=1.600
FI = I
111
) )
GO TO 7
1 YINT*Y(1)
RETURN
2 IF(XINT-X(K+1) )6»4,5
4 YINT»Y(K+1)
RETURN
5 K«K+1
IF(M-K) 71,71»3
71 K«M-1
GO TO 7
6 IF(XINT-X(K) )13»12»11
12 YINT»Y<K)
RETURN
13 K=K-1
GO TO 6
11 YINTMX(K+1)-XINT)*<C(1»K)*(X(K:+1)-XINT)**2+C(3,K))
YINT=YINT+(XINT-X(K) * C( 2 ,K )* XINT-X K )**2+C(4 t K)
) ( ( ( )
RETURN
7 PRINT 101»XINT
101 FORMAT(8H0XINT = E18.9»32H, OUT OF RANGE FOR INTERPOLATION)
GO TO 11
END
SUBROUTINE SPLICON (X ,Y »MtC )
A(K,3)»P(K)/A(K t 2)
4 B(K)"B(K)/A(Kt2)
Q=D(M-2)/D(M-l)
A<M.l)=l.+Q+A(M-2,3)
A(Mt2)=-0-A(M > l)*A(M-1.3)
B M =B M-2 -A M » 1 *B M-l
(
)
(
)
(
) (
Z(M)=B(M)/A(M»2)
MN=M-2
DO 6 I=1»MN
112
Z(K)«B(K)-A(Kt3)*Z(K+l )
Z(1)«-A(1»2)*Z(2)-A( 1»3)*Z(3)
DO 7 K«1»MM
Q*1./(6.*D(K) )
C(1»K)*Z(K)*Q
C(2,K)«Z(K+1)«Q
C(3»K)=Y(K)/D(K)-Z(K)*P(K)
C ( 4 »K = Y K+l
)
( ) /D K) -Z K + l *P K)
( ( )
END
113
I I ) ) ) ) )
COMMON/B1/S/B2/T
LAB(1)»4H10DB $ LAB 2 =4H20DB $ LAB(3)=4H INF ( )
DO 4 I-1.50
4 PHI ( I)»0.
DO 5 1=1.100
5 A( I )=B( I)=C( I )=C1( I )=0.
READ 9. LA. NUM
9 FORMAT (A4.I3)
"
DO 13 1=1.100
ORD(I )=C( I
13 ABS( )=120.*FLOATF( I
CALL ITITLE ( IT)
IT(D*8HORIGINAL
IT(2)=8H AUDIO
IT(3)=8HSPECTRUM
CALL DRAW(100»ABS.ORD»1.0.LA.IT.2000..1..0,0.2.2,7,10»0.L)
DO 135 1=1.50
135 ORD(I)«PHI( I )+8.
CALL DRAW 50 . ABS.ORD.2 .0 .4H PHI .2000.
( . 1 • .0.0.2 .2 .7. 10 .0 »L
DO 14 1=1.100
14 ORD( I )=A( I )/10.
CALL DRAWU00.ABS.ORD.3.0.4H PWR. I T.2000. . 1. .0.0. 2 .2.7. 10 .0 ,L )
IT(1)=8HCLIPPED
CLPLVL=3. 16
DO 20 J=l*3
16 CALL GRADCLIP S , 900 CLPL VL (
DO 18 1=1,100
18 ORD( I )=A( I ) /10.
CALL DRAW( 100 ABS , ORD , 3 , , 4H PWR, I T »2000 • , 1. ,0 .6 » 2 , 2 , 7 , 10 »0 , )
20 CONTINUE
TW0PI=6. 2831853072
ARG2=TWOPI/1200000.
DO 30 1=1,10000
SN( I )=0.
ARG1=ARG2*FL0ATF( I )
DO 30 N=1,NUM
ARG=ARG1* (24000. +1 20. *FLOATF(N) )
DO 51 N=1,NUM
A(N)=0.
B(N)=0.
ARG1=TW0PI*( 24O0O.+120.*FLOATr(N) )/ 1200 000.
DO 50 1=1,10000
ARG = ARG1*FL0ATF ( I )
A(N)=A(N)+SN( )*COSF(ARG) I
A(N)=A(N) /5000.
B(N)=B(N)/5000.
CA(N) = (A(N)**2+3(N)**2)
51 ORD(N)=SQRTF(CA(N) )
IT(2)=8H SSB
CALL DRAW(NUM,ABS,0RD,1 ,0,LA,TT,20 00.,l.,0,0,2,2,7,10,n,L)
CALL POWERF (CA,A)
CALL COHERE CI ,CA ,NUM ,COEF
(
DO 52 1=1,100
52 ORD( )=A( I/10. I )
55 END
115
INITIAL DISTRIBUTION LIST „
No. „
.
Copies
1. Defense Documentation Center 20
Cameron Station
Alexandria, Virginia 22314
2 . Library 2
Naval Postgraduate School
Monterey, California 93940
116
UNCLASSIFIED
Security Classification
3 REPORT TITLE
b. PROJECT NO.
9b. OTHER REPORT NOfS) (Any othar numbara that may ba aaalgnad
thia raport)
<HM
10. A VA IL ABILITY/LIMITATION NOTICES This document has been a^ roved i
+ammmam**mm*m-
11. SUPPLEMENTARY NOTES 12 SPONSORING MILITARY ACTIVITY
13 ABSTRACT
Peak clipping is a well known method of increasing the average power output
of a peak power limited voice communication transmitter. Although the clipping
process introduces distortion, articulation tests have shown that clipped speech
remains highly intelligible. Using idealizations of vowel sounds based on the
mechanism of speech production, calculations were made of the spectra re-
sulting from clipping these speech-like signals. The results indicate a high
degree of similarity between the spectra before and after clipping. The power
gained by clipping at audio frequency and at narrowband was calculated and
compared with previously published data. Repeaking due to component rejection
was investigated for clipping at audio and narrowband. Calculations of the
effect of varying the phase characteristic of the signals before clipping indicate
that such variation may improve the intelligibility of clippdd speech.
FORM
DD 1 JAN 94 1473 UNCLASSIFIED
117 Security Classification
"
UNCLASSIFIED
Security Classification
14- LINK A LINK LINK C
KEY WORDS ROLE WT HOLE WT ROLE WT
Speech
Speech models
Clipping
Repeaking
Speech spectra
Distortion
INSTRUCTIONS
1. ORIGINATING ACTIVITY: Enter the name and address imposed by security classification, using standard statements
of the contractor, subcontractor, grantee, Department of De- such as:
fense activity or other organization (corporate author) iaaulng "Qualified requesters may obtain copies of this
(1)
the report. report from DDG"
2a. REPORT SECURTY CLASSIFICATION: Enter the over-
(2) "Foreign announcement and dissemination of this
all security classification of the report* Indicate whether
report by DDC is not authorized.
"Restricted Data" is included. Marking is to be in accord-
ance with appropriate security regulation*. (3) "U. S. Government agencies may obtain copies of
this report directly from DDC. Other qualified DDC
2b. GROUP: Automatic downgrading is specified in DoD Di- users shall request through
rective 5200. 10 and Armed Forces Industrial Manual. Enter
the group number. Also, when applicable, show that optional
markings have been used for Group 3 and Group 4 as author- "U. S. military agencies may obtain cop ies
ized,
(4) p f this
f report directly from DDC Other quaUfte##lsfs
3. REPORT TITLE: Enter the complete report title in all shall request through
capital letters. Titles in all cases should be unclassified.
If a meaningful title cannot be selected without classifica-
tion, show title classification in all capitals in parenthesis (5) "All distribution of this report is controlled. Qual-
immediately following the title. ified DDC users shall request through
DD 1
FORM
JAN 64 1473 (BACK) 118 UNCLASSIFIED
Security Classification