Professional Documents
Culture Documents
Published in the Proceedings of the 8th International Conference on Spoken Language (ICSLP 2004), October 4-8, 2004 in Jeju, Korea. Personal use of this
material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to
servers or lists, or to reuse any copyrighted component of this work in other works, must be obtained from International Speech Communication Association, ISCA.
Dspfactory Ltd., 611 Kumpf Drive, Unit 200, Waterloo, Ontario, Canada, N2V 1K8
tina.soltani@dspfactory.com
Time
Synthesis
Analysis
Frequency Bands
Input
Ft
Ff
Subband
Broadband Data
Subband
flag gain nbb
Distance
Output signal (dBm)
14
28
12
26
10
24
Clippin 8
22 at
inpu
analo 6
20
Linear stag
4
18
12 17 22 27 32 37 42 47 52 57
2
Input signal
Sample
Figure 5: Input response of the system to different
Figure 6: The LAR distance for the input and output signals,
tones applying the broadband, narrowband, and
relative to the no shock carrying speech
combinational gains
-0.5
speech, the output signal of the DSP system is compared to -1
the input signal by using the log area ratio (LAR) distance [4]. 1 2 3 4 5 6
Sample
7 8 9 10 11
4
x 10
In this case, the affected signal is compared to the original
unperturbed speech signal on a sample per sample basis and Figure 7: (a) Shock carrying signal (b) output signal without
the difference is expressed as the distance between the two transient reduction algorithm (c) with transient reduction
signals. In this measurement, the smaller the distance is, the algorithm.
more similar the two signals are. Figure 6 demonstrates the
improvement made in speech quality compared to the original provides low group delay subband processing and it is
shock carrying signal. Both signals are compared to the non- configurable for integration into different headset.
shock carrying speech signal. Currently, the algorithm detects and limits shocks as they
Figure 7 illustrates the input and output signals with and occur. Some standards, such as ITU-T P.360, also specify the
without the transient reduction algorithm. The top graph necessity to monitor sound exposure over specific periods of
shows the unprocessed input signal. The second graph is the time. Future work on the algorithm to meet these
output of the system when only the narrowband gain is requirements will involve using adaptive thresholds based on
applied to reduce the shock and the third graph shows the accumulated energy.
output of the system when the subband and broadband gains
are combined. As it can be observed, the overshoot in the 6. References
output signal at the beginning of the shock is controlled and
reduced significantly when the broadband gain is used in [1] ITU-T P.360, “Efficiency of Devices for Preventing the
parallel with the subband gain. Occurrence of Excessive Acoustic Pressure by Telephone
Receivers”, International Telecommunication Union,
5. Conclusion Dec. 1998.
[2] Choy, G. and Hermann, D. “Subband-based acoustic
The acoustic shock limiting algorithm proposed in this paper shock limiting algorithm on a low resource DSP system”,
limits the output signal to a constant value for different input Proc. Eurospeech 2003.
signals, while minimizing the effects on speech quality. It also [3] Schneider, T., Brennan, R, “An ultra low-power
reduces the transient effect observed at the beginning of the programmable DSP system for hearing aids and other
shock in a subband-only limiting approach. The algorithm audio applications”, Proc. ICSPAT 1999.
does not distort or otherwise effect signals with energy levels
[4] Quackenbush, S.R. et al, Objective Measures of Speech
Quality, Prentice Hall, New Jersey, 1988.