You are on page 1of 6

A Hybrid MFWT Technique for Denoising

Audio Signals
2022 2nd International Conference on Innovative Sustainable Computational Technologies (CISCT) | 978-1-6654-7416-0/22/$31.00 ©2022 IEEE | DOI: 10.1109/CISCT55310.2022.10046509

Uma Rani V Jebamalar Tamil Selvi J


Department of Computer Science and Engineering, Department of Computer Science,
Saveetha Engineering College, SRM Institute of Science and Technology,
Chennai, Tamil Nadu, India Ramapuram,Chennai, Tamil Nadu, India,
umaranibharathy@gmail.com jebamalj@srmist.edu.in

Kaladevi Ramar Hariharan Shanmugasundaram


Department of CSE, Department of Computer Science and Engineering,
Saveetha Engineering College, Chennai, India Vardhaman College of Engineering, Hyderabad, India
Kalaramar26@gmail.com mailtos.hariharan@gmail.com

Abstract—Any unwanted background sound is considered denoising. But they have some advantages and disadvantages;
noise. The perception of a sound at the same decibel level is linear filter is suitable for salt and pepper noise removal.
determined by whether it is considered pleasant music or irritating nonlinear filter is suitable to multiplicative or function-based
noise. The removal of such noise is necessary to increase the noise removal. But both are failed to remove noise in sharp
quality of the audio signal as it travels through the communication edges and lines, linear and nonlinear filters [7] perform poorly.
system.A microphone is used in this study to collect real-time
audio signals. A real audio signal is collected and combined with To remove audio sounds in transform domain filtering, low
salt, pepper and white Gaussian noise for this experimental study. pass or high pass filters with a cut-off frequency and a
To remove disturbances from an audio signal, this study integrates frequency domain filter are utilized. However, they impair the
linear and nonlinear spatial filtering mechanisms such as the originality of the audio signal. Wavelet Thresholding [8]
median low pass filter, finite impulse response low pass filter, and reduces picture noise by using threshold-based computations.
wavelet transform approach. The suggested hybrid filter Thresholds can be adaptive or non-adaptive, but if the best
technique is compared to individual filtering methods, and its threshold is not chosen, the original image blurs. As a result, a
performance is measured using fundamental metrics such as mean hybrid filter is required to reduce noise while maintaining signal
square error and signal-to-noise ratio. When compared to other quality.
individual nonlinear filtering approaches, the hybrid method has
a high signal to noise ratio (SNR - 88.04%) and a lower mean Several studies are being carried out to perform denoising
square error (MSE - 0.003). on images, audio, and videos. This work is divided into several
sections. Section II examines various approaches developed by
Keywords—De-noising audio signal, Median filter , FIR researchers to denoising audio signals. Section III discusses the
filter,wavelet transform proposed hybrid methodology for removing noise from audio
streams. Section IV discusses the experimental results of
I. INTRODUCTION proposed methodology after using filters in each level and
Today, a wide range of industries use digital image compares its performance to that of individual filters.
processing [1], including satellites, CT image recognition, face
recognition, pattern matching, dashboard image processing, II. LITERATURE REVIEW
obstacle processing, machine or robot vision and acoustic signal Many studies on text or image denoising, audio and video
analysis, among others. The audio or noise signals were denoising from diverse sources have been undertaken. Several
gathered from multiple sources but were tainted by noise. Data denoising algorithms and noise reduction approaches have been
transmission mistakes, noise reduction techniques like created to eliminate various types of noise from the original
amplifiers and different quantization methods, or thermally image, audio, or video, however they fail to retain their
generated electrons at sensor sites are the main causes of this originality. The noise linked with the texture and edge of a
noise [2]. To better the analysis of the audio signal, this picture, as well as the same frequency of audio and video, is
distorted noise is first eliminated. difficult to remove using conventional methods. This section
summarises some of the research that has been done by various
Audio de-noising [3,4] is a technique for preserving original
research groups.
voices after noise or distortions from noisy backgrounds have
been removed. Researchers [5,6] have developed a variety of Gaetan Frusque et al [9] created a wavelet packet transform
audio de-noising methods to eliminate noise from audio approach to improve audio signal resolution. They use the
sources, including spatial filtering, transform domain filtering, DCASE08 data collection for experimental purposes. Julius et
and wavelet thresholding. In this scenario, spatial filtering is an al. [10] developed a new noise estimator based on the wavelet
effective method for removing additive noise from audio mechanism and combined it with a log spectral subtraction to
signals. The two types of spatial filtering approaches are linear moderate the noise level in windmill sound.
filters and nonlinear filters for image or audio or video

Authorized licensed use limited to: ULAKBIM UASL - KARADENIZ TECHNICAL UNIVERSITY. Downloaded on December 12,2023 at 13:27:10 UTC from IEEE Xplore. Restrictions apply.
Brusa et al [11] have utilized a wavelet transform and
thresholding technique to detect local defeats in rolling bearing
audio signals. They employ principal component analysis to
extract features in the temporal domain. For channel estimation,
Palanisamy et al [12] employed an integrated discrete fourier
transform, least square Wiener, and minimal mean square error
technique. When compared to the traditional discrete fourier
transform technique, it has a lower symbol error rate, a lower
bit error rate, and a higher SNR.
Ihza et al [13] created a method for de-noising heart sound
audio that is integrated. Three algorithms are used: discrete
wavelet transform, low pass filter, and short term fourier
transform. To decrease noise in the decomposition process,
Huijie et al [14] employed a threshold shrinkage technique. Fig.1 Flow of audio noise removal using HMFWT Technique

Y.Cho et al. [15] have developed a denoising-based driver


assistance system for off-road teleoperation. In their study, HMFWT has following stages
Jaroslavceva et al [16] employed the labanotation template 1. Audio signal collection and noisy signal generation
subtraction method to reduce noise for human-robot 2. Median Low Pass filtering
communication. J Hao et al [17] used an FFT to extract 3. FIR Low Pass filtering
frequency and a neural network to decrease signal defects. 4. DWT transformation
Panah et al [18] have created an audio processing pipeline to
improve cardiac sound quality. For signal analysis, Leng et al. A. Audio signal Collection and noisy signal generation
[19] have used a conditional diffusion model. They emphasise
the importance of wavelet transformation in signal denoising. The first step in denoising is to gather real-time audio
Bai et al. [20] have employed a multichannel Wiener filter to signals. To generate the noise signal, add AWGN after
remove noise. gathering the audio signal.

The main objective of proposed methodology is B. Median low Pass Filtering


MLPF is a type of nonlinear filter that successfully removes
a) Developed a hybrid model which combines the
speckle and salt pepper noise from a signal. It also retains all
advantages of three filtering methods, Median filter, FIR filter,
signal regions by employing a dynamic windowing mechanism.
and wavelet transform approach to eliminate noise from audio
signals. C. FIR low pass Filtering
b) It remove noise from distorted audio signals without FIRLPF is an intrinsically stable linear phase symmetric
reducing original quality. filter. It does not require feedback or a reaction to any input
c) Compare the performance of hybrid model at each signal of finite length. Fig.2 depicts the basic block diagram of
stages and provide basic quality metric analysis. FIRLPF.

III. PROPOSED METHODOLOGY


The audio signal is evaluated using noise theory and audio
theory basics. The audio signal is concatenated with AWGN to
form a noisy signal. The Fast Fourier Transform technique is
used to convert this noisy signal from time domain to frequency
domain (FFT). The proposed filter HMFWT would require
creating linear and nonlinear spatial filters such as the Median
Low Pass Filter (MLPF), finite impulse response Low pass
(FIR-LPF) filter, and Discrete Wavelet Transform (DWT)
Fig.2 Block diagram of FIR filter
while taking various factors of filter design into account such
as system order and cutoff frequency. The HMFWT filtering
FIR low pass filter produce output signal Y[n] from input
method includes signal normalization, a decomposition
signal X[n] by convolving input signal x and its impulse
mechanism, and the reconstruction process itself. During the
response  . This is defined by equation (1).
decomposition stage, the audio signal is rebuilt and inverse FFT
is employed to transform the frequency domain to the time [] =  0  × [ − ] (1)

domain. Fig. 1 depicts the flow of audio noise removal using Here  is a unit delay operator in Z- transform notation.
HMFWT technique.Table 1 describe the notations and TABLE 1 NOTATIONS USED
abbreviation used in proposed methodology.

Authorized licensed use limited to: ULAKBIM UASL - KARADENIZ TECHNICAL UNIVERSITY. Downloaded on December 12,2023 at 13:27:10 UTC from IEEE Xplore. Restrictions apply.
Abbreviation and Meaning Step 2:Using equation (2) and (3) ,Calculate SNR and
Notations RMSE of noisy signal.
FFT Fast Fourier Transform 
 =  log0   | ( −  )  (2)
AWGN Additive White Gaussian Noise
DWT Discrete Wavelet Transform 
1(  )
2

SNR Signal to Noise Ratio RMSE = =  (3)



RMSE Root Mean Square Error
IFFT Inverse Fast Fourier Transfer
MF Median Filter Step 3:Using the wavelet transform method, establish a
X[n] ,Y[n] Input, Output signal threshold value for the decomposition stage.The threshold
  Unit delay point Tnewis derived from number of samples N and Noise
intensity  . This is described by equation (4)
N Number of samples
 New Threshold  =  log  (4)
WT Wavelet Transform
AFFT Average Fast Fourier Transform
PR Phase Response Step 4: Decompose received signal into wavelet
MLPF Median Low Pass Filter coefficients using filter bank.
SP Salt and Pepper Step 5: Apply a threshold value to the wavelet coefficients
FIRLPF Finite Impulse Response Low Pass that have been decomposed. It employs the Soft threshold,
Filter which is defined by an equation (5).
HMFWT Hybrid Median,FIR filter Wavelet
Transform r = sign(s)(|s| -Tnew) (5)

D. DWT transformation Step 6: Rebuild the signal with threshold wavelet


DWT transformation [19,20,21] is a useful technique for coefficients and a filter.
minimizing noise in digital communications. The DWT
algorithm uses signal threshold value based on number of
samples and density. It decomposes the incoming signal into
wavelet coefficients using the associated analytical filter banks. 2. RMSE: The Root Mean Square Error is a statistic that is
It processes each sub-band separately in a loop. Across all used to calculate the level of error in statistical models. It
scales and sub-bands, apply the global threshold values to the calculates the average squared difference between observed
decomposed wavelet coefficients. It uses a soft threshold for and predicted visual output.
wavelet denoising. Finally, using threshold wavelet coefficients
and a filter bank, the audio signal is recreated. This process is 
illustrated in DWT algorithm (presented in Table 2).  = √ × ∑ () (7)

Initially, an audio signal is captured by an audio recorder with


IV. RESULTS AND PERFORMANCE ANALYSIS Fs = 8000, which records real-time audio for up to 5 seconds
Matlab is used to run the denoising audio signal tests, and the and saves the resulting audio file (minimum 78.1kb file size) in
filter's performance [21] is measured using SNR and RMSE, as .wav format. Fig. 3 depicts the frequency of the input audio
indicated by equations (6) and (7), respectively. signal, while Fig. 4 depicts the noise analysis of the input audio
signal. The Averaged FFT Spectrum of the input audio signal is
1. SNR : The Signal to Noise Ratio (SNR) is the ratio of as shown in Fig.5. After adding noise (SP and white Gaussian
required output signal strength to distorting noise power
noise), the MLPF, FIRLPF and DWT transform are applied to
that decreases input signal quality..
noisy input signal. Fig. 6 depicts the output of the MLPF.
   
 =  log dB (6)
    

TABLE 2 DWT ALGORITHM FOR AUDIO DENOISING

Algorithm 1: DWT algorithm for audio denoising


Input : Noisy audio signal
Output : Denoising audio signal
Step 1: Read an input signal s(i) and noisy signal r(i).

Authorized licensed use limited to: ULAKBIM UASL - KARADENIZ TECHNICAL UNIVERSITY. Downloaded on December 12,2023 at 13:27:10 UTC from IEEE Xplore. Restrictions apply.
Fig.6 Output of MLPF
Fig. 3 Audio signal after AWGN
The output of FIR LPF is as shown in Fig.7.

Fig.4 Noise in Input Signal


Fig.7 Output of FIR LPF
The output of DWT low pass filter is as shown in Fig.8.

Fig.8 Output of DWT


Fig.5 Input Average FFT Spectrum
The magnitude and phase response of HMFWT as shown in
Fig.9.

Authorized licensed use limited to: ULAKBIM UASL - KARADENIZ TECHNICAL UNIVERSITY. Downloaded on December 12,2023 at 13:27:10 UTC from IEEE Xplore. Restrictions apply.
TABLE 3 PERFORMANCE ANALYSIS OF FILTERS
S.No Metrics MLPF FIRLPF DWFT HMFWT
filter
1 SNR 51.08 67.01 77.2 88.08
2 RMSE 6.02 3.98 0.83 0.003

The output Signal of the final audio signal is shown in Fig.10.


The AFFT spectrum signal of the output audio signal is as
shown in Fig.11.The Spectrum analysis of the output audio
Fig.12.

Fig.11 AFFT Spectrum of output signal

Fig.9 Magnitude(dB), PR (degree) of HMFWT

Fig.12 Output spectrum

V. CONCLUSION

Denoising audio signal is one of the major problems occur in


real-time signal processing. Throughout the real-time audio
acquisition method, the genuine signal is tainted by random
noise. Because of the random noise, the transmission quality is
altered, and the original audio stream is diminished. This study
emphasized the significance of denoising techniques in audio
signals. The HMFWT filter described here incorporates linear
and nonlinear filters, as well as the wavelet transform
technique. HMFWT beats individual filtering algorithms in the
presence of strong background noise, with a high SNR ratio
(88.08) and a reduced RMSE score (0.003). In the future, the
current work will be enhanced using a machine learning
technique. The audio signal after stereo mixing or the medical
Fig.10 Recovered audio signal using HMFWT domain is explored.

Table 3 displays the performance of each filter. According to


this investigation, HMFWT has a lower signal-to-noise ratio REFERENCES
and a lower RMSE score than individual filtering approaches.
[1] Yue, Zhong Qi, "Digital image based numerical methods for mechanical
analyses of geomaterials with actual meso-heterogeneity," in Chinese

Authorized licensed use limited to: ULAKBIM UASL - KARADENIZ TECHNICAL UNIVERSITY. Downloaded on December 12,2023 at 13:27:10 UTC from IEEE Xplore. Restrictions apply.
Journal of Geophysics,vol.1, 2022, pp.108-118.
[2] Singh, Abha, Sumit Kushwaha, Maryam Alarfaj, and Manoj Singh,
"Comprehensive Overview of Backpropagation Algorithm for Digital
Image Denoising." in Electronics vol.11, Issue.10,2022, pp.1590.
[3] C.Yu, L.Z. Hou,”Realization of a Real-Time Image Denoising System
for Dashboard Camera Applications,” in IEEE Transactions on
Consumer Electronics,2022, pp 1-12.
[4] Pouyani, MozhdeFiroozi, Mansour Vali, and Mohammad Amin
Ghasemi, "Lung sound signal denoising using discrete wavelet transform
and artificial neural network," in Biomedical Signal Processing and
Control, Issue.72 ,2022, pp103-129.
[5] Zhang, Youshan, and Li Jialu ,"Bird Sounds Denoising: Deep Visual
Audio Denoising for Bird Sounds," in arXiv preprint arXiv:2210,2022
pp.10196-10205.
[6] Liu, Xuanxin, Haiyan Zhang, Qi Jiang, Lili Ren, Zhibo Chen, Youqing
Luo, and Juhu Li,"Acoustic Denoising Using Artificial Intelligence for
Wood-Boring Pests Semanotus bifasciatus Larvae Early Monitoring." in
Sensors, vol.22, Issue. 10, 2022, pp 3861-3872.
[7] Li, Zhixiong, Xihao Liu, Atilla Incecik, Munish Kumar Gupta, Grzegorz
M. Królczyk, and Paolo Gardoni. "A novel ensemble deep learning
model for cutting tool wear monitoring using audio sensors." in Journal
of Manufacturing Processes,vol.79, 2022, pp. 233-249.
[8] P.Venkata Lavanya, C.Venkata Narasimhulu, K. Satya Prasad, "Image
Denoising Using an Artificial Neural Network and Genetic Optimization
Algorithm Based Dual-Tree Complex Wavelet Transform," in Soft
Computing and Signal Processing, Springer, 2022, pp. 393-401.
[9] Frusque, Gaetan, Olga Fink. "Robust Time Series Denoising with
Learnable Wavelet Packet Transform," in arXiv preprint
arXiv:2206.06126, 2022, pp 1-12.
[10] Juodakis, Julius, and Stephen Marsland. "Wind‐robust sound event
detection and denoising for bioacoustics," in Methods in Ecology and
Evolution, vol.9, issue.11, 2022, pp.2005-2017.
[11] Brusa, Eugenio, Cristiana Delprete, Simone Gargiuli, and Lorenzo Giorio.
"Screening of Discrete Wavelet Transform Parameters for the Denoising
of Rolling Bearing Signals in Presence of Localised Defects." in Sensors,
vol.1,2023, pp1-8.
[12] Palanisamy, SatheeshKumar, Fahima Hajjej, Osamah Ibrahim Khalaf,
and Ghaida Muttashar Abdulsahib. "Discrete Fourier Transform with
Denoise Model Based Least Square Wiener Channel Estimator for
Channel Estimation in MIMO-OFDM." in Entropy, vol.11,Issue 24
2022,pp.1601-1612.
[13] Ihza, Muhammad YaumilIhza, Satria Mandala, and Miftah Pramudyo.
"Study of Denoising Method to Detect Valvular Heart Disease Using
Phonocardiogram (PCG)," in Indonesia Journal on Computing (Indo-
JC) vol.7, issue.1,2022,pp 31-38.
[14] M.A Huijie, L. I. Shunming, L. U. Jiantao, Zongzhen Zhang, and G. O.
N. G. Siqi. "Structured sparsity assisted online convolution sparse coding
and its application on weak signature detection," in Chinese Journal of
Aeronautics, vol.35, Issue.1,2022,pp.266-276.
[15] Y. Cho, H. Yun, J. Lee, A. Ha, and J.Yun. "GoonDAE: Denoising-Based Driver
Assistance for Off-Road Teleoperation," in arXiv preprint, 2022.
[16] Jaroslavceva, J., Wake, N., Sasabuchi, K. and Ikeuchi, K., 2022. Robot Ego‐Noise
Suppression with Labanotation‐Template Subtraction. IEEJ Transactions on
Electrical and Electronic Engineering, vol.17,issue.3, pp.407-415.
[17] J.Hao, "Optimizing the Design of a Vocal Teaching Platform Based on Big Data
Feature Analysis of the Audio Spectrum," in Wireless Communications and
Mobile Computing 2022,pp.1-15.
[18] Panah, Davoud Shariat, Andrew Hines, Joseph A. McKeever, and Susan
McKeever. "An audio processing pipeline for acquiring diagnostic quality heart
sounds via mobile phone," in Computers in Biology and Medicine vol.145,2022
pp.105415.
[19] Leng, Yichong, Zehua Chen, Junliang Guo, Haohe Liu, Jiawei Chen, Xu Tan, Danilo
Mandic et al. "BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for
Binaural Audio Synthesis." in arXiv preprint,2022.
[20] Bai, R. Mingsian, and Fan-Jie Kung. "Speech Enhancement by Denoising
and Dereverberation Using a Generalized Sidelobe Canceller-Based
Multichannel Wiener Filter," Journal of the Audio Engineering
Society,vol. 70, Issue.3,2022,pp. 140-155.
[21] M.F Pouyani, M.Vali, M.A Ghasemi,”Lung sound signal denoising using
discrete wavelet transform and artificial neural network,”.in Biomedical
Signal Processing and Control, Vol.72, 2022, pp. 103-329.

Authorized licensed use limited to: ULAKBIM UASL - KARADENIZ TECHNICAL UNIVERSITY. Downloaded on December 12,2023 at 13:27:10 UTC from IEEE Xplore. Restrictions apply.

You might also like