Professional Documents
Culture Documents
5. Conclusion
Unlike traditional AEC, e.g., NLMS and RLS, BSS AEC’s sig-
nal model can be considered full duplex which both near-end
and far-end signals coexists. Near-end speech is explicitly mod-
eled as an independent component in the ICA model. Thus, the
BSS AEC is expected to have better performance in continuous
double-talking scene. In this paper, the Aux-ICA algorithm is
used to solve the AEC problem. After mathematical simplifica-
tion, it can be seen that the Aux-ICA AEC has strong connec-
tion with the RLS algorithm. Specifically, it can be considered
as weighted RLS, with the ICA nonlinearity as the weighting
function. The performance of the proposed approach is verified
by both simulated and real-world experiments. In addition, a
Figure 3: Real-world experimental environment. speech enhancement front-end is unified in the Aux-ICA/IVA
framework for the real-world application of smart TV keyword
Target speech, echo, and noise are captured individually in spotting.
the environment of Figure 3, then, added together as the algo-
6. References acoustic echo cancellation." IEEE transactions on audio, speech,
and language processing 19.3 (2010): 583-599.
[1] Haykin, Simon S. Adaptive filter theory. Pearson Education India, [21] Ikram, Muhammad Z. "Blind source separation and acoustic echo
2005. cancellation: A unified framework." 2012 IEEE International
[2] Morgan, Dennis R., and Steven G. Kratzer. "On a class of com- Conference on Acoustics, Speech and Signal Processing
putationally efficient, rapidly converging, generalized NLMS al- (ICASSP). IEEE, 2012.
gorithms." IEEE Signal Processing Letters 3.8 (1996): 245-247. [22] Buchner, Herbert, and Walter Kellermann. "A fundamental rela-
[3] Albu, Iuliana, Cristian Anghel, and Constantin Paleologu. "Adap- tion between blind and supervised adaptive filtering illustrated for
tive filtering in acoustic echo cancellation systems—A practical blind source separation and acoustic echo cancellation." 2008
overview." 2017 9th International Conference on Electronics, Hands-Free Speech Communication and Microphone Arrays.
Computers and Artificial Intelligence (ECAI). IEEE, 2017. IEEE, 2008.
[4] Yang, Jun. "Multilayer Adaptation Based Complex Echo Cancel- [23] Gunther, Jake. "Learning echo paths during continuous double-
lation and Voice Enhancement." 2018 IEEE International Con- talk using semi-blind source separation." IEEE transactions on
ference on Acoustics, Speech and Signal Processing (ICASSP). audio, speech, and language processing 20.2 (2011): 646-660.
IEEE, 2018. [24] Woodbury, Max A. "Inverting modified matrices." Memorandum
[5] Valin, Jean-Marc. "On adjusting the learning rate in frequency do- report 42.106 (1950): 336.
main echo cancellation with double-talk." IEEE Transactions on [25] Recommendation, ITU-T. "Perceptual evaluation of speech qual-
Audio, Speech, and Language Processing 15.3 (2007): 1030-1034. ity (PESQ): An objective method for end-to-end speech quality
[6] Buchner, Herbert, Jacob Benesty, and Walter Kellermann. "Gen- assessment of narrow-band telephone networks and speech co-
eralized multichannel frequency-domain adaptive filtering: effi- decs." Rec. ITU-T P. 862 (2001).
cient realization and application to hands-free speech communi- [26] Chen, Mengzhe, et al. "Compact Feedforward Sequential
cation." Signal Processing 85.3 (2005): 549-570. Memory Networks for Small-footprint Keyword Spotting." Inter-
[7] Apolinário, JoséAntonio, and R. Rautmann. QRD-RLS adaptive speech. 2018.
filtering. Ed. JoséAntonio Apolinário. New York: Springer, 2009.
[8] Franzen, Jan, and Tim Fingscheidt. "An efficient residual echo
suppression for multi-channel acoustic echo cancellation based on
the frequency-domain adaptive Kalman filter." 2018 IEEE Inter-
national Conference on Acoustics, Speech and Signal Processing
(ICASSP). IEEE, 2018.
[9] Hong, Jungpyo. "Stereophonic Acoustic Echo Suppression for
Speech Interfaces for Intelligent TV Applications." IEEE Trans-
actions on Consumer Electronics 64.2 (2018): 153-161.
[10] Huang, Hai, et al. "A multiframe parametric Wiener filter for
acoustic echo suppression." 2016 IEEE International Workshop
on Acoustic Signal Enhancement (IWAENC). IEEE, 2016.
[11] Lee, Chul Min, Jong Won Shin, and Nam Soo Kim. "DNN-based
residual echo suppression." Sixteenth Annual Conference of the
International Speech Communication Association. 2015.
[12] Carbajal, Guillaume, et al. "Multiple-input neural network-based
residual echo suppression." 2018 IEEE International Conference
on Acoustics, Speech and Signal Processing (ICASSP). IEEE,
2018.
[13] Hyvärinen, Aapo, and Erkki Oja. "Independent component anal-
ysis: algorithms and applications." Neural networks 13.4-5 (2000):
411-430.
[14] Ono, Nobutaka, and Shigeki Miyabe. "Auxiliary-function-based
independent component analysis for super-Gaussian sources." In-
ternational Conference on Latent Variable Analysis and Signal
Separation. Springer, Berlin, Heidelberg, 2010.
[15] Kim, Taesu, et al. "Blind source separation exploiting higher-or-
der frequency dependencies." IEEE transactions on audio, speech,
and language processing 15.1 (2006): 70-79.
[16] Ono, Nobutaka. "Stable and fast update rules for independent vec-
tor analysis based on auxiliary function technique." 2011 IEEE
Workshop on Applications of Signal Processing to Audio and
Acoustics (WASPAA). IEEE, 2011.
[17] Taniguchi, Toru, et al. "An auxiliary-function approach to online
independent vector analysis for real-time blind source separa-
tion." 2014 4th Joint Workshop on Hands-free Speech Communi-
cation and Microphone Arrays (HSCMA). IEEE, 2014.
[18] Scheibler, Robin, and Nobutaka Ono. "Independent vector analy-
sis with more microphones than sources." 2019 IEEE Workshop
on Applications of Signal Processing to Audio and Acoustics
(WASPAA). IEEE, 2019.
[19] Ono, Nobutaka. "Auxiliary-function-based independent vector
analysis with power of vector-norm type weighting functions."
Proceedings of The 2012 Asia Pacific Signal and Information
Processing Association Annual Summit and Conference. IEEE,
2012.
[20] Nesta, Francesco, Ted S. Wada, and Biing-Hwang Juang. "Batch-
online semi-blind source separation applied to multi-channel