Double-Talk Detection Based on Soft Decision for Acoustic Echo Suppression

Soft Decision-Based Double-Talk Detection for Acoustic Echo Suppression

  • Published: 2009.04.30

Abstract

In this paper, we propose a novel double-talk detection (DTD) technique based on soft decision in the frequency domain. The proposed method computes a global near-end speech presence probability (GNSPP) from a statistical model of speech together with voice activity detection (VAD) decisions on the near-end and far-end signals, and applies it to DTD in the frequency domain. This replaces the traditional hard-decision scheme based on cross-correlation coefficients. The proposed algorithm is evaluated with objective tests under various environments and yields better results than the conventional scheme.

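This paper proposes a new double-talk detection (DTD) algorithm for acoustic echo suppression (AES), based on a soft decision technique in the frequency domain. Instead of the conventional algorithm, which makes a hard decision based on cross-correlation coefficients, the proposed method introduces a frequency-domain soft decision approach that draws on the voice activity detection (VAD) results for the input and far-end signals and on a statistical model of speech, and applies the resulting global near-end speech presence probability (GNSPP) to DTD. In objective experiments comparing it with the conventional method, the proposed algorithm showed superior performance under various background noise environments.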
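The abstract does not give the GNSPP formula, so the following is only a minimal sketch of how a global soft decision of this kind could be computed, not the authors' implementation. It assumes a complex Gaussian statistical model per frequency bin, independent bins, and a posteriori and a priori SNRs (`gammas`, `xis`) measured against the residual echo-plus-noise variance. The function names, the prior ratio, and the way the two VAD flags gate the result are all hypothetical choices made for illustration.

```python
import math


def log_likelihood_ratio(gamma, xi):
    """Per-bin log likelihood ratio of 'near-end speech present' vs 'absent'.

    Assumes a complex Gaussian model: gamma is the a posteriori SNR and
    xi the a priori SNR of the near-end speech in this bin (assumption).
    """
    return gamma * xi / (1.0 + xi) - math.log1p(xi)


def global_presence_probability(gammas, xis, prior_ratio=1.0):
    """Probability that near-end speech is present given all bins of a frame.

    prior_ratio is P(present) / P(absent); bins are assumed independent,
    so the per-bin log likelihood ratios are simply summed.
    """
    z = math.log(prior_ratio) + sum(
        log_likelihood_ratio(g, x) for g, x in zip(gammas, xis)
    )
    # Numerically stable logistic function of the global log odds.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def double_talk_probability(gammas, xis, far_end_active, mic_active):
    """Soft double-talk score for one frame.

    Hypothetical gating: double-talk is only possible when the far-end VAD
    and the microphone (input) VAD both report activity.
    """
    if not (far_end_active and mic_active):
        return 0.0
    return global_presence_probability(gammas, xis)
```

A downstream echo suppressor could use the returned value directly as a soft weight, for example to scale the adaptation step of the echo estimator, instead of thresholding a cross-correlation coefficient into a binary decision.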
