Noise Suppression Using Normalized Time-Frequency Bin Average and Modified Gain Function for Speech Enhancement in Nonstationary Noisy Environments

  • Lee, Soo-Jeong (Department of Computer Engineering, Kwangwoon University) ;
  • Kim, Soon-Hyob (Department of Computer Engineering, Kwangwoon University)
  • Published : 2008.03.31

Abstract

A noise suppression algorithm is proposed for nonstationary noisy environments. The proposed algorithm is different from the conventional approaches such as the spectral subtraction algorithm and the minimum statistics noise estimation algorithm in that it classifies speech and noise signals in time-frequency bins. It calculates the ratio of the variance of the noisy power spectrum in time-frequency bins to its normalized time-frequency average. If the ratio is greater than an adaptive threshold, speech is considered to be present. Our adaptive algorithm tracks the threshold and controls the trade-off between residual noise and distortion. The estimated clean speech power spectrum is obtained by a modified gain function and the updated noisy power spectrum of the time-frequency bin. This new algorithm has the advantages of simplicity and light computational load for estimating the noise. This algorithm reduces the residual noise significantly, and is superior to the conventional methods.

Keywords

References

  1. M. Bhatnagar, "A modified spectral subtraction method combined with perceptual weighting for speech enhancement", Master's thesis, University of Texas at Dallas, 2003
  2. S. F. Boll., "Suppression of acoustic noise in speech using spectral subtraction", IEEE Trans. Acoust. Speech Signal Processing, 27 (2), 113-120, 1979 https://doi.org/10.1109/TASSP.1979.1163209
  3. Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean-square errror short-time spectral amplitude estimator", IEEE Trans. Acoust. Speech Signal Processing, 32(6), 1109-1121, 1984 https://doi.org/10.1109/TASSP.1984.1164453
  4. O. Cappe, "Elimination of the musical noise phenomenon with the Ephraim and Malah noise suppressor", IEEE Trans. Speech Audio Processing, 2(2), 346-349, 1994
  5. Y. Hu, "Subspace and multitaper methods for speech enhancement", Ph.D. dissertation. University of Texas at Dallas, 2003
  6. R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics", IEEE Trans. Speech Audio Processing, 9(5), 504-512, 2001 https://doi.org/10.1109/89.928915
  7. Y. Hu and P. Loizou, "Speech enhancement based on wavelet thresholding the multi taper spectrum", IEEE Trans. Speech Audio Processing, 12(1), 59-67 https://doi.org/10.1109/TSA.2003.819949
  8. R. Sundarrajan and C. L. Philipos, "A noise-estimation algorithm for highly nonstationary environments", Speech Communication, 48, 220-231, 2006 https://doi.org/10.1016/j.specom.2005.08.005
  9. I. Cohen, "Noise spectrum in adverse environments: improved minima controlled recursive averaging", IEEE Trans. SpeechAudio Processing, 11(5), 466-475, 2003 https://doi.org/10.1109/TSA.2003.811544
  10. I. Cohen, "Speech enhancement using a noncausal a priori SNR estimator", IEEE Signal Processing Letters, 11(9), 725-728, 2004 https://doi.org/10.1109/LSP.2004.833478
  11. C. L. Philipos, "Speech Enhancement (Theory and Practice", 1st edition. CRC Press, Boca Raton, FL, 2007
  12. J. L. Soo, "Speech enhancement based on Variance of Spectral Power in Time-Frequency for Speech Recognition in Non-stationary Noisy Environments", Ph.D, Thesis, Kwangwoon University, Seoul, Korea, December
  13. ITU-T, "Subjective test methodology for evaluating speech communication systems that include noise suppression algorithm", ITU-T Recommendation pp.835
  14. L. Lin, W. H. Holmes and E. Ambikairajah, "Adaptive noise estimation algorithm for speech enhancement", Electronic Letters, 39(9), 754-755, 2003 https://doi.org/10.1049/el:20030480
  15. A. V. Oppenheim and R. W. Schafer, "Discrete-Time Signal Processing", 2nd edition. Prentice Hall, Upper Saddle River, NJ, 1999