Sound Source Localization Method Using Spatially Mapped GCC Functions

Original Korean title: 공간좌표로 사상된 GCC 함수를 이용한 음원 위치 추정 방법

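  • Kwon, B. H. (Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology)
  • Park, Y. J. (Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology)
  • Park, Y.-S. (Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology)
  • Published: 2009.04.20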

Abstract

Sound source localization based on the time delay of arrival (TDOA) is applied in many research fields, such as robot auditory systems and teleconferencing. When multiple microphones are used to localize a source in three-dimensional space, conventional TDOA-based methods determine the source position from the TDOAs of all microphone arrays together with a detection measure that represents the error between the actual source position and the estimated ones. The performance of these methods usually depends on the number of microphones, because that number determines the resolution of the estimated position. In this paper, we propose a localization method that uses spatially mapped generalized cross-correlation (GCC) functions. Unlike previous methods, the proposed method does not rely on the TDOA alone; it uses spatially mapped GCC functions, which are cross-correlation functions mapped onto the spatial coordinates by an appropriate mapping function. The spatially mapped GCC functions are summed into a single function over the global coordinate, and the source position is then determined from the summed GCC function. The performance of the proposed method with respect to noise and estimation resolution is verified by experiments in a real environment. The mean estimation error of the proposed method is much smaller than that of the conventional methods, and the percentage of correct estimates improves by 30% when the error bound is $\pm 20^{\circ}$.
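
The idea of mapping each cross-correlation function onto spatial coordinates and summing the results can be illustrated with a minimal sketch. The abstract does not specify the weighting, the mapping function, or the geometry, so everything concrete below is an assumption made for illustration: PHAT weighting, a far-field plane-wave model, a 2-D azimuth grid, a sound speed of 343 m/s, and the hypothetical function names `gcc_phat` and `summed_gcc_azimuth`. It should not be read as the authors' implementation.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s; assumed value, not taken from the paper


def gcc_phat(x, y, n_fft):
    """GCC between x and y; the PHAT weighting is an assumed choice."""
    cross = np.fft.rfft(x, n_fft) * np.conj(np.fft.rfft(y, n_fft))
    cross /= np.abs(cross) + 1e-12      # keep only the phase (PHAT)
    cc = np.fft.irfft(cross, n_fft)
    return np.fft.fftshift(cc)          # zero lag sits at index n_fft // 2


def summed_gcc_azimuth(signals, mic_xy, fs, azimuths_deg):
    """Map each pair's GCC onto an azimuth grid, sum the maps, return the peak.

    signals: (n_mics, n_samples) array; mic_xy: (n_mics, 2) positions in metres.
    A far-field source and a planar array are assumed for illustration.
    """
    signals = np.asarray(signals, dtype=float)
    mic_xy = np.asarray(mic_xy, dtype=float)
    azimuths_deg = np.asarray(azimuths_deg, dtype=float)
    n_mics, n_samples = signals.shape
    n_fft = 2 * n_samples               # zero-pad to limit circular wrap-around
    center = n_fft // 2

    theta = np.deg2rad(azimuths_deg)
    directions = np.stack([np.cos(theta), np.sin(theta)], axis=1)  # (n_az, 2)

    summed = np.zeros(len(azimuths_deg))
    for i in range(n_mics):
        for j in range(i + 1, n_mics):
            cc = gcc_phat(signals[i], signals[j], n_fft)
            # Mapping function: candidate azimuth -> expected lag (in samples)
            # of microphone i relative to microphone j under the plane-wave model.
            lag = -(directions @ (mic_xy[i] - mic_xy[j])) * fs / SPEED_OF_SOUND
            idx = np.clip(center + np.rint(lag).astype(int), 0, n_fft - 1)
            summed += cc[idx]           # accumulate the spatially mapped GCC

    return azimuths_deg[int(np.argmax(summed))], summed
```

In this sketch the estimate is read from the peak of the summed function over the whole grid, so every microphone pair contributes its full correlation function rather than a single TDOA value. A 3-D version would replace the azimuth grid with a grid of candidate positions or directions and use the corresponding delay model.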
