DOI QR코드

DOI QR Code

Probabilistic Bilinear Transformation Space-Based Joint Maximum A Posteriori Adaptation

  • Received : 2012.02.06
  • Accepted : 2012.05.11
  • Published : 2012.10.31

Abstract

This letter proposes a more advanced joint maximum a posteriori (MAP) adaptation using a prior model based on a probabilistic scheme utilizing the bilinear transformation (BIT) concept. The proposed method not only has scalable parameters but is also based on a single prior distribution without the heuristic parameters of the previous joint BIT-MAP method. Experiment results, irrespective of the amount of adaptation data, show that the proposed method leads to a consistent improvement over the previous method.

Keywords

References

  1. Y. Cho and D. Yook, "Maximum Likelihood Training and Adaptation of Embedded Speech Recognizers for Mobile Environments," ETRI J., vol. 32, no. 1, Feb. 2010, pp. 160-162. https://doi.org/10.4218/etrij.10.0209.0242
  2. K.-T. Chen and H.-M. Wang, "Eigenspace-Based Maximum a posteriori Linear Regression for Rapid Speaker Adaptation," Proc. ICASSP, 2000, p. 317-320.
  3. C.J. Leggetter and P.C. Woodland, "Maximum Likelihood Linear Regression for Speaker Adaptation of Continuous Density Hidden Markov Models," Computer Speech Language, vol. 9, no. 2, Apr. 1995, pp. 171-185. https://doi.org/10.1006/csla.1995.0010
  4. J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori Estimation for Multivariate Gaussian Mixture Observations of Markov Chains," IEEE Trans. Speech Audio Process., vol. 2, no. 1, Apr. 1994, pp. 291-298. https://doi.org/10.1109/89.279278
  5. D.K. Kim and N.S. Kim, "Rapid Speaker Adaptation Using Probabilistic Principal Component Analysis," IEEE Signal Process. Lett., vol. 8, no. 6, June 2001, pp. 180-183. https://doi.org/10.1109/97.923045
  6. O. Siohan, C. Chesta, and C.-H. Lee, "Joint Maximum a posteriori Adaptation of Transformation and HMM Parameters," Proc. ICASSP, 2001, pp. 2945-2948.
  7. H.J. Song, Y. Lee, and H.S. Kim, "Joint Bilinear Transformation Space Based Maximum a posteriori Linear Regression Adaptation Using Prior with Variance Function," Proc. Interspeech, 2011, pp. 2577-2580.
  8. M.E. Tipping and C.M. Bishop, "Mixtures of Probabilistic Principal Component Analyzers," Neural Computation, vol. 11, no. 2, 1999, pp. 443-482. https://doi.org/10.1162/089976699300016728

Cited by

  1. Speech Enhancement Using Phase-Dependent A Priori SNR Estimator in Log-Mel Spectral Domain vol.36, pp.5, 2012, https://doi.org/10.4218/etrij.14.2214.0039
  2. Multimodal Unsupervised Speech Translation for Recognizing and Evaluating Second Language Speech vol.11, pp.6, 2012, https://doi.org/10.3390/app11062642