Comparison of Vowel and Text-Based Cepstral Analysis in Dysphonia Evaluation

발성장애 평가 시 /a/ 모음연장발성 및 문장검사의 켑스트럼 분석 비교

  • Kim, Tae Hwan (Department of Otorhinolaryngology-Head and Neck Surgery, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine) ;
  • Choi, Jeong Im (Department of Otorhinolaryngology-Head and Neck Surgery, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine) ;
  • Lee, Sang Hyuk (Department of Otorhinolaryngology-Head and Neck Surgery, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine) ;
  • Jin, Sung Min (Department of Otorhinolaryngology-Head and Neck Surgery, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine)
  • 김태환 (성균관대학교 의과대학 강북삼성병원 이비인후과학교실) ;
  • 최정임 (성균관대학교 의과대학 강북삼성병원 이비인후과학교실) ;
  • 이상혁 (성균관대학교 의과대학 강북삼성병원 이비인후과학교실) ;
  • 진성민 (성균관대학교 의과대학 강북삼성병원 이비인후과학교실)
  • Received : 2015.11.15
  • Accepted : 2015.11.21
  • Published : 2015.12.31

Abstract

Background : Cepstral analysis which is obtained from Fourier transformation of spectrum has been known to be effective indicator to analyze the voice disorder. To evaluate the voice disorder, phonation of sustained vowel /a/ sound or continuous speech have been used but the former was limited to capture hoarseness properly. This study is aimed to compare the effectiveness in analysis of cepstrum between the sustained vowel /a/ sound and continuous speech. Methods : From March 2012 to December 2014, total 72 patients was enrolled in this study, including 24 unilateral vocal cord palsy, vocal nodule and vocal polyp patients, respectively. The entire patient evaluated their voice quality by VHI (Voice Handicap Index) before and after treatment. Phonation of sustained vowel /a/ sample and continuous speech using the first sentence of autumn paragraph was subjected by cepstral analysis and compare the pre-treatment group and post-treatment group. Results : The measured values of pre and post treatment in CPP-a (cepstral peak prominence in /a/ vowel sound) was 13.80, 13.91 in vocal cord palsy, 16.62, 17.99 in vocal cord nodule, 14.19, 18.50 in vocal cord polyp respectively. Values of CPP-s (cepstral peak prominence in text-based speech) in pre and post treatment was 11.11, 12.09 in vocal cord palsy, 12.11, 14.09 in vocal cord nodule, 12.63, 14.17 in vocal cord polyp. All 72 patients showed subjective improvement in VHI after treatment. CPP-a showed statistical improvement only in vocal polyp group, but CPP-s showed statistical improvement in all three groups (p<0.05). Conclusion : In analysis of cepstrum, text-based analysis is more representative in voice disorder than vowel sound speech. So when the acoustic analysis of voice by cepstrum, both phonation of sustained vowel /a/ sound and text based speech should be performed to obtain more accurate result.

Keywords

References

  1. Roy N, Stemple J, Merrill RM, Thomas L. Epidemiology of voice disorders in the elderly: preliminary findings. Laryngoscope 2007;117(4):628-33. https://doi.org/10.1097/MLG.0b013e3180306da1
  2. Ruben RJ. Redefining the survival of the fittest: communication disorders in the 21st century. Laryngoscope 2000;110(2 Pt 1):241-5. https://doi.org/10.1097/00005537-200002010-00010
  3. Heman-Ackah YD, Heuer RJ, Michael DD, Ostrowski R, Horman M, Baroody MM, et al. Cepstral peak prominence: a more reliable measure of dysphonia. Ann Otol Rhinol Laryngol 2003;112(4):324-33. https://doi.org/10.1177/000348940311200406
  4. Heman-Ackah YD. Reliability of calculating the cepstral peak without linear regression analysis. J Voice 2004;18(2):203-8. https://doi.org/10.1016/j.jvoice.2004.01.005
  5. Zieger K, Schneider C, Gerull G, Mrowinski D. [Cepstrum analysis in voice disorders]. Folia Phoniatr Logop 1995;47(4):210-7. https://doi.org/10.1159/000266352
  6. Radish Kumar B, Bhat JS, Prasad N. Cepstral analysis of voice in persons with vocal nodules. J Voice 2010;24(6):651-3. https://doi.org/10.1016/j.jvoice.2009.07.008
  7. Park MC, Mun MK, Lee SH, Jin SM. Clinical Usefulness of Cepstral Analysis in Dysphonia Evaluation. Korean J Otorhinolaryngol-Head Neck Surg DE - 2013-09-27 2013;56(9):574-8. https://doi.org/10.3342/kjorl-hns.2013.56.9.574
  8. Moers C, Mobius B, Rosanowski F, Noth E, Eysholdt U, Haderlein T. Vowel- and text-based cepstral analysis of chronic hoarseness. J Voice 2012;26(4):416-24. https://doi.org/10.1016/j.jvoice.2011.05.001
  9. Parsa V, Jamieson DG. Acoustic discrimination of pathological voice: sustained vowels versus continuous speech. J Speech Lang Hear Res 2001;44(2):327-39. https://doi.org/10.1044/1092-4388(2001/027)
  10. Askenfelt AG, Hammarberg B. Speech waveform perturbation analysis: a perceptual-acoustical comparison of seven measures. J Speech Hear Res 1986;29(1):50-64. https://doi.org/10.1044/jshr.2901.50
  11. Carding PN, Steen IN, Webb A, MacKenzie K, Deary IJ, Wilson JA. The reliability and sensitivity to change of acoustic measures of voice quality. Clin Otolaryngol Allied Sci 2004;29(5):538-44. https://doi.org/10.1111/j.1365-2273.2004.00846.x
  12. Hillenbrand J, Houde RA. Acoustic correlates of breathy vocal quality: dysphonic voices and continuous speech. J Speech Hear Res 1996;39(2):311-21. https://doi.org/10.1044/jshr.3902.311
  13. Hillenbrand J. A methodological study of perturbation and additive noise in synthetically generated voice signals. J Speech Hear Res 1987;30(4):448-61. https://doi.org/10.1044/jshr.3004.448
  14. Klingholtz F. Acoustic recognition of voice disorders: a comparative study of running speech versus sustained vowels. J Acoust Soc Am 1990;87(5):2218-24. https://doi.org/10.1121/1.399189