TY - JOUR
T1 - Relating Cepstral Peak Prominence to Cyclical Parameters of Vocal Fold Vibration from High-Speed Videoendoscopy Using Machine Learning
T2 - A Pilot Study
AU - Popolo, Peter S.
AU - Johnson, Aaron M.
N1 - Publisher Copyright:
© 2020 The Voice Foundation
PY - 2021/9
Y1 - 2021/9
N2 - Objective: Smoothed cepstral peak prominence (CPPs) has been shown to be an effective indicator of breathiness (Hillenbrand and Houde, 1996). High-speed videoendoscopy (HSV) is frequently being used as a complement to stroboscopy especially when asymmetric or aperiodic vocal fold vibration is present in dysphonic voices. In an HSV image data set obtained with normal (nondisordered) voice subjects, we have observed that some degree of asymmetry is present in many of the vocal fold displacement curves extracted from the HSV exam videos; therefore, we have used this data set for a pilot study to investigate the relationship of CPPs to cyclical vocal fold vibration parameters, including left-right vocal fold (LVRF) phase asymmetry, in subjects with normal (nondisordered) voices. Methods: Twenty subjects with normal (nondisordered) voices produced sustained vowel phonations while undergoing a transoral HSV examination of the vocal folds with synchronized recording of the voice signal. Glottal area waveform (GAW) and cyclical parameters open quotient (OQ), closed quotient (CQ), speed quotient (SQ), and LVRF skew were extracted from the HSV exam videos, and CPPs measures were obtained from acoustic analysis of the audio recordings.
AB - Objective: Smoothed cepstral peak prominence (CPPs) has been shown to be an effective indicator of breathiness (Hillenbrand and Houde, 1996). High-speed videoendoscopy (HSV) is frequently being used as a complement to stroboscopy especially when asymmetric or aperiodic vocal fold vibration is present in dysphonic voices. In an HSV image data set obtained with normal (nondisordered) voice subjects, we have observed that some degree of asymmetry is present in many of the vocal fold displacement curves extracted from the HSV exam videos; therefore, we have used this data set for a pilot study to investigate the relationship of CPPs to cyclical vocal fold vibration parameters, including left-right vocal fold (LVRF) phase asymmetry, in subjects with normal (nondisordered) voices. Methods: Twenty subjects with normal (nondisordered) voices produced sustained vowel phonations while undergoing a transoral HSV examination of the vocal folds with synchronized recording of the voice signal. Glottal area waveform (GAW) and cyclical parameters open quotient (OQ), closed quotient (CQ), speed quotient (SQ), and LVRF skew were extracted from the HSV exam videos, and CPPs measures were obtained from acoustic analysis of the audio recordings.
KW - High-speed videoendoscopy—Cepstral peak prominence—Glottal area waveform—Vocal fold displacement—Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85081914613&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081914613&partnerID=8YFLogxK
U2 - 10.1016/j.jvoice.2020.01.026
DO - 10.1016/j.jvoice.2020.01.026
M3 - Article
C2 - 32173147
AN - SCOPUS:85081914613
SN - 0892-1997
VL - 35
SP - 703
EP - 716
JO - Journal of Voice
JF - Journal of Voice
IS - 5
ER -