Skip to Main content Skip to Navigation
Conference papers

Automatic extraction of speech rhythm descriptors for speech intelligibility assessment in the context of Head and Neck Cancers

Abstract : The temporal dimension of speech acoustics is rarely taken into account in automatic models for Speech Intelligibility evaluation, although the rhythmic recurrence of phonemes, syllables and prosodic groups are allegedly good predictors of speech intelligibility. The present study aims at unravelling those automatic parameters that best account for the different levels of the speech signal's rhythmic structure, and to evaluate their correlation with a perceptual intelligibility measure. The parameters are extracted from the Fourier Transform of the amplitude modulation of the signal (Envelope Modulation Spectrum) [1, 2]. A Lasso linear model for feature selection is first implemented to select the most relevant parameters, and a SVR regression analysis is run to reveal the best parameters' combination. Our analyses of EMS, using data from the French corpora of cancer speech C2SI [3], show strong performances of the automatic prediction, with a correlation of 0.70 between our model and an intelligibility evaluation score by speech-pathologists. In particular, the highest correlation with speech intelligibility lies in the ratio between the energy in the low frequency band (0.5-4 Hz that represents slow rhythmic modulations indicative of prosodic groups) and in the higher one (4-10 Hz that represents fast rhythmic modulations like phonemes).
Complete list of metadata

https://hal-univ-tlse3.archives-ouvertes.fr/hal-03269227
Contributor : Jérôme Farinas <>
Submitted on : Wednesday, June 23, 2021 - 9:22:25 PM
Last modification on : Friday, June 25, 2021 - 3:38:09 AM

File

Interspeech2021_1736_Paper.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03269227, version 1

Citation

Robin Vaysse, Jérôme Farinas, Corine Astésano, Régine André-Obrecht. Automatic extraction of speech rhythm descriptors for speech intelligibility assessment in the context of Head and Neck Cancers. INTERSPEECH 2021, ISCA : International Speech and Communication Association, Aug 2021, Brno, Czech Republic. ⟨hal-03269227⟩

Share

Metrics

Record views

66

Files downloads

31