Factorizing Gender Bias in Automatic Speech Recognition for Mexican Spanish - Department of Natural Language Processing & Knowledge Discovery
Preprints, Working Papers, ... Year : 2024

Factorizing Gender Bias in Automatic Speech Recognition for Mexican Spanish

Abstract

Advances in speech technologies have led to significant progress in large acoustic models such as Whisper and Multilingual Massive Speech (MMS), improving tasks like Automatic Speech Recognition (ASR). Yet, there is still a need for thorough research to recognize and tackle stereotypical biases. In this paper, we investigate Whisper and MMS systems to quantify gender bias and factorize gender bias considering voice timbre, skin tone, and age group for Mexican-Spanish in a multilingual ASR setting. In addition to traditional ASR evaluation such as word error rate and phoneme error rate, we also perform statistical significance tests. Furthermore, we explore the vital role of factorization of gender attributes into sub-groups in bias quantification. This work presents an initial study of gender inclusivity with various factors in the context of MMS and Whisper for Mexican-Spanish.
Fichier principal
Vignette du fichier
Final_SPANISH_ASR_Bias_InterSpeech_2024_V4-merged.pdf (1.21 Mo) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-04607587 , version 1 (10-06-2024)
hal-04607587 , version 2 (20-09-2024)

Licence

Identifiers

  • HAL Id : hal-04607587 , version 2

Cite

Anastasiia Chizhikova, Hannah Billinghurst, Michelle Elizabeth, Shehenaz Hossain, Ajinkya Kulkarni, et al.. Factorizing Gender Bias in Automatic Speech Recognition for Mexican Spanish. 2024. ⟨hal-04607587v2⟩
71 View
81 Download

Share

More