Elghazaly, H., Mirheidari, B., Moosavi, N.S. orcid.org/0000-0002-8332-307X et al. (1 more author) (2025) Exploring gender disparities in automatic speech recognition technology. In: Schulte, B. and Feldhus, N., (eds.) Proceedings of the ISCA/ITG Workshop on Diversity in Large Speech and Language Models. ISCA/ITG Workshop on Diversity in Large Speech and Language Models, 20 Feb 2025, Berlin, Germany. , arXiv.
Abstract
This study investigates factors influencing Automatic Speech Recognition (ASR) systems' fairness and performance across genders, beyond the conventional examination of demographics. Using the LibriSpeech dataset and the Whisper small model, we analyze how performance varies across different gender representations in training data. Our findings suggest a complex interplay between the gender ratio in training data and ASR performance. Optimal fairness occurs at specific gender distributions rather than a simple 50-50 split. Furthermore, our findings suggest that factors like pitch variability can significantly affect ASR accuracy. This research contributes to a deeper understanding of biases in ASR systems, highlighting the importance of carefully curated training data in mitigating gender bias.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Editors: |
|
| Copyright, Publisher and Additional Information: | © 2025 The Author(s). This conference paper is made available under a Creative Commons Attribution 4.0 International License. (https://creativecommons.org/licenses/by/4.0/) |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
| Date Deposited: | 17 Feb 2026 14:23 |
| Last Modified: | 17 Feb 2026 14:52 |
| Status: | Published |
| Refereed: | Yes |
| Identification Number: | 10.48550/arXiv.2502.18434 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:238012 |
Download
Filename: 2502.18434v1.pdf
Licence: CC-BY 4.0

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)