Paterson, M. orcid.org/0009-0004-9144-0438, Moor, J. and Cutillo, L. (2025) Detecting Throat Cancer From Speech Signals Using Machine Learning: A Scoping Literature Review. IEEE Access, 13. 58465 -58480. ISSN 2169-3536
Abstract
Cases of throat cancer are rising worldwide. With survival decreasing significantly at later stages, early detection is vital. Artificial intelligence (AI) and machine learning (ML) have the potential to detect throat cancer from patient speech, facilitating earlier diagnosis and reducing the burden on overstretched healthcare systems. However, no comprehensive review has explored the use of AI and ML for detecting throat cancer from speech. This review aims to fill this gap by evaluating how these technologies perform and identifying issues that need to be addressed in future research. We conducted a scoping literature review across three databases: Scopus, Web of Science, and PubMed. We included articles that classified speech using ML and specified the inclusion of throat cancer patients in their data. Articles were categorised based on whether they performed binary or multi-class classification. We found 27 articles fitting our inclusion criteria, 12 performing binary classification, 13 performing multi-class classification, and two that do both binary and multi-class classification. The most common classification method used was neural networks, and the most frequently extracted feature was mel-spectrograms. We also documented pre-processing methods and classifier performance. We compared each article against the TRIPOD-AI checklist, which showed a significant lack of open science, with only one article sharing code and only three using open-access data. Open-source code is essential for external validation and further development in this field. Our review indicates that no single method or specific feature consistently outperforms others in detecting throat cancer from speech. Future research should focus on standardising methodologies and improving the reproducibility of results.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2025 The Authors. This is an open access article under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. |
Keywords: | Artificial intelligence, machine learning, speech, throat cancer, vocal pathologies |
Dates: |
|
Institution: | The University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering & Physical Sciences (Leeds) > School of Computing (Leeds) |
Depositing User: | Symplectic Publications |
Date Deposited: | 08 Apr 2025 10:18 |
Last Modified: | 08 May 2025 14:54 |
Published Version: | 10.1109/ACCESS.2025.3555767 |
Status: | Published |
Publisher: | IEEE |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:224935 |