Xu, X., Brown, G.J. orcid.org/0000-0001-8565-5476 and Ma, N. (2025) Sound-based sleep staging using pretrained speech foundation models. In: Proceedings of the 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 14-18 Jul 2025, Copenhagen, Denmark. Institute of Electrical and Electronics Engineers (IEEE). ISBN: 9798331586195. ISSN: 2694-0604. EISSN: 2375-7477.
Abstract
Sleep staging is essential for diagnosing sleep disorders and understanding sleep physiology, yet traditional polysomnography (PSG) is costly, intrusive, and impractical for large-scale or home-based monitoring. Wearable devices provide alternatives but face limitations such as motion artifacts and inconsistent skin contact. In this study, we propose a non-contact sleep staging approach using sound-based analysis. To address data scarcity, we employ transfer learning with pretrained speech foundation models, originally developed for automatic speech recognition but capable of capturing rich acoustic representations beyond linguistic content. By analysing temporal attention weights in the extracted HuBERT embeddings, we demonstrate that these models effectively capture respiratory patterns. Our findings suggest that repurposing speech foundation models for sleep staging provides a scalable, non-contact alternative to PSG, with promising applications in home-based and clinical settings.Clinical Relevance—This demonstrates the feasibility of a non-contact, sound-based approach to sleep staging, potentially improving access and adherence in clinical and home settings.
Metadata
| Item Type: | Proceedings Paper |
|---|---|
| Authors/Creators: |
|
| Copyright, Publisher and Additional Information: | © 2025 The Author(s). Except as otherwise noted, this author-accepted version of a conference paper published in Proceedings of the 2025 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) is made available via the University of Sheffield Research Publications and Copyright Policy under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ |
| Keywords: | Analytical models; Foundation models; Transfer learning; Predictive models; Sleep apnea; Data models; Skin; Wearable devices; Monitoring; Microphones |
| Dates: |
|
| Institution: | The University of Sheffield |
| Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
| Date Deposited: | 07 Aug 2025 14:32 |
| Last Modified: | 12 Dec 2025 11:37 |
| Status: | Published |
| Publisher: | Institute of Electrical and Electronics Engineers (IEEE) |
| Refereed: | Yes |
| Identification Number: | 10.1109/EMBC58623.2025.11253429 |
| Related URLs: | |
| Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:230050 |
Download
Filename: EMBC_2025___Sleep_Staging.pdf
Licence: CC-BY 4.0

CORE (COnnecting REpositories)
CORE (COnnecting REpositories)