Ensembling synchronisation-based and face–voice association paradigms for robust active speaker detection in egocentric recordings

Clarke, J. orcid.org/0000-0002-1032-6472, Gotoh, Y. and Goetze, S. (Accepted: 2025) Ensembling synchronisation-based and face–voice association paradigms for robust active speaker detection in egocentric recordings. In: Speech and Computer: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13-14, 2025, Proceedings. SPECOM 2025, 13-14 Oct 2025, Szeged, Hungary. Lecture Notes in Computer Science. Springer Cham. ISSN: 0302-9743, EISSN: 1611-3349 (In Press)

Metadata

Item Type: Proceedings Paper
Copyright, Publisher and Additional Information: © 2025 The Author(s).

Keywords: Face-voice association; Audiovisual active speaker detection; egocentric recordings
Dates:
  • Accepted: 31 July 2025
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Funding Information:
  • META PLATFORM INC (grant number: UNSPECIFIED)
  • Engineering and Physical Sciences Research Council (grant number: 2588133)
  • Engineering and Physical Sciences Research Council (grant number: 2638501)
Depositing User: Symplectic Sheffield
Date Deposited: 15 Aug 2025 07:46
Last Modified: 15 Aug 2025 07:46
Status: In Press
Publisher: Springer Cham
Series Name: Lecture Notes in Computer Science
Refereed: Yes

Download

Accepted Version: under temporary embargo

Filename: _Jason__SPECOM_2025.pdf
