Ensembling synchronisation-based and face–voice association paradigms for robust active speaker detection in egocentric recordings

Clarke, J. orcid.org/0000-0002-1032-6472, Gotoh, Y. and Goetze, S. (2025) Ensembling synchronisation-based and face–voice association paradigms for robust active speaker detection in egocentric recordings. In: Karpov, A. and Gosztolya, G., (eds.) Speech and Computer: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13-15, 2025, Proceedings, Part II. 27th International Conference, SPECOM 2025, 13-15 Oct 2025, Szeged, Hungary. Lecture Notes in Computer Science, LNAI 16188. Springer Cham, pp. 289-301. ISBN: 9783032079589. ISSN: 0302-9743. EISSN: 1611-3349.

Abstract

Metadata

Item Type: Proceedings Paper
Authors/Creators:
Editors:
  • Karpov, A.
  • Gosztolya, G.
Copyright, Publisher and Additional Information:

© 2025 The Author(s). Except as otherwise noted, this author-accepted version of a paper published in Speech and Computer: 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13-15, 2025, Proceedings, Part II is made available via the University of Sheffield Research Publications and Copyright Policy under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/

Keywords: Face-voice association; Audiovisual active speaker detection; egocentric recordings
Dates:
  • Accepted: 31 July 2025
  • Published: 13 October 2025
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Funding Information:
Funder
Grant number
META PLATFORM INC
UNSPECIFIED
Engineering and Physical Sciences Research Council
2588133
Engineering and Physical Sciences Research Council
2638501
Date Deposited: 15 Aug 2025 07:46
Last Modified: 22 Oct 2025 15:19
Status: Published
Publisher: Springer Cham
Series Name: Lecture Notes in Computer Science
Refereed: Yes
Identification Number: 10.1007/978-3-032-07959-6_21
Related URLs:
Open Archives Initiative ID (OAI ID):

Export

Statistics