Face-voice association for audiovisual active speaker detection in egocentric recordings

Clarke, J., Gotoh, Y. (orcid.org/0000-0003-1668-0867) and Goetze, S. (2025) Face-voice association for audiovisual active speaker detection in egocentric recordings. In: 2025 33rd European Signal Processing Conference (EUSIPCO), 08-12 Sep 2025, Palermo, Italy. Institute of Electrical and Electronics Engineers, pp. 66-70. ISBN: 9798350391831.

Metadata

Item Type: Proceedings Paper
Authors/Creators: Clarke, J.; Gotoh, Y.; Goetze, S.
Copyright, Publisher and Additional Information:

© 2025 The Authors. Except as otherwise noted, this author-accepted version of a conference proceeding published in 2025 33rd European Signal Processing Conference (EUSIPCO) is made available via the University of Sheffield Research Publications and Copyright Policy under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/

Keywords: Biometrics; Visualization; Adaptation models; Biological system modeling; Pipelines; Transformers; Acoustics; Recording; Synchronization; Noise measurement
Dates:
  • Accepted: 20 September 2025
  • Published (online): 17 November 2025
  • Published: 17 November 2025
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Date Deposited: 20 Jan 2026 09:17
Last Modified: 21 Jan 2026 11:14
Status: Published
Publisher: Institute of Electrical and Electronics Engineers
Refereed: Yes
Identification Number: 10.23919/eusipco63237.2025.11226795
