Speaker embedding informed audiovisual active speaker detection for egocentric recordings

Clarke, J., Gotoh, Y. orcid.org/0000-0003-1668-0867 and Goetze, S. (2025) Speaker embedding informed audiovisual active speaker detection for egocentric recordings. In: ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 06-11 Apr 2025, Hyderabad, India. Institute of Electrical and Electronics Engineers (IEEE). ISBN 9798350368758

Abstract

Metadata

Item Type: Proceedings Paper
Authors/Creators: Clarke, J.; Gotoh, Y. (orcid.org/0000-0003-1668-0867); Goetze, S.
Copyright, Publisher and Additional Information:

© 2025 The Author(s). Except as otherwise noted, this author-accepted version of a paper published in ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings is made available via the University of Sheffield Research Publications and Copyright Policy under the terms of the Creative Commons Attribution 4.0 International License (CC-BY 4.0), which permits unrestricted use, distribution and reproduction in any medium, provided the original work is properly cited. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/

Keywords: Diarization; Audiovisual Active Speaker Detection; Video-based Face Recognition; Speaker Recognition
Dates:
  • Published: 7 March 2025
  • Published (online): 7 March 2025
  • Accepted: 20 December 2024
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User: Symplectic Sheffield
Date Deposited: 18 Feb 2025 11:11
Last Modified: 14 Mar 2025 16:29
Status: Published
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Refereed: Yes
Identification Number: 10.1109/ICASSP49660.2025.10890414
