Hobern, Sam, Archer-Boyd, Alan and MURPHY, DAMIAN THOMAS orcid.org/0000-0002-6676-9459 (2025) Investigating the use of Deep Convolutional Neural Networks for Direction-of-Arrival Estimation on Raw Stereo Audio. In: Audio Engineering Society E-Library. AES International Conference on Artificial Intelligence and Machine Learning for Audio 2025, 08-10 Sep 2025 Audio Engineering Society, GBR.
Abstract
This paper outlines the use of raw stereo audio as the input to a Deep Convolutional Neural Network for the purpose of Direction-of-Arrival (DOA) estimation based on source localisation over the frontal hemisphere of a 50-point Lebedev loudspeaker array using a pair of spaced stereo microphones. Results show the model is capable of a high classification accuracy with evidence of some ability to generalise to unseen data highlighting the benefits of raw audio as the input feature to models.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | This is an author-produced version of the published paper. Uploaded in accordance with the University’s Research Publications and Open Access policy. |
Dates: |
|
Institution: | The University of York |
Academic Units: | The University of York > Faculty of Sciences (York) > Electronic Engineering (York) |
Funding Information: | Funder Grant number EPSRC EP/W020602/1 |
Depositing User: | Pure (York) |
Date Deposited: | 17 Sep 2025 13:20 |
Last Modified: | 17 Sep 2025 13:20 |
Status: | Published |
Publisher: | Audio Engineering Society |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:231815 |