Palomäki, K.J. and Brown, G.J. orcid.org/0000-0001-8565-5476 (2011) A computational model of binaural speech recognition: Role of across-frequency vs. within-frequency processing and internal noise. Speech Communication, 53 (6). pp. 924-940. ISSN 0167-6393
Abstract
This study describes a model of binaural speech recognition that is tested against psychoacoustic findings on binaural speech intelligibility in noise. It consists of models of the auditory periphery, binaural pathway and recognition of speech from glimpses based on the missing data approach, which allows the speech reception threshold (SRT) of the model and listeners to be compared. The binaural advantage based on differences between the interaural time differences (ITD) of the target and masker is modelled using the equalization-cancellation (EC) mechanism, either independently within each frequency channel or across all channels. The model is tested using a stimulus paradigm in which the target speech and noise interference are split into low- and high-frequency bands, so that the ITD in each band can be varied independently. The match between the model and listener data is quantified by a normalized SRT distance and a correlation metric, which demonstrate a slightly better match for the within-channel model (SRT: 0.5 dB, correlation: 0.94), than for the across-channel model (SRT: 0.7 dB, correlation: 0.90). However, as the differences between the approaches are small and non-significant, our results suggest that listeners exploit ITD via a mechanism that is neither fully frequency-dependent nor fully frequency-independent.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2011 Elsevier B.V. This is an author produced version of a paper subsequently published in Speech Communication. Uploaded in accordance with the publisher's self-archiving policy. Article available under the terms of the CC-BY-NC-ND licence (https://creativecommons.org/licenses/by-nc-nd/4.0/). |
Keywords: | Binaural model; Speech recognition; Equalization–cancellation model; Missing data |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 14 Nov 2019 17:05 |
Last Modified: | 14 Nov 2019 17:05 |
Status: | Published |
Publisher: | Elsevier |
Refereed: | Yes |
Identification Number: | 10.1016/j.specom.2011.03.005 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:152456 |
Download
Filename: PEER_stage2_10.1016_j.specom.2011.03.005.pdf
Licence: CC-BY-NC-ND 4.0