A corpus of audio-visual Lombard speech with frontal and profile views

Abstract

This paper presents a bi-view (front and side) audiovisual Lombard speech corpus, which is freely available for download. It contains 5400 utterances (2700 Lombard and 2700 plain reference utterances), produced by 54 talkers, with each utterance in the dataset following the same sentence format as the audiovisual “Grid” corpus [Cooke, Barker, Cunningham, and Shao (2006). J. Acoust. Soc. Am. 120(5), 2421–2424]. Analysis of this dataset confirms previous research, showing prominent acoustic, phonetic, and articulatory speech modifications in Lombard speech. In addition, gender differences are observed in the size of Lombard effect. Specifically, female talkers exhibit a greater increase in estimated vowel duration and a greater reduction in F2 frequency.

Metadata

Item Type:	Article
Authors/Creators:	Alghamdi, N. Maddock, S. Marxer, R. Barker, J. Brown, G.J. https://orcid.org/0000-0001-8565-5476
Copyright, Publisher and Additional Information:	© 2018 Acoustical Society of America. This is an author produced version of a paper subsequently published in Journal of the Acoustical Society of America. Uploaded in accordance with the publisher's self-archiving policy.
Dates:	Accepted: 29 May 2018 Published (online): 26 June 2018 Published: 26 June 2018
Institution:	The University of Sheffield
Academic Units:	The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Depositing User:	Symplectic Sheffield
Date Deposited:	13 Jun 2018 11:16
Last Modified:	27 Jan 2020 14:21
Published Version:	https://doi.org/10.1121/1.5042758
Status:	Published
Publisher:	Acoustical Society of America
Refereed:	Yes
Identification Number:	10.1121/1.5042758
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:131924

CORE (COnnecting REpositories)

A corpus of audio-visual Lombard speech with frontal and profile views

Abstract

Metadata

Download

Accepted Version

Export

Statistics