Bailey, George orcid.org/0000-0001-5137-8394 (2016) Automatic detection of sociolinguistic variation using forced alignment. In: University of Pennsylvania Working Papers in Linguistics:Selected Papers from New Ways of Analyzing Variation (NWAV 44). University of Pennsylvania Working Papers in Linguistics . , pp. 10-20.
Abstract
Forced alignment software is now widely used in contemporary sociolinguistics, and is quickly becoming a crucial methodological tool as an increasing number of studies begin to utilise ‘big data.’ This study investigates the possibility of taking forced alignment one step further towards the goal of complete automation; specifically, it expands the functionality of FAVE-align to fully automate the coding of three sociolinguistic variables in British English: (th)-fronting, (td)-deletion, and (h)-dropping. This involved the expansion of pronouncing dictionaries to reflect the surface output of these variable rules; FAVE then compares the fit of competing acoustic models with the speech signal to determine the surface variant. It does so with an impressive degree of accuracy, largely comparable to inter-transcriber agreement for all variables; however, the pattern of its mistakes, which are largely false positives, suggests a difficulty in identifying the voiceless segments of (td) and (th). Although it is reassuring that inter-transcriber agreement was also lowest for these tokens, it should be noted that FAVE’s accuracy decreases in faster speech rates while no comparable effect is found for agreement among human transcribers.
Metadata
Item Type: | Proceedings Paper |
---|---|
Authors/Creators: |
|
Dates: |
|
Institution: | The University of York |
Academic Units: | The University of York > Faculty of Arts and Humanities (York) > Language and Linguistic Science (York) |
Depositing User: | Pure (York) |
Date Deposited: | 03 Dec 2018 09:50 |
Last Modified: | 08 Feb 2025 00:11 |
Status: | Published |
Series Name: | University of Pennsylvania Working Papers in Linguistics |
Related URLs: | |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:139456 |