Chen, X., Liu, X., Ragni, A., et al. (2017) Future word contexts in neural network language models. In: 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16-20 Dec 2017, Okinawa, Japan. IEEE. ISBN 9781509047895
Abstract
Recently, bidirectional recurrent neural network language models (bi-RNNLMs) have been shown to outperform standard, unidirectional, recurrent neural network language models (uni-RNNLMs) on a range of speech recognition tasks. This indicates that future word context information beyond the word history can be useful. However, bi-RNNLMs pose a number of challenges because they make use of the complete previous and future word context. This impacts both training efficiency and their use within a lattice rescoring framework. In this paper these issues are addressed by proposing a novel neural network structure, the succeeding-word RNNLM (su-RNNLM). Instead of using a recurrent unit to capture the complete future word context, a feedforward unit is used to model a finite number of succeeding words. This model can be trained much more efficiently than bi-RNNLMs and can also be used for lattice rescoring. Experimental results on a meeting transcription task (AMI) show that the proposed model consistently outperforms uni-RNNLMs and yields only a slight degradation compared to bi-RNNLMs in N-best rescoring. Additionally, performance improvements can be obtained using lattice rescoring and subsequent confusion network decoding.
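To make the described structure concrete, below is a minimal sketch of the idea in the abstract: a recurrent unit summarizes the complete word history (as in a uni-RNNLM), while a feedforward unit summarizes a fixed window of k succeeding words instead of a full backward recurrence. This is an illustrative reconstruction, not the authors' implementation; the framework (PyTorch), layer sizes, and all names (`embed_dim`, `hidden_dim`, `k_future`) are assumptions.

```python
import torch
import torch.nn as nn

class SuRNNLM(nn.Module):
    """Sketch of a succeeding-word RNNLM: RNN over the history plus a
    feedforward unit over k future words (all hyperparameters assumed)."""

    def __init__(self, vocab_size, embed_dim=256, hidden_dim=512, k_future=3):
        super().__init__()
        self.k = k_future
        self.embed = nn.Embedding(vocab_size, embed_dim)
        # Recurrent unit over the word history, as in a uni-RNNLM.
        self.history_rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        # Feedforward unit over the k succeeding words; this replaces the
        # backward recurrence a bi-RNNLM would use.
        self.future_ff = nn.Linear(k_future * embed_dim, hidden_dim)
        self.out = nn.Linear(2 * hidden_dim, vocab_size)

    def forward(self, words):
        # words: (batch, seq_len) token ids. In practice the history input
        # is shifted so position t sees only w_1..w_{t-1}; omitted here
        # for brevity.
        emb = self.embed(words)                          # (B, T, E)
        hist, _ = self.history_rnn(emb)                  # (B, T, H)
        # Build the k-word future window for every position by shifting;
        # positions past the sentence end are zero-padded.
        B, T, E = emb.shape
        padded = torch.cat([emb, emb.new_zeros(B, self.k, E)], dim=1)
        future = torch.cat(
            [padded[:, i + 1 : i + 1 + T] for i in range(self.k)], dim=-1
        )                                                # (B, T, k*E)
        fut = torch.tanh(self.future_ff(future))         # (B, T, H)
        return self.out(torch.cat([hist, fut], dim=-1))  # (B, T, vocab)

# Example usage:
model = SuRNNLM(vocab_size=10000)
logits = model(torch.randint(0, 10000, (4, 20)))  # -> (4, 20, 10000)
```

Because the future context is a fixed k-word window rather than the whole remaining sentence, each hypothesis in a lattice can be scored with only finite look-ahead, which is what makes lattice rescoring tractable compared to bi-RNNLMs.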
Metadata
| Field | Value |
|---|---|
| Item Type | Proceedings Paper |
| Authors/Creators | Chen, X.; Liu, X.; Ragni, A.; et al. |
| Copyright, Publisher and Additional Information | © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works. Reproduced in accordance with the publisher's self-archiving policy. |
| Keywords | Bidirectional recurrent neural network; language model; succeeding words; speech recognition |
| Institution | The University of Sheffield |
| Academic Units | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
| Depositing User | Symplectic Sheffield |
| Date Deposited | 24 Oct 2019 13:27 |
| Last Modified | 13 Sep 2024 15:38 |
| Status | Published |
| Publisher | IEEE |
| Refereed | Yes |
| Identification Number (DOI) | 10.1109/ASRU.2017.8268922 |
| Open Archives Initiative ID (OAI ID) | oai:eprints.whiterose.ac.uk:150523 |