Items where authors include "Saz, O."

Number of items: 18.

Article

Deena, S. orcid.org/0000-0001-5417-0556, Hasan, M., Doulaty, M. et al. (2 more authors) (2019) Recurrent neural network language model adaptation for multi-genre broadcast speech recognition and alignment. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (3). pp. 572-582. ISSN 2329-9290

Saz, O. and Hain, T. (2017) Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations. Computer Speech & Language, 41. pp. 180-194. ISSN 0885-2308

Proceedings Paper

Hain, T., Christian, J., Saz, O. et al. (6 more authors) (2016) webASR 2 - Improved cloud based speech technology. In: Proceedings of Interspeech 2016. 17th Annual Conference of the International Speech Communication Association (Interspeech), 08-12 Sep 2016, San Francisco, CA. ISCA.

Doulaty, M., Saz, O., Ng, R.W.M. et al. (1 more author) (2016) Automatic Genre and Show Identification of Broadcast Media. In: Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech). Interspeech 2016, 08-12 Sep 2016, San Francisco. ISCA.

Deena, S., Hasan, M., Doulaty, M. et al. (2 more authors) (2016) Combining feature and model-based adaptation of RNNLMs for multi-genre broadcast speech recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech 2016, 08-12 Sep 2016, San Francisco, USA. pp. 2343-2347.

Ng, W., Nicolao, M., Saz, O. et al. (5 more authors) (2016) The Sheffield language recognition system in NIST LRE 2015. In: Proceedings of The Speaker and Language Recognition Workshop Odyssey 2016. Speaker Odyssey, 21-24 Jun 2016, Bilbao, Spain. ISCA, pp. 181-187.

Doulaty Bashkand, M., Saz, O. and Hain, T. (2015) Data-Selective Transfer Learning for Multi-Domain Speech Recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 16th Annual Conference of the International Speech Communication Association, 06-10 Sep 2015, Dresden, Germany. ISCA (International Speech Communication Association), pp. 2897-2901.

Doulaty Bashkand, M., Saz, O. and Hain, T. (2015) Unsupervised Domain Discovery Using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 16th Annual Conference of the International Speech Communication Association, 06-10 Sep 2015, Dresden, Germany. ISCA (International Speech Communication Association), pp. 3640-3644.

Milner, R., Saz, O., Deena, S. et al. (3 more authors) (2015) The 2015 Sheffield System for Longitudinal Diarisation of Broadcast Media. In: Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ. IEEE, pp. 632-638. ISBN 978-1-4799-7291-3

Saz, O., Doulaty, M., Deena, S. et al. (5 more authors) (2015) The 2015 Sheffield System for Transcription of Multi-Genre Broadcast Media. In: Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, Arizona, USA. IEEE. ISBN 978-1-4799-7291-3

Doulaty, M., Saz, O., Ng, R.W.M. et al. (1 more author) (2015) Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation. In: Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ. IEEE, pp. 130-136. ISBN 978-1-4799-7291-3

Bell, P., Gales, M., Hain, T. orcid.org/0000-0003-0939-3464 et al. (8 more authors) (2015) The MGB Challenge: Evaluating Multi-genre Broadcast Media Recognition. In: Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ. IEEE, pp. 687-693. ISBN 978-1-4799-7291-3

Ng, R.W.M., Doulaty, M., Doddipatla, R. et al. (7 more authors) (2014) The USFD Spoken Language Translation System for IWSLT 2014. In: Federico, M., Stücker, S. and Yvon, F., (eds.) Proceedings of the 11th International Workshop on Spoken Language Translation (SLT 2014). The 11th International Workshop on Spoken Language Translation (IWSLT), 04-05 Dec 2014, Lake Tahoe, US. IWSLT 2014, pp. 86-91.

Saz, O., Doulaty, M. and Hain, T. (2014) Background-tracking acoustic features for genre identification of broadcast shows. In: Spoken Language Technology Workshop (SLT), 2014 IEEE. Spoken Language Technology Workshop (SLT), 07-10 Dec 2014, South Lake Tahoe, NV. IEEE, pp. 118-123. ISBN 9781479971299

Ng, R.W.M., Doulaty, M., Doddipatla, R. et al. (7 more authors) (2014) The USFD SLT System for IWSLT 2014. In: Federico, M., Stücker, S. and Yvon, F., (eds.) Proceedings of the International Workshop on Spoken Language Translation. 11th International Workshop on Spoken Language Translation, 04-05 Dec 2014, Lake Tahoe, California (USA). IWSLT, http://workshop2014.iwslt.org/64.php

Saz, O. and Hain, T. orcid.org/0000-0003-0939-3464 (2014) Using contextual information in Joint Factor Eigenspace MLLR for speech recognition in diverse scenarios. In: Proceedings of the 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04-09 May 2014, Florence, Italy. IEEE.

Saz, O. and Hain, T. orcid.org/0000-0003-0939-3464 (2013) Asynchronous factorisation of speaker and background with feature transforms in speech recognition. In: INTERSPEECH-2013. INTERSPEECH 2013 - 14th Annual Conference of the International Speech Communication Association, 25-29 Aug 2013, Lyon, France. ISCA, pp. 1238-1242.

Lanchantin, P., Bell, P.J., Gales, M.J.F. et al. (9 more authors) (2013) Automatic Transcription of Multi-Genre Media Archives. In: CEUR Workshop Proceedings. First Workshop on Speech, Language and Audio in Multimedia, 22-23 Aug 2013, Marseille, France. pp. 26-31.

This list was generated on Sun Apr 21 15:44:54 2024 BST.