Items where authors include "Hain, T."

Export as [feed] Atom [feed] RSS
Number of items: 96.

Article

Ravenscroft, W., Goetze, S. orcid.org/0000-0003-1044-7343 and Hain, T. (2022) Att-TasNet: attending to encodings in time-domain audio speech separation of noisy, reverberant speech mixtures. Frontiers in Signal Processing, 2. 856968. ISSN 2673-8198

Shi, Y., Huang, Q. and Hain, T. orcid.org/0000-0003-0939-3464 (2021) H-VECTORS : improving the robustness in utterance-level speaker embeddings using a hierarchical attention model. Neural Networks, 142. pp. 329-339. ISSN 0893-6080

El Hannani, A., Errattahi, R., Salmam, F.Z. et al. (2 more authors) (2021) Evaluation of the effectiveness and efficiency of state-of-the-art features and models for automatic speech recognition error detection. Journal of Big Data, 8. 5. ISSN 2196-1115

Errattahia, R., Hannani, A.E.L., Hain, T. orcid.org/0000-0003-0939-3464 et al. (1 more author) (2019) System-independent ASR error detection and classification using Recurrent Neural Network. Computer Speech and Language, 55. pp. 187-199. ISSN 0885-2308

Deena, S. orcid.org/0000-0001-5417-0556, Hasan, M., Doulaty, M. et al. (2 more authors) (2019) Recurrent neural network language model adaptation for multi-genre broadcast speech recognition and alignment. IEEE/ACM Transactions on Audio, Speech and Language Processing, 27 (3). pp. 572-582. ISSN 2329-9290

Saz Torralba, O., Deena, S., Doulaty, M. et al. (6 more authors) (2018) Lightly supervised alignment of subtitles on multi-genre broadcasts. Multimedia Systems, 77 (23). pp. 30533-30550. ISSN 0942-4962

Ng, W., Nicolao, M. and Hain, T. (2017) Unsupervised crosslingual adaptation of tokenisers for spoken language recognition. Computer Speech and Language, 46. pp. 327-342. ISSN 0885-2308

Saz, O. and Hain, T. (2017) Acoustic Adaptation to Dynamic Background Conditions with Asynchronous Transformations. Computer, Speech & Language, 41. pp. 180-194. ISSN 0885-2308

Hain, T. orcid.org/0000-0003-0939-3464, Burget, L., Dines, J. et al. (7 more authors) (2012) Transcribing meetings with the AMIDA systems. IEEE Transactions on Audio, Speech and Language Processing, 20 (2). pp. 486-498. ISSN 1558-7916

Conference or Workshop Item

Loweimi, E., Doulaty, M., Barker, J. et al. (1 more author) (2015) Emotion Recognition from the Speech Signal by Effective Combination of Generative and Discriminative Models. In: USES 2015 - The University of Sheffield Engineering Symposium, 24 Jun 2015, The Octagon Centre, University of Sheffield.

Proceedings Paper

Close, G., Ravenscroft, W., Hain, T. et al. (1 more author) (2024) Multi-CMGAN+/+: leveraging multi-objective speech quality metric prediction for speech enhancement. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), 14-19 Apr 2024, Seoul, Korea. Institute of Electrical and Electronics Engineers (IEEE) , pp. 351-355. ISBN 979-8-3503-4486-8

Mogridge, R., Close, G., Sutherland, R. et al. (4 more authors) (2024) Non-intrusive speech intelligibility prediction for hearing-impaired users using intermediate ASR features and human memory models. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), 14-19 Apr 2024, Seoul, Korea. Institute of Electrical and Electronics Engineers (IEEE) , pp. 306-310. ISBN 979-8-3503-4486-8

Ahmad, R., Farooq, M.U. and Hain, T. orcid.org/0000-0003-0939-3464 (2024) Progressive unsupervised domain adaptation for ASR using ensemble models and multi-stage training. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024), 14-19 Apr 2024, Seoul, Korea. Institute of Electrical and Electronics Engineers (IEEE) , pp. 11466-11470. ISBN 979-8-3503-4485-1

Meghanani, A. and Hain, T. orcid.org/0000-0003-0939-3464 (2024) SCORE: Self-supervised correspondence fine-tuning for improved content representations. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), 14-19 Apr 2024, Seoul, Korea. Institute of Electrical and Electronics Engineers (IEEE) , pp. 12086-12090. ISBN 979-8-3503-4486-8

Meghanani, A. and Hain, T. orcid.org/0000-0003-0939-3464 (2024) Deriving translational acoustic sub-word embeddings. In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU) Proceedings. 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16-20 Dec 2023, Taipei, Taiwan. Institute of Electrical and Electronics Engineers (IEEE) . ISBN 9798350306903

Ravenscroft, J.W. orcid.org/0000-0002-0780-3303, Goetze, S. and Hain, T. (2024) On time domain conformer models for monaural speech separation in noisy reverberant acoustic environments. In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE ASRU 2023), 16-20 Dec 2023, Taipei, Taiwan. Institute of Electrical and Electronics Engineers (IEEE) . ISBN 979-8-3503-0690-3

Islam, E., Hain, T. orcid.org/0000-0003-0939-3464 and Nomo Sudro, P. (2024) Simulation of teacher-learner interaction in English language pronunciation learning. In: 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE ASRU 2023), 16-20 Dec 2023, Taipei, Taiwan. Institute of Electrical and Electronics Engineers (IEEE) . ISBN 979-8-3503-0690-3

Ravenscroft, W. orcid.org/0000-0002-0780-3303, Goetze, S. and Hain, T. (2023) Combining conformer and dual-path-transformer networks for single channel noisy reverberant speech separation. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), 14-19 Apr 2024, Seoul, Korea. Institute of Electrical and Electronics Engineers (IEEE) , pp. 11491-11495. ISBN 979-8-3503-4486-8

Nomo Sudro, P., Ragni, A. and Hain, T. orcid.org/0000-0003-0939-3464 (2023) Adapting pretrained models for adult to child voice conversion. In: 2023 31st European Signal Processing Conference (EUSIPCO) Proceedings. 2023 31st European Signal Processing Conference (EUSIPCO), 04-08 Sep 2023, Helsinki, Finland. Institute of Electrical and Electronics Engineers (IEEE) , pp. 271-275. ISBN 9789464593600

Ravenscroft, J. orcid.org/0000-0002-0780-3303, Goetze, S. and Hain, T. (2023) On data sampling strategies for training neural network speech separation models. In: 2023 31st European Signal Processing Conference (EUSIPCO). 31st European Signal Processing Conference (EUSIPCO 2023), 04-08 Sep 2023, Helsinki, Finland. Institute of Electrical and Electronics Engineers (IEEE) . ISBN 978-9-4645-9360-0

Ollerenshaw, A., Jalal, M.A. and Hain, T. orcid.org/0000-0003-0939-3464 (2023) Probing statistical representations for End-to-End ASR. In: 2023 31st European Signal Processing Conference (EUSIPCO) Proceedings. 2023 31st European Signal Processing Conference (EUSIPCO), 04-08 Sep 2023, Helsinki, Finland. Institute of Electrical and Electronics Engineers (IEEE) , pp. 401-405. ISBN 9789464593600

Close, G.L., Ravenscroft, W., Hain, T. orcid.org/0000-0003-0939-3464 et al. (1 more author) (2023) The University of Sheffield CHiME-7 UDASE challenge speech enhancement system. In: Proc. 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023). 7th International Workshop on Speech Processing in Everyday Environments (CHiME 2023), 25 Aug 2023, Dublin, Ireland. International Speech Communication Association (ISCA) , pp. 33-38.

Farooq, M.U. and Hain, T. orcid.org/0000-0003-0939-3464 (2023) Learning cross-lingual mappings for data augmentation to improve low-resource speech recognition. In: Interspeech 2023 Proceedings. Interspeech 2023, 20-24 Aug 2023, Dublin, Ireland. International Speech Communication Association , pp. 5072-5076.

Islam, E. orcid.org/0000-0002-5329-0414, Park, C. orcid.org/0000-0001-6671-1671 and Hain, T. (2023) Exploring speech representations for proficiency assessment in language learning. In: 9th Workshop on Speech and Language Technology in Education (SLaTE) Proceedings. 9th Workshop on Speech and Language Technology in Education (SLaTE), 18-20 Aug 2023, Dublin, Ireland. International Speech Communication Association (ISCA) , pp. 151-155.

Ravenscroft, W., Goetze, S. orcid.org/0000-0003-1044-7343 and Hain, T. (2023) Deformable temporal convolutional networks for monaural noisy reverberant speech separation. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04-10 Jun 2023, Rhodes Island, Greece. Institute of Electrical and Electronics Engineers (IEEE) . ISBN 9781728163284

Close, G., Ravenscroft, W., Hain, T. orcid.org/0000-0003-0939-3464 et al. (1 more author) (2023) Perceive and predict: self-supervised speech representation based loss functions for speech enhancement. In: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Proceedings. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04-10 Jun 2023, Rhodes Island, Greece. Institute of Electrical and Electronics Engineers (IEEE) . ISBN 9781728163284

Close, G. orcid.org/0000-0002-9478-5421, Hain, T. and Goetze, S. (2023) PAMGAN+/-: Improving phase-aware speech enhancement performance via expanded discriminator training. In: AES Convention Europe 2023: 154th Audio Engineering Society Conference. AES Europe 2023: 154th Engineering Society Convention, 13-15 May 2023, Espoo, Helsinki, FInland. Audio Engineering Society , p. 10656.

Ollerenshaw, A., Jalal, M.A. and Hain, T. orcid.org/0000-0003-0939-3464 (2022) Insights of neural representations in multi-banded and multi-channel convolutional transformers for end-to-end ASR. In: Proceedings of 2022 30th European Signal Processing Conference (EUSIPCO). 2022 30th European Signal Processing Conference (EUSIPCO), 29 Aug - 02 Sep 2022, Belgrade, Serbia. Institute of Electrical and Electronics Engineers (IEEE) , pp. 434-438. ISBN 9781665467995

Close, G. orcid.org/0000-0002-9478-5421, Hain, T. orcid.org/0000-0003-0939-3464 and Goetze, S. orcid.org/0000-0003-1044-7343 (2022) MetricGAN+/-: increasing robustness of noise reduction on unseen data. In: Proceedings of 2022 30th European Signal Processing Conference (EUSIPCO). 2022 30th European Signal Processing Conference (EUSIPCO), 29 Aug - 02 Sep 2022, Belgrade, Serbia. Institute of Electrical and Electronics Engineers (IEEE) , Belgrade, Serbia , pp. 165-169. ISBN 9781665467995

Ravenscroft, W., Goetze, S. and Hain, T. orcid.org/0000-0003-0939-3464 (2022) Receptive field analysis of temporal convolutional networks for monaural speech dereverberation. In: Proceedings of 30th European Signal Processing Conference (EUSIPCO 2022). 2022 30th European Signal Processing Conference (EUSIPCO), 29 Aug - 02 Sep 2022, Belgrade, Serbia. Institute of Electrical and Electronics Engineers (IEEE) , pp. 80-84. ISBN 9781665467995

Ravenscroft, W., Goetze, S. and Hain, T. (2022) Utterance weighted multi-dilation temporal convolutional networks for monaural speech dereverberation. In: Proceedings of 2022 International Workshop on Acoustic Signal Enhancement (IWAENC). 2022 International Workshop on Acoustic Signal Enhancement (IWAENC), 05-08 Sep 2022, Bamberg, Germany. Institute of Electrical and Electronics Engineers (IEEE) . ISBN 9781665468688

Farooq, M.U. and Hain, T. orcid.org/0000-0003-0939-3464 (2022) Investigating the impact of cross-lingual acoustic-phonetic similarities on multilingual speech recognition. In: Interspeech 2022 - 23rd Annual Conference of the International Speech Communication Association. Interspeech 2022 - Human and Humanizing Speech Technology, 18-22 Sep 2022, Incheon, Korea. International Speech Communication Association , pp. 3849-3853.

Close, G., Hollands, S., Hain, T. et al. (1 more author) (2022) Non-intrusive speech intelligibility estimated by metric prediction for hearing impaired individuals for the clarity prediction challenge 1. In: Interspeech 2022 - 23rd Annual Conference of the International Speech Communication Association. Interspeech 2022 - Human and Humanizing Speech Technology, 18-22 Sep 2022, Incheon, Korea. International Speech Communication Association , pp. 3483-3487.

Farooq, M.U., Haniya Narayana, D.A. and Hain, T. orcid.org/0000-0003-0939-3464 (2022) Non-linear pairwise language mappings for low-resource multilingual acoustic model fusion. In: Interspeech 2022 - 23rd Annual Conference of the International Speech Communication Association. Interspeech 2022 - Human and Humanizing Speech Technology, 18-22 Sep 2022, Incheon, Korea. International Speech Communication Association , pp. 4850-4854.

Huang, S., Chen, M., Xu, Y. et al. (2 more authors) (2021) WINVC : one-shot voice conversion with weight adaptive instance normalization. In: Pham, D.N., Theeramunkong, T., Governatori, G. and Liu, F., (eds.) PRICAI 2021: Trends in Artificial Intelligence 18th Pacific Rim International Conference on Artificial Intelligence, PRICAI 2021, Hanoi, Vietnam, November 8–12, 2021, Proceedings, Part II. The 18th Pacific Rim International Conference on Artificial Intelligence (PRICAI), 08-12 Nov 2021, Hanoi, Vietnam (virtual). Springer International Publishing , pp. 559-573. ISBN 9783030893620

Ollerenshaw, A. orcid.org/0000-0001-5779-1905, Jalal, M.A. and Hain, T. orcid.org/0000-0003-0939-3464 (2021) Insights on neural representations for end-to-end speech recognition. In: Heřmanský, H., Černocký, H., Burget, L., Lamel, L., Scharenborg, O. and Motlicek, P., (eds.) Interspeech 2021. Interspeech 2021, 30 Aug - 03 Sep 2021, Brno, Czechia. ISCA - International Speech Communication Association , pp. 4079-4083.

Huang, Q. and Hain, T. orcid.org/0000-0003-0939-3464 (2021) Improving audio anomalies recognition using temporal convolutional attention networks. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 06-11 Jun 2021, Toronto, ON, Canada. Institute of Electrical and Electronics Engineers , pp. 6473-6477. ISBN 9781728176062

Chen, M., Shi, Y. and Hain, T. orcid.org/0000-0003-0939-3464 (2021) Towards low-resource StarGAN voice conversion using weight adaptive instance normalization. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 06-11 Jun 2021, Toronto, ON, Canada. Institute of Electrical and Electronics Engineers . ISBN 9781728176062

Shi, Y. and Hain, T. orcid.org/0000-0003-0939-3464 (2021) Supervised speaker embedding de-mixing in two-speaker environment. In: 2021 IEEE Spoken Language Technology Workshop (SLT). 2021 IEEE Spoken Language Technology Workshop (SLT), 19-22 Jan 2021, Shenzhen, China. Institute of Electrical and Electronics Engineers , pp. 758-765. ISBN 9781728170671

Shi, Y. and Hain, T. orcid.org/0000-0003-0939-3464 (2021) Contextual joint factor acoustic embeddings. In: 2021 IEEE Spoken Language Technology Workshop (SLT). 2021 IEEE Spoken Language Technology Workshop (SLT), 19-22 Jan 2021, Shenzhen, China. Institute of Electrical and Electronics Engineers , pp. 750-757. ISBN 9781728170671

Shi, Y., Huang, Q. and Hain, T. orcid.org/0000-0003-0939-3464 (2020) Robust speaker recognition using speech enhancement and attention model. In: Lee, K.A., Koshinaka, T. and Shinoda, K., (eds.) Proceedings of the Speaker and Language Recognition Workshop (Odyssey 2020). The Speaker and Language Recognition Workshop (Odyssey 2020), 01-05 Nov 2020, Tokyo, Japan. ISCA - International Speech Communication Association , pp. 451-458.

Chen, M. and Hain, T. orcid.org/0000-0003-0939-3464 (2020) Unsupervised acoustic unit representation learning for voice conversion using WaveNet auto-encoders. In: Meng, H., Xu, B. and Zheng, T., (eds.) Interspeech 2020. Interspeech 2020, 25-29 Oct 2020, Shanghai, China. ISCA - International Speech Communication Association , pp. 4866-4870.

Jalal, M.A., Milner, R. orcid.org/0000-0001-8924-0593 and Hain, T. orcid.org/0000-0003-0939-3464 (2020) Empirical interpretation of speech emotion perception with attention based model for speech emotion Recognition. In: Proceedings of Interspeech 2020. Interspeech 2020, 25-29 Oct 2020, Shanghai, China (Online). International Speech Communication Association (ISCA) , pp. 4113-4117.

Huang, Q. and Hain, T. orcid.org/0000-0003-0939-3464 (2020) Exploration of audio quality assessment and anomaly localisation using attention models. In: Meng, H., Xu, B. and Zheng, T., (eds.) Proceedings of Interspeech 2020. Interspeech 2020, 25-29 Oct 2020, Shanghai, China. ISCA - International Speech Communication Association , pp. 4611-4615.

Jalal, M.A., Milner, R., Hain, T. orcid.org/0000-0003-0939-3464 et al. (1 more author) (2020) Removing Bias with Residual Mixture of Multi-View Attention for Speech Emotion Recognition. In: Interspeech 2020. Interspeech 2020, 25-29 Oct 2020, Shanghai, China. ISCA - International Speech Communication Association , pp. 4084-4088.

Shi, Y., Huang, Q. and Hain, T. orcid.org/0000-0003-0939-3464 (2020) Speaker re-identification with speaker dependent speech enhancement. In: Meng, H., Xu, B. and Zheng, T., (eds.) Proceedings of Interspeech 2020. Interspeech 2020, 25-29 Oct 2020, Shanghai, China. ISCA - International Speech Communication Association , pp. 1530-1534.

Shi, Y., Huang, Q. and Hain, T. orcid.org/0000-0003-0939-3464 (2020) Weakly supervised training of hierarchical attention networks for speaker identification. In: Meng, H., Xu, B. and Zheng, T., (eds.) Proceedings of Interspeech 2020. Interspeech 2020, 25-29 Oct 2020, Shanghai, China. ISCA - International Speech Communication Association , pp. 2992-2996.

Shi, Y., Huang, Q. and Hain, T. orcid.org/0000-0003-0939-3464 (2020) H-vectors : utterance-level speaker embedding using a hierarchical attention model. In: ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, 04-08 May 2020, Barcelona, Spain (virtual). Institute of Electrical and Electronics Engineers , pp. 7579-7583. ISBN 9781509066322

Jalal, M.A., Loweimi, E., Moore, R.K. orcid.org/0000-0003-0065-3311 et al. (1 more author) (2019) Learning temporal clusters using capsule routing for speech emotion recognition. In: Proceedings of Interspeech 2019. Interspeech 2019, 15-19 Sep 2019, Graz, Austria. ISCA , pp. 1701-1705.

Loweimi, E., Barker, J.P. and Hain, T. orcid.org/0000-0003-0939-3464 (2018) Exploring the use of group delay for generalised VTS based noise compensation. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing Proceedings. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 15-20 Apr 2018, Calgary, Alberta, Canada. IEEE . ISBN 978-1-5386-4658-8

Nicolao, M. orcid.org/0000-0002-4680-2549, Sanders, M. and Hain, T. orcid.org/0000-0003-0939-3464 (2018) Improved acoustic modelling for automatic literacy assessment of children. In: Proceedings of Interspeech 2018. Interspeech 2018, 02-06 Sep 2018, Hyderabad, India. ISCA , pp. 1666-1670.

Deena, S., Ng, R.W.M., Madhyashtha, P. et al. (2 more authors) (2018) Exploring the use of Acoustic Embeddings in Neural Machine Translation. In: Proceedings of IEEE Automatic Speech Recognition and Understanding Workshop. 2017 IEEE Automatic Speech Recognition and Understanding Workshop, December 16-20, 2017, Okinawa, Japan. IEEE . ISBN 978-1-5090-4788-8

Loweimi, E., Barker, J. orcid.org/0000-0002-1684-5660 and Hain, T. orcid.org/0000-0003-0939-3464 (2018) On the usefulness of the speech phase spectrum for pitch extraction. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech 2018, 02-06 Sep 2018, Hyderabad, India. ISCA , pp. 696-700.

Deena, S., Ng, R.W.M., Madhyashta, P. et al. (2 more authors) (2017) Semi-supervised adaptation of RNNLMs by fine-tuning with domain-specific auxiliary features. In: Proceedings of INTERSPEECH 2017: Conference of the International Speech Communication Association. INTERSPEECH 2017: Conference of the International Speech Communication Association, 20-24 Aug 2017, Stockholm. ISCA , pp. 2715-2719.

Wu, C., Ng, R.W.M., Torralba, O.S. et al. (1 more author) (2017) Analysing acoustic model changes for active learning in automatic speech recognition. In: International Conference on Systems, Signals and Image Processing (IWSSIP). International Conference on Systems, Signals and Image Processing (IWSSIP), 22-24 May 2017, Poznań, Poland. IEEE . ISBN 978-1-5090-6344-4

Milner, R. and Hain, T. orcid.org/0000-0003-0939-3464 (2017) DNN approach to speaker diarisation using speaker channels. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 5-9, 2017, New Orleans, USA. IEEE , pp. 4925-4929. ISBN 9781509041176

Ng, W.M., Kwan, A.C.M., Lee, T. et al. (1 more author) (2017) ShefCE: A Cantonese-English Bilingual Speech Corpus for Pronunciation Assessment. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 05/03/2017-09/03/2017, New Orleans, USA. Institute of Electrical and Electronics Engineers . ISBN 978-1-5090-4117-6

Loweimi, E., Barker, J. orcid.org/0000-0002-1684-5660 and Hain, T. orcid.org/0000-0003-0939-3464 (2017) Statistical Normalisation of Phase-based Feature Representation For Robust Speech Recognition. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics Speech and Signal Processing, 05/03/2017 - 09/03/2017, New Orleans. Institute of Electrical and Electronics Engineers . ISBN 978-1-5090-4117-6

Loweimi, E., Barker, J. orcid.org/0000-0002-1684-5660 and Hain, T. orcid.org/0000-0003-0939-3464 (2017) Channel Compensation in the Generalised Vector Taylor Series Approach to Robust ASR. In: Lacerda, F., (ed.) Interspeech 2017. Interspeech 2017, 20-24 Aug 2017, Stockholm. ISCA , pp. 2466-2470.

Loweimi, E., Barker, J. orcid.org/0000-0002-1684-5660, Torralba, O.S. et al. (1 more author) (2017) Robust Source-Filter Separation of Speech Signal in the Phase Domain. In: Proceedings of the Annual Conference of the International Speech Communication Association. Interspeech 2017, 20-24 Aug 2017, Stockholm. ISCA .

Olcoz, J., Saz Torralba, O. and Hain, T. (2016) Error correction in lightly supervised alignment of broadcast subtitles. In: Proceedings of Interspeech 2016. 17th Annual Conference of the International Speech Communication Association (Interspeech), 08-12 Sep 2016, San Francisco, CA. ISCA , pp. 2110-2114.

Hain, T., Christian, J., Saz, O. et al. (6 more authors) (2016) webASR 2 - Improved cloud based speech technology. In: Proceedings of Interspeech 2016. 17th Annual Conference of the International Speech Communication Association (Interspeech), 08-12 Sep 2016, San Francisco, CA. ISCA .

Casanueva, I., Hain, T., Nicolao, M. et al. (1 more author) (2016) Using phone features to improve dialogue state tracking generalisation to unseen states. In: Proceeding of SIGDIAL 2016. The 17th Annual SIGdial Meeting on Discourse and Dialogue, 13-15 Sep 2016, Los Angeles, USA. . ISBN 978-1-945626-23-4

Ng, R., Chettri, B. and Hain, T. orcid.org/0000-0003-0939-3464 (2016) Combining weak tokenisers for phonotactic language recognition in a resource-constrained setting. In: Interspeech 2016. Interspeech, 09-12 Sep 2016, San Francisco, CA. ISCA , pp. 2939-2943.

Loweimi, E., Barker, J. and Hain, T. orcid.org/0000-0003-0939-3464 (2016) Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech 2016, 08-12 Sep 2016, San Fransisco. , pp. 3798-3802.

Doulaty, M., Saz, O., Ng, R.W.M. et al. (1 more author) (2016) Automatic Genre and Show Identification of Broadcast Media. In: Proceedings of the 17th Annual Conference of the International Speech Communication Association (Interspeech). Interspeech 2016, 08-12 Sep 2016, San Francisco. ISCA .

Al-Shareef, S. and Hain, T. orcid.org/0000-0003-0939-3464 (2016) Colloquialising modern standard Arabic text for improved speech recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech 2016, 08-12 Sep 2016, San Francisco, USA. , pp. 1345-1349.

Deena, S., Hasan, M., Doulaty, M. et al. (2 more authors) (2016) Combining feature and model-based adaptation of RNNLMs for multi-genre broadcast speech recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech 2016, 08-12 Sep 2016, San Francisco, USA. , pp. 2343-2347.

Milner, R. and Hain, T. orcid.org/0000-0003-0939-3464 (2016) DNN-based speaker clustering for speaker diarisation. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech 2016, 08-12 Sep 2016, San Francisco, USA. , pp. 2185-2189.

Casanueva, I., Hain, T. orcid.org/0000-0003-0939-3464 and Green, P. (2016) Improving generalisation to new speakers in spoken dialogue state tracking. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech 2016, 08-12 Sep 2016, San Francisco, USA. , pp. 2726-2730.

Liu, Y., Fox, C., Hasan, M. et al. (1 more author) (2016) The Sheffield Wargame Corpus - Day two and day three. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Interspeech 2016, 08-12 Sep 2016, San Francisco, USA. ISCA , pp. 3833-3837.

Ng, W., Nicolao, M., Saz, O. et al. (5 more authors) (2016) The Sheffield language recognition system in NIST LRE 2015. In: Proceedings of The Speaker and Language Recognition Workshop Odyssey 2016. Speaker Odyssey, 21-24 Jun 2016, Bilbao, Spain. ISCA , pp. 181-187.

Errattahi, R., El Hannani, A., Ouahmane, H. et al. (1 more author) (2016) Automatic Speech Recognition Errors Detection Using Supervised Learning Techniques. In: 2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA). 13th International Conference of Computer Systems and Applications (AICCSA), Nov 29 – Dec 02, 2016, Agadir, Morocco. IEEE .

Nicolao, M., Christensen, H., Cunningham, S. et al. (2 more authors) (2016) A Framework for Collecting Realistic Recordings of Dysarthric Speech - the homeService Corpus. In: Proceedings of LREC 2016. LREC 2016, 24-27 May 2016, Portorož, Slovenia. European Language Resources Association . ISBN 978-2-9517408-9-1

Milner, R. and Hain, T. orcid.org/0000-0003-0939-3464 (2016) Segment-oriented evaluation of speaker diarisation performance. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20-25 Mar 2016, Shanghai, China. IEEE . ISBN 978-1-4799-9988-0

Ng, R.W.M., Shah, K., Specia, L. orcid.org/0000-0002-5495-3128 et al. (1 more author) (2016) Groupwise learning for ASR k-best list reranking in spoken language translation. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20-25 Mar 2016, Shanghai. http://dx.doi.org/10.1109/ICASSP.2016.7472853, 2016-M . , pp. 6120-6124. ISBN 9781479999880

Doulaty Bashkand, M., Saz, O. and Hain, T. (2015) Data-Selective Transfer Learning for Multi-Domain Speech Recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 16th Annual Conference of the International Speech Communication Association, 06-10 Sep 2015, Dresden, Germany. ISCA (International Speech Communication Association) , pp. 2897-2901.

Loweimi, E., Barker, J. orcid.org/0000-0002-1684-5660 and Hain, T. orcid.org/0000-0003-0939-3464 (2015) Source-filter Separation of Speech Signal in the Phase Domain. In: 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5. Interspeech 2015, 06-10 Sep 2016, Dresden, Germany. ISCA , pp. 598-602. ISBN 978-1-5108-1790-6

Doulaty Bashkand, M., Saz, O. and Hain, T. (2015) Unsupervised Domain Discovery Using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 16th Annual Conference of the International Speech Communication Association, 06-10 Sep 2015, Dresden, Germany. ISCA (International Speech Communication Association) , pp. 3640-3644.

Nicolao, M., Beeston, A.V. orcid.org/0000-0003-2796-1947 and Hain, T. orcid.org/0000-0003-0939-3464 (2015) Automatic assessment of English learner pronunciation using discriminative classifiers. In: Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on. 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19-24 Apr 2015, Brisbane, Australia. IEEE , pp. 5351-5355.

Liu, Y., Karanasou, P. and Hain, T. (2015) An Investigation into Speaker Informed DNN Front-end for LVCSR. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 40th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19-24 Apr 2015, Brisbane, Australia. IEEE Conference Publications . IEEE , IEEE Xplore . ISBN 978-1-4673-6997-8/15

Milner, R., Saz, O., Deena, S. et al. (3 more authors) (2015) The 2015 Sheffield System for Longitudinal Diarisation of Broadcast Media. In: Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ. IEEE , pp. 632-638. ISBN 978-1-4799-7291-3

Saz, O., Doulaty, M., Deena, S. et al. (5 more authors) (2015) The 2015 Sheffield System for Transcription of Multi–Genre Broadcast Media. In: Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), December 13-17, 2015, Scottsdale, Arizona, USA. IEEE . ISBN 978-1-4799-7291-3

Doulaty, M., Saz, O., Ng, R.W.M. et al. (1 more author) (2015) Latent Dirichlet Allocation Based Organisation of Broadcast Media Archives for Deep Neural Network Adaptation. In: Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ. IEEE , pp. 130-136. ISBN 978-1-4799-7291-3

Bell, P., Gales, M., Hain, T. orcid.org/0000-0003-0939-3464 et al. (8 more authors) (2015) The MGB Challenge: Evaluating Multi-genre Broadcast Media Recognition. In: Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ. IEEE , pp. 687-693. ISBN 978-1-4799-7291-3

Zhang, P., Liu, Y. and Hain, T. (2014) Semi-Supervised DNN Training in Meeting Recognition. In: Proceedings of. 2014 IEEE Spoken Language Technology Workshop (SLT 2014), 07-10 Dec 2014, South Lake Tahoe, California and Nevada, USA. .

Ng, R.W.M., Doulaty, M., Doddipatla, R. et al. (7 more authors) (2014) The USFD Spoken Language Translation System for IWSLT 2014. In: Federico, M., Stücker, S. and Yvon, F., (eds.) Proceedings of the 11th International Workshop on Spoken Language Translation (SLT 2014). The 11th International Workshop on Spoken Language Translation (IWSLT), Dec. 03-04, 2015, Lake Tahoe, US. IWSLT 2014 , pp. 86-91.

Saz, O., Doulaty, M. and Hain, T. (2014) Background-tracking acoustic features for genre identification of broadcast shows. In: Spoken Language Technology Workshop (SLT), 2014 IEEE. Spoken Language Technology Workshop (SLT), 07-10 Dec 2014, South Lake Tahoe, NV. IEEE , 118 - 123. ISBN 9781479971299

Ng, R.W.M., Doulaty, M., Doddipatla, R. et al. (7 more authors) (2014) The USFD SLT System for IWSLT 2014. In: Federico, M., Stücker, S. and Yvon, F., (eds.) Proceedings of the International Workshop on Spoken Language Translation. 11th International Workshop on Spoken Language Translation, 04-05 Dec 2014, Lake Tahoe, California (USA). IWSLT , http://workshop2014.iwslt.org/64.php .

Loweimi, E., Barker, J. and Hain, T. (2014) Compression of Model-based Group Delay Function for Robust Speech Recognition. In: The University of Sheffield Engineering Symposium Conference Proceedings Vol. 1. USES 2014 - The University of Sheffield Engineering Symposium, 24 June 2014, The Octagon Centre, University of Sheffield. .

Saz, O. and Hain, T. orcid.org/0000-0003-0939-3464 (2014) Using contextual information in Joint Factor Eigenspace MLLR for speech recognition in diverse scenarios. In: Proceedings of the 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04-09 May 2014, Florence, Italy. IEEE .

Liu, Y., Zhang, P. and Hain, T. (2014) Using neural network front-ends on far field multiple microphones based speech recognition. In: Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04-09 May 2014, Florence, Italy. IEEE , pp. 5542-5546.

Saz, O. and Hain, T. orcid.org/0000-0003-0939-3464 (2013) Asynchronous factorisation of speaker and background with feature transforms in speech recognition. In: INTERSPEECH-2013. INTERSPEECH 2013 - 14th Annual Conference of the International Speech Communication Association, 25-29 Aug 2013, Lyon, France. ISCA , pp. 1238-1242.

Lanchantin, P., Bell, P.J., Gales, M.J.F. et al. (9 more authors) (2013) Automatic Transcription of Multi-Genre Media Archives. In: CEUR Workshop Proceedings. First Workshop on Speech, Language and Audio in Multimedia, August 22-23, 2013, Marseille, France. , 26–31-26–31.

Fox, C.W., Liu, Y., Zwyssig, E. et al. (1 more author) (2013) The Sheffield Wargames Corpus. In: Proceedings of Interspeech 2013. Interspeech 2013, 25-29 Aug 2013, Lyon, France. ISCA .

Preprint

Close, G. orcid.org/0000-0002-9478-5421, Hain, T. orcid.org/0000-0003-0939-3464 and Goetze, S. orcid.org/0000-0003-1044-7343 (2023) Non intrusive intelligibility predictor for hearing impaired individuals using self supervised speech representations. [Preprint] (Submitted)

This list was generated on Sat Apr 20 19:17:31 2024 BST.