Items where authors include "Ragni, A."
Article
Chen, X., Liu, X., Wang, Y. et al. (3 more authors) (2019) Exploiting future word contexts in neural network language models for speech recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27 (9). pp. 1444-1454. ISSN 2329-9290
Wu, C., Gales, M.J.F., Ragni, A. et al. (2 more authors) (2018) Improving interpretability and regularization in deep learning. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26 (2). pp. 256-265. ISSN 2329-9290
Proceedings Paper
Leung, W.-Z. orcid.org/0009-0003-4888-1951, Cross, M., Ragni, A. et al. (1 more author) (2024) Training data augmentation for dysarthric automatic speech recognition by text-to-dysarthric-speech synthesis. In: Proceedings of Interspeech 2024. Interspeech 2024, 01-05 Sep 2024, Kos Island, Greece. International Speech Communication Association (ISCA) , pp. 2494-2498.
Sun, W., Tu, Z. and Ragni, A. orcid.org/0000-0003-0634-4456 (2024) Energy-based models for speech synthesis. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024), 14-19 Apr 2024, COEX, Seoul, Korea. Institute of Electrical and Electronics Engineers (IEEE) , pp. 12667-12671. ISBN 979-8-3503-4486-8
Mogridge, R., Close, G., Sutherland, R. et al. (4 more authors) (2024) Non-intrusive speech intelligibility prediction for hearing-impaired users using intermediate ASR features and human memory models. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), 14-19 Apr 2024, Seoul, Korea. Institute of Electrical and Electronics Engineers (IEEE) , pp. 306-310. ISBN 979-8-3503-4486-8
Cross, M. and Ragni, A. orcid.org/0000-0003-0634-4456 (2024) What happens to diffusion model likelihood when your model is conditional? In: Coelho, C., Zimmering, B., Fernanda, M., Costa, P., Ferras, L.L. and Niggemann, O., (eds.) Proceedings of Machine Learning Research. 1st ECAI Workshop on “Machine Learning Meets Differential Equations: From Theory to Applications”, 20 Oct 2024, Santiago de Compostela, Spain. Proceedings of Machine Learning Research , pp. 1-14.
Yuan, R., Ma, Y., Li, Y. et al. (22 more authors) (2023) MARBLE: Music Audio Representation Benchmark for Universal Evaluation. In: Advances in Neural Information Processing Systems (NeurIPS 2023). 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 10-16 Dec 2023, New Orleans, USA. Neural Information Processing Systems Foundation, Inc. (NeurIPS) .
Ma, Y., Yuan, R., Li, Y. et al. (12 more authors) (2023) On the effectiveness of speech self-supervised learning for music. In: Sarti, A., Antonacci, F., Sandler, M., Bestagini, P., Dixon, S., Liang, B., Richard, G. and Pauwels, J., (eds.) ISMIR 2023: 24th International Society for Music Information Retrieval Conference proceedings. 24th International Society for Music Information Retrieval Conference (ISMIR 2023), 05-09 Nov 2023, Milan, Italy. International Society for Music Information Retrieval (ISMIR) , pp. 457-465. ISBN 978-1-7327299-3-3
Nomo Sudro, P., Ragni, A. and Hain, T. orcid.org/0000-0003-0939-3464 (2023) Adapting pretrained models for adult to child voice conversion. In: 2023 31st European Signal Processing Conference (EUSIPCO) Proceedings. 2023 31st European Signal Processing Conference (EUSIPCO), 04-08 Sep 2023, Helsinki, Finland. Institute of Electrical and Electronics Engineers (IEEE) , pp. 271-275. ISBN 9789464593600
Flynn, R. and Ragni, A. orcid.org/0000-0003-0634-4456 (2023) Leveraging cross-utterance context for ASR decoding. In: Proceedings of Interspeech 2023. INTERSPEECH 2023, 20-24 Aug 2023, Dublin, Ireland. ISCA - International Speech Communication Association , pp. 1359-1363.
Nicholls, D., Knill, K., Gales, M.J.F. et al. (2 more authors) (2023) Speak & improve: L2 English speaking practice tool. In: Proceedings of Interspeech 2023. INTERSPEECH 2023, 20-24 Aug 2023, Dublin, Ireland. International Speech Communication Association (ISCA) , pp. 3669-3670.
Li, Y., Yuan, R., Zhang, G. et al. (11 more authors) (2022) LV-49: MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning. In: 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 23rd International Society for Music Information Retrieval Conference (ISMIR 2022), 04-08 Dec 2022, Bengaluru, India. International Society for Music Information Retrieval (ISMIR) .
Li, Y., Zhang, G., Yang, B. et al. (4 more authors) (2022) HERB: Measuring hierarchical regional bias in pre-trained language models. In: He, Y., Ji, H., Liu, Y., Li, S. and Chang, C.-H., (eds.) Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022. The 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 20-23 Nov 2022, Online. Association for Computational Linguistics , pp. 334-346. ISBN 9781959429043
Li, Q., Ness, P.M., Ragni, A. et al. (1 more author) (2019) Bi-directional lattice recurrent neural networks for confidence estimation. In: ICASSP 2019. ICASSP 2019, 12-17 May 2019, Brighton, UK. IEEE , pp. 6755-6759. ISBN 9781479981311
Ragni, A., Li, Q., Gales, M.J.F. et al. (1 more author) (2019) Confidence estimation and deletion prediction using bidirectional recurrent neural networks. In: 2018 IEEE Spoken Language Technology Workshop (SLT). IEEE Spoken Language Technology Workshop (SLT), 18-21 Dec 2018, Athens, Greece. IEEE , pp. 204-211. ISBN 9781538643358
Wang, Y., Wong, J.H.M., Gales, M.J.F. et al. (2 more authors) (2018) Sequence teacher-student training of acoustic models for automatic free speaking language assessment. In: 2018 IEEE Spoken Language Technology Workshop (SLT). 2018 IEEE Spoken Language Technology Workshop (SLT), 18-21 Dec 2018, Athens, Greece. IEEE . ISBN 9781538643358
Wang, Y., Chen, X., Gales, M.J.F. et al. (2 more authors) (2018) Phonetic and graphemic systems for multi-genre broadcast transcription. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ICASSP 2018 - Signal Processing and Artificial Intelligence: Changing the World, 15-20 Apr 2018, Calgary, AB, Canada. IEEE . ISBN 9781538646595
Chen, O., Ragni, A. orcid.org/0000-0003-0634-4456, Gales, M. et al. (1 more author) (2018) Active memory networks for language modeling. In: Proceedings of Interspeech 2018. Interspeech 2018, 02-06 Sep 2018, Hyderabad, India. International Speech Communication Association (ISCA) , pp. 3338-3342.
Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M. (2018) Automatic speech recognition system development in the "wild". In: Interspeech 2018. Interspeech 2018, 02-06 Sep 2018, Hyderabad, India. International Speech Communication Association (ISCA) , pp. 2217-2221.
Knill, K., Gales, M., Kyriakopoulos, K. et al. (4 more authors) (2018) Impact of ASR performance on free speaking language assessment. In: Interspeech 2018. Interspeech 2018, 02-06 Sep 2018, Hyderabad, India. International Speech Communication Association (ISCA) , pp. 1641-1645.
Chen, X., Liu, X., Ragni, A. et al. (2 more authors) (2017) Future word contexts in neural network language models. In: 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16-20 Dec 2017, Okinawa, Japan. IEEE . ISBN 9781509047895
Chen, X., Ragni, A., Liu, X. et al. (1 more author) (2017) Investigating bidirectional recurrent neural network language models for speech recognition. In: Proceedings of Interspeech 2017. Interspeech 2017, 20-24 Aug 2017, Stockholm, Sweden. International Speech Communication Association (ISCA) , pp. 269-273.
Knill, K.M., Gales, M.J.F., Kyriakopoulos, K. et al. (2 more authors) (2017) Use of graphemic lexicons for spoken language assessment. In: Proceedings of Interspeech 2017. Interspeech 2017, 20-24 Aug 2017, Stockholm, Sweden. International Speech Communication Association (ISCA) , pp. 2774-2778.
Gales, M.J.F., Knill, K.M. and Ragni, A. (2017) Low-resource speech recognition and keyword-spotting. In: Karpov, A., Potapova, R. and Mporas, I., (eds.) Speech and Computer : 19th International Conference, SPECOM 2017. 19th International Conference, SPECOM 2017, 12-16 Sep 2017, Hatfield, UK. Springer International Publishing , pp. 3-19. ISBN 9783319664286
Malinin, A., Knill, K., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (2 more authors) (2017) An attention based model for off-topic spontaneous spoken response detection : an initial study. In: Engwall, O. and Lopes, J.D., (eds.) 7th ISCA Workshop on Speech and Language Technology in Education (SLaTE). 7th ISCA Workshop on Speech and Language Technology in Education (SLaTE), 25-26 Aug 2017, Stockholm, Sweden. SLaTE Conference Proceedings . ISCA , pp. 144-149.
Malinin, A., Ragni, A. orcid.org/0000-0003-0634-4456, Knill, K. et al. (1 more author) (2017) Incorporating uncertainty into deep learning for spoken language assessment. In: Barzilay, R. and Kan, M.-Y., (eds.) Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2 : Short Papers). 55th Annual Meeting of the Association for Computational Linguistics, 30 Jul - 04 Aug 2017, Vancouver, Canada. Association for Computational Linguistics . ISBN 9781945626760
Ragni, A. orcid.org/0000-0003-0634-4456, Saunders, D., Zahemszky, P. et al. (3 more authors) (2017) Morph-to-word transduction for accurate and efficient automatic speech recognition and keyword search. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 05-09 Mar 2017, New Orleans, LA, USA. IEEE , pp. 5770-5774. ISBN 9781509041183
Chen, X., Ragni, A. orcid.org/0000-0003-0634-4456, Vasilakes, J. et al. (3 more authors) (2017) Recurrent neural network language models for keyword search. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 05-09 Mar 2017, New Orleans, LA, USA. IEEE , pp. 5775-5779. ISBN 9781509041183
Ragni, A. orcid.org/0000-0003-0634-4456, Wu, C., Gales, M.J.F. et al. (2 more authors) (2017) Stimulated training for automatic speech recognition and keyword search in limited resource conditions. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 05-09 Mar 2017, New Orleans, LA, USA. IEEE , pp. 4830-4834. ISBN 9781509041183
Yang, J., Ragni, A. orcid.org/0000-0003-0634-4456, Gales, M.J.F. et al. (1 more author) (2016) Log-linear system combination using structured support vector machines. In: Interspeech 2016. Interspeech 2016, 08-12 Sep 2016, San Francisco, CA, USA. International Speech Communication Association (ISCA) .
Ragni, A. orcid.org/0000-0003-0634-4456, Dakin, E., Chen, X. et al. (2 more authors) (2016) Multi-language neural network language models. In: Interspeech 2016. Interspeech 2016, 08-12 Sep 2016, San Francisco, CA, USA. International Speech Communication Association (ISCA) .
Yang, J., Zhang, C., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (2 more authors) (2016) System combination with log-linear models. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20-25 Mar 2016, Shanghai, China. IEEE . ISBN 9781479999880
Cui, J., Kingsbury, B., Ramabhadran, B. et al. (16 more authors) (2016) Multilingual representations for low resource speech recognition and keyword search. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ, USA. IEEE , pp. 259-266. ISBN 9781479972913
van Dalen, R.C., Yang, J., Wang, H. et al. (3 more authors) (2016) Structured discriminative models using deep neural-network features. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ, USA. IEEE , pp. 160-166. ISBN 9781479972913
Mendels, G., Cooper, E., Soto, V. et al. (5 more authors) (2015) Improving speech recognition and keyword search for low resource languages using web data. In: INTERSPEECH 2015 : 16th Annual Conference of the International Speech Communication Association. INTERSPEECH 2015 : 16th Annual Conference of the International Speech Communication Association, 06-10 Sep 2015, Dresden, Germany. International Speech Communication Association (ISCA) , pp. 829-833.
Wang, H., Ragni, A. orcid.org/0000-0003-0634-4456, Gales, M.J.F. et al. (3 more authors) (2015) Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages. In: INTERSPEECH 2015 : 16th Annual Conference of the International Speech Communication Association. INTERSPEECH 2015 : 16th Annual Conference of the International Speech Communication Association, 06-10 Sep 2015, Dresden, Germany. International Speech Communication Association (ISCA) , pp. 3660-3664.
Gales, M.J.F., Knill, K.M. and Ragni, A. orcid.org/0000-0003-0634-4456 (2015) Unicode-based graphemic systems for limited resource languages. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ICASSP 2015 - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19-24 Apr 2015, Brisbane, QLD, Australia. IEEE . ISBN 9781467369978
Ragni, A. orcid.org/0000-0003-0634-4456, Gales, M.J.F. and Knill, K.M. (2015) A language space representation for speech recognition. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19-24 Apr 2015, Brisbane, QLD, Australia. IEEE , pp. 4634-4638. ISBN 9781467369978
Rath, S.P., Knill, K.M., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (1 more author) (2014) Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages. In: INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association. INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association, 14-18 Sep 2014, Singapore. International Speech Communication Association (ISCA) , pp. 835-839.
Ragni, A. orcid.org/0000-0003-0634-4456, Knill, K.M., Rath, S.P. et al. (1 more author) (2014) Data augmentation for low resource languages. In: INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association. INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association, 14-18 Sep 2014, Singapore. International Speech Communication Association (ISCA) , pp. 810-814.
Knill, K.M., Gales, M.J.F., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (1 more author) (2014) Language independent and unsupervised acoustic models for speech recognition and keyword spotting. In: INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association. INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association, 14-18 Sep 2014, Singapore. International Speech Communication Association (ISCA) .
Yoshioka, T., Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2014) Investigation of unsupervised adaptation of DNN acoustic models with filter bank input. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04-09 May 2014, Florence, Italy. IEEE , pp. 6344-6348. ISBN 9781479928934
Gales, M.J.F., Knill, K.M., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (1 more author) (2014) Speech recognition and keyword spotting for low-resource languages : Babel project research at CUED. In: Fourth International Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU-2014). Fourth International Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU-2014), 14-16 May 2014, St. Petersburg, Russia. International Speech Communication Association (ISCA) , pp. 16-23.
van Dalen, R.C., Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2013) Efficient decoding with generative score-spaces using the expectation semiring. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26-31 May 2013, Vancouver, BC, Canada. IEEE , pp. 7619-7623. ISBN 9781479903566
Gales, M.J.F., Ragni, A. orcid.org/0000-0003-0634-4456, Zhang, A. et al. (1 more author) (2012) Structured discriminative models for speech recognition. In: Symposium on Machine Learning in Speech and Language Processing (MLSLP). Symposium on Machine Learning in Speech and Language Processing (MLSLP), 14 Sep 2012, Portland, Oregon, USA. International Speech Communication Association .
Roupakia, Z., Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2012) Rapid nonlinear speaker adaptation for large-vocabulary continuous speech recognition. In: INTERSPEECH 2012 : 13th Annual Conference of the International Speech Communication Association. INTERSPEECH 2012 : 13th Annual Conference of the International Speech Communication Association, 09-13 Sep 2012, Portland, OR, USA. International Speech Communication Association (ISCA) , pp. 1784-1787.
Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2012) Inference algorithms for generative score-spaces. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 25-30 Mar 2012, Kyoto, Japan. IEEE , pp. 4149-4152. ISBN 9781467300452
Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2012) Derivative kernels for noise robust ASR. In: IEEE Workshop on Automatic Speech Recognition & Understanding. IEEE Workshop on Automatic Speech Recognition & Understanding, 11-15 Dec 2011, Waikoloa, HI, USA. IEEE , pp. 119-124. ISBN 9781467303651
Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2011) Structured discriminative models for noise robust continuous speech recognition. In: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 22-27 May 2011, Prague, Czech Republic. IEEE , pp. 4788-4791. ISBN 9781457705380
Gales, M.J.F., Ragni, A. orcid.org/0000-0003-0634-4456, AlDamarki, H. et al. (1 more author) (2010) Support vector machines for noise robust ASR. In: 2009 IEEE Workshop on Automatic Speech Recognition & Understanding. IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), 13-17 Dec 2009, Merano, Italy. IEEE , pp. 205-210. ISBN 9781424454785
Ragni, A. orcid.org/0000-0003-0634-4456 (2007) Initial experiments with Estonian speech recognition. In: Nivre, J., Kaalep, H.-J., Muischnek, K. and Koit, M., (eds.) Proceedings of the 16th Nordic Conference of Computational Linguistics (NODALIDA 2007). 16th Nordic Conference of Computational Linguistics (NODALIDA 2007), 25-26 May 2007, Tartu, Estonia. University of Tartu, Estonia , pp. 249-252. ISBN 9789985405147
Preprint
Li, Y., Yuan, R., Zhang, G. et al. (15 more authors) (2024) MERT: Acoustic music understanding model with large-scale self-supervised training. [Preprint] (Submitted)
Flynn, R. and Ragni, A. orcid.org/0000-0003-0634-4456 (2023) How much context does my attention-based ASR system need? [Preprint] (Submitted)