Items where authors include "Ragni, A."

Number of items: 61.

Article

Mogridge, R. orcid.org/0000-0002-5686-070X and Ragni, A. orcid.org/0000-0003-0634-4456 (2026) Minerva 2 for speech and language tasks. Computer Speech & Language, 95. 101843. ISSN 0885-2308

Chen, X., Liu, X., Wang, Y. et al. (3 more authors) (2019) Exploiting future word contexts in neural network language models for speech recognition. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 27 (9). pp. 1444-1454. ISSN 2329-9290

Wu, C., Gales, M.J.F., Ragni, A. et al. (2 more authors) (2018) Improving interpretability and regularization in deep learning. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26 (2). pp. 256-265. ISSN 2329-9290

Proceedings Paper

Cross, M. and Ragni, A. (2025) Flowing straighter with conditional flow matching for accurate speech enhancement. In: Proceedings of the 2nd ECAI Workshop on "Machine Learning Meets Differential Equations: From Theory to Applications". 2nd ECAI Workshop on "Machine Learning Meets Differential Equations: From Theory to Applications", 25-30 Oct 2025, Bologna, Italy. Proceedings of Machine Learning Research , pp. 121-132.

Que, S. and Ragni, A. orcid.org/0000-0003-0634-4456 (2025) VisualSpeech: Enhancing Prosody Modeling in TTS Using Video. In: Scharenborg, O., Oertel, C. and Truong, K., (eds.) Proceedings of Interspeech 2025. Interspeech 2025, 17-21 Aug 2025, Rotterdam, The Netherlands. International Speech Communication Association (ISCA) , pp. 3778-3782.

Sun, W. and Ragni, A. (2025) Score-based training for energy-based TTS models. In: Interspeech 2025. Interspeech 2025, 17-21 Aug 2025, Rotterdam, The Netherlands. ISCA , pp. 5528-5532.

Leung, W.-Z. orcid.org/0009-0003-4888-1951, Cross, M., Ragni, A. et al. (1 more author) (2024) Training data augmentation for dysarthric automatic speech recognition by text-to-dysarthric-speech synthesis. In: Proceedings of Interspeech 2024. Interspeech 2024, 01-05 Sep 2024, Kos island, Greece. International Speech Communication Association (ISCA) , pp. 2494-2498.

Sun, W., Tu, Z. and Ragni, A. orcid.org/0000-0003-0634-4456 (2024) Energy-based models for speech synthesis. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024), 14-19 Apr 2024, COEX, Seoul, Korea. Institute of Electrical and Electronics Engineers (IEEE) , pp. 12667-12671. ISBN 979-8-3503-4486-8

Mogridge, R., Close, G., Sutherland, R. et al. (4 more authors) (2024) Non-intrusive speech intelligibility prediction for hearing-impaired users using intermediate ASR features and human memory models. In: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024), 14-19 Apr 2024, Seoul, Korea. Institute of Electrical and Electronics Engineers (IEEE) , pp. 306-310. ISBN 979-8-3503-4486-8

Cross, M. and Ragni, A. orcid.org/0000-0003-0634-4456 (2024) What happens to diffusion model likelihood when your model is conditional? In: Coelho, C., Zimmering, B., Fernanda, M., Costa, P., Ferras, L.L. and Niggemann, O., (eds.) Proceedings of Machine Learning Research. 1st ECAI Workshop on “Machine Learning Meets Differential Equations: From Theory to Applications”, 20 Oct 2024, Santiago de Compostela, Spain. Proceedings of Machine Learning Research , pp. 1-14.

Yuan, R., Ma, Y., Li, Y. et al. (22 more authors) (2023) MARBLE: Music Audio Representation Benchmark for Universal Evaluation. In: Advances in Neural Information Processing Systems (NeurIPS 2023). 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 10-16 Dec 2023, New Orleans, USA. Neural Information Processing Systems Foundation, Inc. (NeurIPS) .

Ma, Y., Yuan, R., Li, Y. et al. (12 more authors) (2023) On the effectiveness of speech self-supervised learning for music. In: Sarti, A., Antonacci, F., Sandler, M., Bestagini, P., Dixon, S., Liang, B., Richard, G. and Pauwels, J., (eds.) ISMIR 2023: 24th International Society for Music Information Retrieval Conference proceedings. 24th International Society for Music Information Retrieval Conference (ISMIR 2023), 05-09 Nov 2023, Milan, Italy. International Society for Music Information Retrieval (ISMIR) , pp. 457-465. ISBN 978-1-7327299-3-3

Nomo Sudro, P., Ragni, A. and Hain, T. orcid.org/0000-0003-0939-3464 (2023) Adapting pretrained models for adult to child voice conversion. In: 2023 31st European Signal Processing Conference (EUSIPCO) Proceedings. 2023 31st European Signal Processing Conference (EUSIPCO), 04-08 Sep 2023, Helsinki, Finland. Institute of Electrical and Electronics Engineers (IEEE) , pp. 271-275. ISBN 9789464593600

Flynn, R. and Ragni, A. orcid.org/0000-0003-0634-4456 (2023) Leveraging cross-utterance context for ASR decoding. In: Proceedings of Interspeech 2023. INTERSPEECH 2023, 20-24 Aug 2023, Dublin, Ireland. ISCA - International Speech Communication Association , pp. 1359-1363.

Nicholls, D., Knill, K., Gales, M.J.F. et al. (2 more authors) (2023) Speak & improve: L2 English speaking practice tool. In: Proceedings of Interspeech 2023. INTERSPEECH 2023, 20-24 Aug 2023, Dublin, Ireland. International Speech Communication Association (ISCA) , pp. 3669-3670.

Li, Y., Yuan, R., Zhang, G. et al. (11 more authors) (2022) LV-49: MAP-Music2Vec: A Simple and Effective Baseline for Self-Supervised Music Audio Representation Learning. In: 23rd International Society for Music Information Retrieval Conference (ISMIR 2022). 23rd International Society for Music Information Retrieval Conference (ISMIR 2022), 04-08 Dec 2022, Bengaluru, India. International Society for Music Information Retrieval (ISMIR) .

Li, Y., Zhang, G., Yang, B. et al. (4 more authors) (2022) HERB: Measuring hierarchical regional bias in pre-trained language models. In: He, Y., Ji, H., Liu, Y., Li, S. and Chang, C.-H., (eds.) Findings of the Association for Computational Linguistics: AACL-IJCNLP 2022. The 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 20-23 Nov 2022, Online. Association for Computational Linguistics , pp. 334-346. ISBN 9781959429043

Li, Q., Ness, P.M., Ragni, A. et al. (1 more author) (2019) Bi-directional lattice recurrent neural networks for confidence estimation. In: ICASSP 2019. ICASSP 2019, 12-17 May 2019, Brighton, UK. IEEE , pp. 6755-6759. ISBN 9781479981311

Ragni, A., Li, Q., Gales, M.J.F. et al. (1 more author) (2019) Confidence estimation and deletion prediction using bidirectional recurrent neural networks. In: 2018 IEEE Spoken Language Technology Workshop (SLT). IEEE Spoken Language Technology Workshop (SLT), 18-21 Dec 2018, Athens, Greece. IEEE , pp. 204-211. ISBN 9781538643358

Wang, Y., Wong, J.H.M., Gales, M.J.F. et al. (2 more authors) (2018) Sequence teacher-student training of acoustic models for automatic free speaking language assessment. In: 2018 IEEE Spoken Language Technology Workshop (SLT). 2018 IEEE Spoken Language Technology Workshop (SLT), 18-21 Dec 2018, Athens, Greece. IEEE . ISBN 9781538643358

Wang, Y., Chen, X., Gales, M.J.F. et al. (2 more authors) (2018) Phonetic and graphemic systems for multi-genre broadcast transcription. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ICASSP 2018 - Signal Processing and Artificial Intelligence: Changing the World, 15-20 Apr 2018, Calgary, AB, Canada. IEEE . ISBN 9781538646595

Chen, O., Ragni, A. orcid.org/0000-0003-0634-4456, Gales, M. et al. (1 more author) (2018) Active memory networks for language modeling. In: Proceedings of Interspeech 2018. Interspeech 2018, 02-06 Sep 2018, Hyderabad, India. International Speech Communication Association (ISCA) , pp. 3338-3342.

Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M. (2018) Automatic speech recognition system development in the "wild". In: Interspeech 2018. Interspeech 2018, 02-06 Sep 2018, Hyderabad, India. International Speech Communication Association (ISCA) , pp. 2217-2221.

Knill, K., Gales, M., Kyriakopoulos, K. et al. (4 more authors) (2018) Impact of ASR performance on free speaking language assessment. In: Interspeech 2018. Interspeech 2018, 02-06 Sep 2018, Hyderabad, India. International Speech Communication Association (ISCA) , pp. 1641-1645.

Chen, X., Liu, X., Ragni, A. et al. (2 more authors) (2017) Future word contexts in neural network language models. In: 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU). 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 16-20 Dec 2017, Okinawa, Japan. IEEE . ISBN 9781509047895

Chen, X., Ragni, A., Liu, X. et al. (1 more author) (2017) Investigating bidirectional recurrent neural network language models for speech recognition. In: Proceedings of Interspeech 2017. Interspeech 2017, 20-24 Aug 2017, Stockholm, Sweden. International Speech Communication Association (ISCA) , pp. 269-273.

Knill, K.M., Gales, M.J.F., Kyriakopoulos, K. et al. (2 more authors) (2017) Use of graphemic lexicons for spoken language assessment. In: Proceedings of Interspeech 2017. Interspeech 2017, 20-24 Aug 2017, Stockholm, Sweden. International Speech Communication Association (ISCA) , pp. 2774-2778.

Gales, M.J.F., Knill, K.M. and Ragni, A. (2017) Low-resource speech recognition and keyword-spotting. In: Karpov, A., Potapova, R. and Mporas, I., (eds.) Speech and Computer : 19th International Conference, SPECOM 2017. 19th International Conference, SPECOM 2017, 12-16 Sep 2017, Hatfield, UK. Springer International Publishing , pp. 3-19. ISBN 9783319664286

Malinin, A., Knill, K., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (2 more authors) (2017) An attention based model for off-topic spontaneous spoken response detection : an initial study. In: Engwall, O. and Lopes, J.D., (eds.) 7th ISCA Workshop on Speech and Language Technology in Education (SLaTE). 7th ISCA Workshop on Speech and Language Technology in Education (SLaTE), 25-26 Aug 2017, Stockholm, Sweden. SLaTE Conference Proceedings . ISCA , pp. 144-149.

Malinin, A., Ragni, A. orcid.org/0000-0003-0634-4456, Knill, K. et al. (1 more author) (2017) Incorporating uncertainty into deep learning for spoken language assessment. In: Barzilay, R. and Kan, M.-Y., (eds.) Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2 : Short Papers). 55th Annual Meeting of the Association for Computational Linguistics, 30 Jul - 04 Aug 2017, Vancouver, Canada. Association for Computational Linguistics . ISBN 9781945626760

Ragni, A. orcid.org/0000-0003-0634-4456, Saunders, D., Zahemszky, P. et al. (3 more authors) (2017) Morph-to-word transduction for accurate and efficient automatic speech recognition and keyword search. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 05-09 Mar 2017, New Orleans, LA, USA. IEEE , pp. 5770-5774. ISBN 9781509041183

Chen, X., Ragni, A. orcid.org/0000-0003-0634-4456, Vasilakes, J. et al. (3 more authors) (2017) Recurrent neural network language models for keyword search. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 05-09 Mar 2017, New Orleans, LA, USA. IEEE , pp. 5775-5779. ISBN 9781509041183

Ragni, A. orcid.org/0000-0003-0634-4456, Wu, C., Gales, M.J.F. et al. (2 more authors) (2017) Stimulated training for automatic speech recognition and keyword search in limited resource conditions. In: 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 05-09 Mar 2017, New Orleans, LA, USA. IEEE , pp. 4830-4834. ISBN 9781509041183

Yang, J., Ragni, A. orcid.org/0000-0003-0634-4456, Gales, M.J.F. et al. (1 more author) (2016) Log-linear system combination using structured support vector machines. In: Interspeech 2016. Interspeech 2016, 08-12 Sep 2016, San Francisco, CA, USA. International Speech Communication Association (ISCA) .

Ragni, A. orcid.org/0000-0003-0634-4456, Dakin, E., Chen, X. et al. (2 more authors) (2016) Multi-language neural network language models. In: Interspeech 2016. Interspeech 2016, 08-12 Sep 2016, San Francisco, CA, USA. International Speech Communication Association (ISCA) .

Yang, J., Zhang, C., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (2 more authors) (2016) System combination with log-linear models. In: 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 20-25 Mar 2016, Shanghai, China. IEEE . ISBN 9781479999880

Cui, J., Kingsbury, B., Ramabhadran, B. et al. (16 more authors) (2016) Multilingual representations for low resource speech recognition and keyword search. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ, USA. IEEE , pp. 259-266. ISBN 9781479972913

van Dalen, R.C., Yang, J., Wang, H. et al. (3 more authors) (2016) Structured discriminative models using deep neural-network features. In: 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU). IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17 Dec 2015, Scottsdale, AZ, USA. IEEE , pp. 160-166. ISBN 9781479972913

Mendels, G., Cooper, E., Soto, V. et al. (5 more authors) (2015) Improving speech recognition and keyword search for low resource languages using web data. In: INTERSPEECH 2015 : 16th Annual Conference of the International Speech Communication Association. INTERSPEECH 2015 : 16th Annual Conference of the International Speech Communication Association, 06-10 Sep 2015, Dresden, Germany. International Speech Communication Association (ISCA) , pp. 829-833.

Wang, H., Ragni, A. orcid.org/0000-0003-0634-4456, Gales, M.J.F. et al. (3 more authors) (2015) Joint decoding of tandem and hybrid systems for improved keyword spotting on low resource languages. In: INTERSPEECH 2015 : 16th Annual Conference of the International Speech Communication Association. INTERSPEECH 2015 : 16th Annual Conference of the International Speech Communication Association, 06-10 Sep 2015, Dresden, Germany. International Speech Communication Association (ISCA) , pp. 3660-3664.

Gales, M.J.F., Knill, K.M. and Ragni, A. orcid.org/0000-0003-0634-4456 (2015) Unicode-based graphemic systems for limited resource languages. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). ICASSP 2015 - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19-24 Apr 2015, Brisbane, QLD, Australia. IEEE . ISBN 9781467369978

Ragni, A. orcid.org/0000-0003-0634-4456, Gales, M.J.F. and Knill, K.M. (2015) A language space representation for speech recognition. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 19-24 Apr 2015, Brisbane, QLD, Australia. IEEE , pp. 4634-4638. ISBN 9781467369978

Rath, S.P., Knill, K.M., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (1 more author) (2014) Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages. In: INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association. INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association, 14-18 Sep 2014, Singapore. International Speech Communication Association (ISCA) , pp. 835-839.

Ragni, A. orcid.org/0000-0003-0634-4456, Knill, K.M., Rath, S.P. et al. (1 more author) (2014) Data augmentation for low resource languages. In: INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association. INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association, 14-18 Sep 2014, Singapore. International Speech Communication Association (ISCA) , pp. 810-814.

Knill, K.M., Gales, M.J.F., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (1 more author) (2014) Language independent and unsupervised acoustic models for speech recognition and keyword spotting. In: INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association. INTERSPEECH 2014 : 15th Annual Conference of the International Speech Communication Association, 14-18 Sep 2014, Singapore. International Speech Communication Association (ISCA) .

Yoshioka, T., Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2014) Investigation of unsupervised adaptation of DNN acoustic models with filter bank input. In: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 04-09 May 2014, Florence, Italy. IEEE , pp. 6344-6348. ISBN 9781479928934

Gales, M.J.F., Knill, K.M., Ragni, A. orcid.org/0000-0003-0634-4456 et al. (1 more author) (2014) Speech recognition and keyword spotting for low-resource languages : Babel project research at CUED. In: Fourth International Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU-2014). Fourth International Workshop on Spoken Language Technologies for Under-Resourced Languages (SLTU-2014), 14-16 May 2014, St. Petersburg, Russia. International Speech Communication Association (ISCA) , pp. 16-23.

van Dalen, R.C., Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2013) Efficient decoding with generative score-spaces using the expectation semiring. In: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, 26-31 May 2013, Vancouver, BC, Canada. IEEE , pp. 7619-7623. ISBN 9781479903566

Gales, M.J.F., Ragni, A. orcid.org/0000-0003-0634-4456, Zhang, A. et al. (1 more author) (2012) Structured discriminative models for speech recognition. In: Symposium on Machine Learning in Speech and Language Processing (MLSLP). Symposium on Machine Learning in Speech and Language Processing (MLSLP), 14 Sep 2012, Portland, Oregon, USA. International Speech Communication Association .

Roupakia, Z., Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2012) Rapid nonlinear speaker adaptation for large-vocabulary continuous speech recognition. In: INTERSPEECH 2012 : 13th Annual Conference of the International Speech Communication Association. INTERSPEECH 2012 : 13th Annual Conference of the International Speech Communication Association, 09-13 Sep 2012, Portland, OR, USA. International Speech Communication Association (ISCA) , pp. 1784-1787.

Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2012) Inference algorithms for generative score-spaces. In: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 25-30 Mar 2012, Kyoto, Japan. IEEE , pp. 4149-4152. ISBN 9781467300452

Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2012) Derivative kernels for noise robust ASR. In: IEEE Workshop on Automatic Speech Recognition & Understanding. IEEE Workshop on Automatic Speech Recognition & Understanding, 11-15 Dec 2011, Waikoloa, HI, USA. IEEE , pp. 119-124. ISBN 9781467303651

Ragni, A. orcid.org/0000-0003-0634-4456 and Gales, M.J.F. (2011) Structured discriminative models for noise robust continuous speech recognition. In: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 22-27 May 2011, Prague, Czech Republic. IEEE , pp. 4788-4791. ISBN 9781457705380

Gales, M.J.F., Ragni, A. orcid.org/0000-0003-0634-4456, AlDamarki, H. et al. (1 more author) (2010) Support vector machines for noise robust ASR. In: 2009 IEEE Workshop on Automatic Speech Recognition & Understanding. IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), 13-17 Dec 2009, Merano, Italy. IEEE , pp. 205-210. ISBN 9781424454785

Ragni, A. orcid.org/0000-0003-0634-4456 (2007) Initial experiments with Estonian speech recognition. In: Nivre, J., Kaalep, H.-J., Muischnek, K. and Koit, M., (eds.) Proceedings of the 16th Nordic Conference of Computational Linguistics (NODALIDA 2007). 16th Nordic Conference of Computational Linguistics (NODALIDA 2007), 25-26 May 2007, Tartu, Estonia. University of Tartu, Estonia , pp. 249-252. ISBN 9789985405147

Preprint

Tan, X., Zhao, M. and Ragni, A. (2025) Discrete-time diffusion-like models for speech synthesis. [Preprint] (Submitted)

Bartley, C. and Ragni, A. (2025) How I built ASR for endangered languages with a spoken dictionary. [Preprint] (Submitted)

Cassini, S., Hain, T. and Ragni, A. orcid.org/0000-0003-0634-4456 (2025) Emphasis sensitivity in speech representations. [Preprint] (Submitted)

Sun, W. and Ragni, A. orcid.org/0000-0003-0634-4456 (2025) Score-based training for energy-based TTS models. [Preprint] (Submitted)

Li, Y., Yuan, R., Zhang, G. et al. (15 more authors) (2024) MERT: Acoustic music understanding model with large-scale self-supervised training. [Preprint] (Submitted)

Flynn, R. and Ragni, A. orcid.org/0000-0003-0634-4456 (2023) How much context does my attention-based ASR system need? [Preprint] (Submitted)

This list was generated on Sun Jan 18 19:05:31 2026 GMT.