Discrete-time diffusion-like models for speech synthesis

This is a preprint and may not have undergone formal peer review

Tan, X., Zhao, M. and Ragni, A. (Submitted: 2025) Discrete-time diffusion-like models for speech synthesis. [Preprint - arXiv] (Submitted)

Abstract

Metadata

Item Type: Preprint
Authors/Creators:
  • Tan, X.
  • Zhao, M.
  • Ragni, A.
Copyright, Publisher and Additional Information:

© 2025 The Author(s). This preprint is made available under a Creative Commons Attribution 4.0 International License. (https://creativecommons.org/licenses/by/4.0/)

Keywords: diffusion models; flow matching; iterative process; speech synthesis
Dates:
  • Submitted: 13 October 2025
Institution: The University of Sheffield
Academic Units: The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield)
Date Deposited: 06 Jan 2026 15:34
Last Modified: 06 Jan 2026 15:34
Status: Submitted
Identification Number: 10.48550/arXiv.2509.18470
Open Archives Initiative ID (OAI ID):

Download

Export

Statistics