University of Cambridge > Talks.cam > Cambridge Centre for Analysis talks > On modern techniques for parallel waveform generation of speech

On modern techniques for parallel waveform generation of speech

Add to your list(s) Download to your calendar using vCal

  • UserLorenzo Foglianti, Papercup
  • ClockTuesday 05 February 2019, 16:00-17:00
  • HouseMR5.

If you have a question about this talk, please contact CCA.

Industrial seminar

At Papercup, we aim to translate the world’s content. What this means in practice is to translate audio from an input language to an output language. In this talk, we will focus on what we consider the most interesting part of this problem, which is the function mapping text to audio. Over the past few years, Machine Learning research has made a giant leap forward in the quality of the synthesised audio compared to more traditional methods. However, these methods are inherently autoregressive and therefore cannot be parallelised on modern machines. Because of this, these methods can rarely be deployed in practice. Hence, the synthesis time is limited by the nature of the model, rather than the hardware. In this talk, we present a new class of models, called Flows, which allows us to generate audio in a non autoregressive way. We will also show sample audio synthesised by state of the art models.

This talk is part of the Cambridge Centre for Analysis talks series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity