Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

On modern techniques for parallel waveform generation of speech

Add to your list(s) Download to your calendar using vCal

Lorenzo Foglianti, Papercup
Tuesday 05 February 2019, 16:00-17:00
MR5.

If you have a question about this talk, please contact CCA.

Industrial seminar

At Papercup, we aim to translate the world’s content. What this means in practice is to translate audio from an input language to an output language. In this talk, we will focus on what we consider the most interesting part of this problem, which is the function mapping text to audio. Over the past few years, Machine Learning research has made a giant leap forward in the quality of the synthesised audio compared to more traditional methods. However, these methods are inherently autoregressive and therefore cannot be parallelised on modern machines. Because of this, these methods can rarely be deployed in practice. Hence, the synthesis time is limited by the nature of the model, rather than the hardware. In this talk, we present a new class of models, called Flows, which allows us to generate audio in a non autoregressive way. We will also show sample audio synthesised by state of the art models.

This talk is part of the Cambridge Centre for Analysis talks series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

On modern techniques for parallel waveform generation of speech

This talk is included in these lists:

Other lists

Other talks