|COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring.|
The effect of normalization -- a case study in speech synthesis
If you have a question about this talk, please contact Shakir Mohamed.
Undirected graphical models are ubiquitous in application domains of machine learning. However the normalization constants in these models are often difficult to compute, and as a result are frequently dropped altogether. In this talk we’ll look at the qualitative effect of this lack of normalization in the domain of statistical speech synthesis.
Specifically we’ll compare the predictive distributions of the standard unnormalized speech synthesis model, its globally-normalized undirected counterpart, and a more tractable directed graphical model. Along the way, we’ll highlight some of the general issues surrounding the choice between undirected and directed graphical models for sequence data.
The introduction to speech synthesis I will give will be aimed entirely at machine learners with no background in modelling speech, and will I hope be realistic (close to state-of-the-art), self-contained and framed in terms of probabilistic modelling.
This talk is part of the Machine Learning Reading Group @ CUED series.
This talk is included in these lists:
Note that ex-directory lists are not shown.
Other listsIntroduction to Fractional Calculus Gurdon Institute Talks Science, Technology and Mathematics Education (STeM)
Other talksMixing dynamics in pulsed fuel jets Inferno XXV, Purgatorio XXV, Paradiso XXV Maps and seafarers in the English Channel (eighteenth century) Making the invisible visible: The nm scale perspective 'Mutant stem cell behaviour in squamous carcinogenesis' The Geometry of Machine Translation