|COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring.|
The effect of normalization -- a case study in speech synthesis
If you have a question about this talk, please contact Shakir Mohamed.
Undirected graphical models are ubiquitous in application domains of machine learning. However the normalization constants in these models are often difficult to compute, and as a result are frequently dropped altogether. In this talk we’ll look at the qualitative effect of this lack of normalization in the domain of statistical speech synthesis.
Specifically we’ll compare the predictive distributions of the standard unnormalized speech synthesis model, its globally-normalized undirected counterpart, and a more tractable directed graphical model. Along the way, we’ll highlight some of the general issues surrounding the choice between undirected and directed graphical models for sequence data.
The introduction to speech synthesis I will give will be aimed entirely at machine learners with no background in modelling speech, and will I hope be realistic (close to state-of-the-art), self-contained and framed in terms of probabilistic modelling.
This talk is part of the Machine Learning Reading Group @ CUED series.
This talk is included in these lists:
Note that ex-directory lists are not shown.
Other listsAsian Archaeology Group CRASSH lectures Machine Learning Summary
Other talks‘Vinorelbine In Mesothelioma trial: the rationale’ Molecular Motors and Switches at Surfaces The virus host and virus genetics of severe influenza infection The properties of silica glass relevant to their use in ballistics applications The Topsy-Turvy Arctic: Navigating the Polar Airspace The 2015 Ageing Summit