COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Machine Learning @ CUED > Learning Common Grammar from Multilingual Corpus / Online Multiscale Dynamic Topic Models
Learning Common Grammar from Multilingual Corpus / Online Multiscale Dynamic Topic ModelsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Zoubin Ghahramani. I will give the following two talks. “Learning Common Grammar from Multilingual Corpus” We propose a corpus-based probabilistic framework to extract hidden common syntax across languages from non-parallel multilingual corpora in an unsupervised fashion. For this purpose, we assume a generative model for multilingual corpora, where each sentence is generated from a language dependent probabilistic context-free grammar (PCFG), and these PCF Gs are generated from a prior grammar that is common across languages. We also develop a variational method for efficient inference. Experiments on a non-parallel multilingual corpus of eleven languages demonstrate the feasibility of the proposed method. “Online Multiscale Dynamic Topic Models” We propose an online topic model for sequentially analyzing the time evolution of topics in document collections. Topics naturally evolve with multiple timescales. For example, some words may be used consistently over one hundred years, while other words emerge and disappear over periods of a few days. Thus, in the proposed model, current topic-specific distributions over words are assumed to be generated based on the multiscale word distributions of the previous epoch. Considering both the long-timescale dependency as well as the short-timescale dependency yields a more robust model. We derive efficient online inference procedures based on a stochastic EM algorithm, in which the model is sequentially updated using newly obtained data. This talk is part of the Machine Learning @ CUED series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsLogic & Semantics for Dummies ndk22's list Engineering - Mechanics Colloquia Research SeminarsOther talksUnbiased Estimation of the Eigenvalues of Large Implicit Matrices Ancient DNA studies of early modern humans and late Neanderthals Intrinsically Motivating Teachers;STIR's use of Data Driven Insight to Iterate, Pivot and (where necessary) Fail Fast Whence the force of the law? John Rawls and the course of American legal philosophy Fukushima and the Law The Digital Doctor: Hope, Hype, and Harm at the Dawn of Medicine’s Computer Age 70th Anniversary Celebration A polyfold lab report EU LIFE Lecture - "Histone Chaperones Maintain Cell Fates and Antagonize Reprogramming in C. elegans and Human Cells" A feast of languages: multilingualism in neuro-typical and atypical populations High-Dimensional Collocation for Lognormal Diffusion Problems |