Unsupervised Word Alignment and Part of Speech Induction with Undirected Models
If you have a question about this talk, please contact Thomas Lippincott.

This talk explores unsupervised learning in undirected graphical models for two problems in natural language processing. Undirected models can incorporate arbitrary, non-independent features computed over random variables, thereby overcoming the inherent limitation of directed models, which require that features factor according to the conditional independencies of an acyclic generative process. Using word alignment (finding lexical correspondences in parallel texts) and bilingual part-of-speech induction (jointly learning syntactic categories for two languages from parallel data) as case studies, we show that relaxing the acyclicity requirement lets us formulate more succinct models that make fewer counterintuitive independence assumptions.

Experiments confirm that our undirected alignment model yields consistently better performance than directed model baselines, according to both intrinsic and extrinsic measures. For part-of-speech (POS) tagging, the results are more tentative: analysis reveals that our parameter learner tends to get caught in shallow local optima corresponding to poor tagging solutions. Switching to an alternative learning objective (contrastive estimation; Smith and Eisner, 2005) improves stability and performance, but these findings suggest that non-convex objectives may be a larger problem in undirected models than in directed ones.

This talk is part of the NLIP Seminar Series.