COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Natural Language Processing Reading Group > Unsupervised Morphological Segmentation with Log-Linear Models
Unsupervised Morphological Segmentation with Log-Linear ModelsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Diarmuid Ó Séaghdha. At this session of the NLIP Reading Group we’ll be discussing the following paper: Hoifung Poon, Colin Cherry and Kristina Toutanova. 2009. Unsupervised Morphological Segmentation with Log-Linear Models. In Proceedings of NAACL -09. Abstract: Morphological segmentation breaks words into morphemes (the basic semantic units). It is a key component for natural language processing systems. Unsupervised morphological segmentation is attractive, because in every language there are virtually unlimited supplies of text, but very few labeled resources. However, most existing model-based systems for unsupervised morphological segmentation use directed generative models, making it difficult to leverage arbitrary overlapping features that are potentially helpful to learning. In this paper, we present the first log-linear model for unsupervised morphological segmentation. Our model uses overlapping features such as morphemes and their contexts, and incorporates exponential priors inspired by the minimum description length (MDL) principle. We present efficient algorithms for learning and inference by combining contrastive estimation with sampling. Our system, based on monolingual features only, outperforms a state-of-the-art system by a large margin, even when the latter uses bilingual information such as phrasal alignment and phonetic correspondence. On the Arabic Penn Treebank, our system reduces F1 error by 11% compared to Morfessor. This talk is part of the Natural Language Processing Reading Group series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsType the title of a new list here Women of Mathematics throughout Europe Cambridge Forum of Science and HumanitiesOther talksMigration in Science My ceramic practice, and Moon Jars for the 21st century Beacon Salon # 8 The Dawn of the Antibiotic Age Environmental shocks and demographic consequences in England: 1280-1325 and 1580-1640 compared A domain-decomposition-based model reduction method for convection-diffusion equations with random coefficients Recent developments and debates in East Asian monsoon palaeoclimatology Towards bulk extension of near-horizon geometries ***PLEASE NOTE THIS SEMINAR IS CANCELLED*** Sustainability of livestock production: water, welfare and woodland Single Cell Seminars (September) Coatable photovoltaics (Title t o be confirmed) Foster Talk - CANCELLED - Redox Oscillations in the Circadian Clockwork |