COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Natural Language Processing Reading Group > Unsupervised Multilingual Learning for Morphological Segmentation
Unsupervised Multilingual Learning for Morphological SegmentationAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Diarmuid Ó Séaghdha. At this session of the NLIP Reading Group we’ll be discussing the following paper: Benjamin Snyder and Regina Barzilay. 2008. Unsupervised Multilingual Learning for Morphological Segmentation. In Proceedings of ACL -08. Abstract: For centuries, the deep connection between languages has brought about major discoveries about human communication. In this paper we investigate how this powerful source of information can be exploited for unsupervised language learning. In particular, we study the task of morphological segmentation of multiple languages. We present a nonparametric Bayesian model that jointly induces morpheme segmentations of each language under consideration and at the same time identifies cross-lingual morpheme patterns, or abstract morphemes. We apply our model to three Semitic languages: Arabic, Hebrew, Aramaic, as well as to English. Our results demonstrate that learning morphological models in tandem reduces error by up to 24% relative to monolingual models. Furthermore, we provide evidence that our joint model achieves better performance when applied to languages from the same family. This talk is part of the Natural Language Processing Reading Group series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsEmmy Noether Society Soul Food EPRG Public EventsOther talksStereodivergent Catalysis, Strategies and Tactics Towards Secondary Metabolites as enabling tools for the Study of Natural Products Biology Statistical analysis of biotherapeutic datasets to facilitate early ‘Critical Quality Attribute’ characterization. Laser Printed Organic Electronics, Metal-Organic Framework - Polymer Nanofiber Composites for Gas Separation Kidney cancer: the most lethal urological malignancy Putting Feminist New Materialism to work through affective methodologies in early childhood research Grammar Variational Autoencoder Throwing light on organocatalysis: new opportunities in enantioselective synthesis “Modulating Tregs in Cancer and Autoimmunity” Singularities of Hermitian-Yang-Mills connections and the Harder-Narasimhan-Seshadri filtration Genomic Approaches to Cancer To be confirmed Ethics for the working mathematician, seminar 10: Mathematicians being leaders. |