COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > NLIP Seminar Series > A fast and simple algorithm for training neural probabilistic language models
A fast and simple algorithm for training neural probabilistic language modelsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Ekaterina Kochmar. In spite of their superior performance, neural probabilistic language models (NPLMs) remain far less widely used than n-gram models due to their notoriously long training times, which are measured in weeks even for moderately-sized datasets. Training NPL Ms is computationally expensive because they are explicitly normalized, which leads to having to consider all words in the vocabulary when computing the log-likelihood gradients. We propose a fast and simple algorithm for training NPL Ms based on noise-contrastive estimation, a newly-introduced procedure for estimating unnormalized continuous distributions. We investigate the behaviour of the algorithm on the Penn Treebank corpus and show that it reduces the training times by more than an order of magnitude without affecting the quality of the resulting models. The algorithm is also more efficient and much more stable than importance sampling because it requires far fewer noise samples to perform well. We demonstrate the scalability of the proposed approach by training several neural language models on a 47M-word corpus with a 80K-word vocabulary, obtaining state-of-the-art results on the Microsoft Research Sentence Completion Challenge dataset. Joint work with Yee Whye Teh This talk is part of the NLIP Seminar Series series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsisrael Pathology Valedictory Seminars Peter Whittle Lecture CUEX Presents: On Foot Across China And Other Human Powered Adventures Mathematical Modeling Violence Research CentreOther talksComputing High Resolution Health(care) Treatment Centre Simulation Paracelsus' Chickens - Strange Tales from the History of Chemistry Pruning and grafting syntactic trees for cross-lingual transfer tasks Transcriptional control of pluripotent stem cell fate by the Nucleosome Remodelling and Deacetylation (NuRD) complex Molecular mechanisms of cardiomyopathies in patients with severe non-ischemic heart failure The Global Warming Sceptic Constructing the virtual fundamental cycle Cambridge - Corporate Finance Theory Symposium September 2017 - Day 2 Atiyah Floer conjecture Title to be confirmed |