Randomized Language Models via Perfect Hash Functions
- 👤 Speaker: Lin Sun
- 📅 Date & Time: Monday 22 February 2010, 12:30 - 13:30
- 📍 Venue: GS15, Computer Laboratory
Abstract
At this session of the NLIP Reading Group we’ll be discussing the following paper:
David Talbot and Thorsten Brants. 2008. Randomized Language Models via Perfect Hash Functions. In Proceedings of ACL -08.
Abstract: We propose a succinct randomized language model which employs a perfect hash function to encode fingerprints of n-grams and their associated probabilities, backoff weights, or other parameters. The scheme can represent any standard n-gram model and is easily combined with existing model reduction techniques such as entropy-pruning. We demonstrate the space-savings of the scheme via machine translation experiments within a distributed language modeling framework.
Series This talk is part of the Natural Language Processing Reading Group series.
Included in Lists
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- GS15, Computer Laboratory
- Guy Emerson's list
- Natural Language Processing Reading Group
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Lin Sun
Monday 22 February 2010, 12:30-13:30