Accurate CCG Parsing with Approximate Language Intersection and Task-specific Optimization
- Speaker: Michael Auli
- Date & Time: Friday 06 May 2011, 12:00 - 13:00
- Venue: FW26, Computer Laboratory
Abstract
Combinatory Categorial Grammar (CCG) parsing is a longstanding problem in computational linguistics, due to the complexities associated with its mild context-sensitivity. Via an oracle experiment, we show that the upper bound on accuracy of a CCG parser is significantly lowered when its search space is pruned using a supertagger, though the supertagger also prunes many bad parses.
Inspired by this analysis, we design a single model with both supertagging and parsing features, rather than separating them into distinct models chained together in a pipeline. To overcome the resulting complexity, we experiment with two approximation algorithms for language intersection: loopy belief propagation and dual decomposition.
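As a rough illustration of the dual decomposition idea mentioned above, the following toy sketch makes two models agree on one tag sequence by passing Lagrange multipliers between them. It is not the speaker's implementation: the chain model standing in for a supertagger, the position-wise model standing in for a parser, and all names and scores (`viterbi`, `dual_decompose`, `chain_unary`, `trans`, `local`) are assumptions made for this example.

```python
def viterbi(unary, trans):
    """Best tag sequence under a chain model: unary[i][t] plus trans[prev][t]."""
    n, k = len(unary), len(unary[0])
    score = [list(unary[0])]
    back = []
    for i in range(1, n):
        row, ptr = [], []
        for t in range(k):
            best_prev = max(range(k), key=lambda p: score[-1][p] + trans[p][t])
            row.append(score[-1][best_prev] + trans[best_prev][t] + unary[i][t])
            ptr.append(best_prev)
        score.append(row)
        back.append(ptr)
    # Follow back-pointers from the best final tag.
    t = max(range(k), key=lambda s: score[-1][s])
    tags = [t]
    for ptr in reversed(back):
        t = ptr[t]
        tags.append(t)
    return tags[::-1]


def dual_decompose(chain_unary, trans, local, iterations=50, rate=1.0):
    """Subgradient dual decomposition over two models sharing one tag sequence.

    Returns (tags, converged). If the two subproblems agree, the shared
    solution maximises the sum of both models' scores.
    """
    n, k = len(local), len(local[0])
    u = [[0.0] * k for _ in range(n)]  # Lagrange multipliers, one per (position, tag)
    ya = []
    for it in range(iterations):
        # Subproblem A: chain model with scores shifted by +u.
        shifted = [[chain_unary[i][t] + u[i][t] for t in range(k)] for i in range(n)]
        ya = viterbi(shifted, trans)
        # Subproblem B: position-wise model with scores shifted by -u.
        yb = [max(range(k), key=lambda t: local[i][t] - u[i][t]) for i in range(n)]
        if ya == yb:
            return ya, True
        # Subgradient step on the dual, with a decaying step size.
        step = rate / (it + 1)
        for i in range(n):
            if ya[i] != yb[i]:
                u[i][ya[i]] -= step
                u[i][yb[i]] += step
    return ya, False


# Hypothetical scores for a 3-word sentence with 2 candidate tags per word.
chain_unary = [[1.0, 0.0], [0.0, 0.5], [0.2, 0.0]]
trans = [[0.5, 0.0], [0.0, 0.5]]
local = [[0.0, 0.3], [0.0, 1.0], [1.5, 0.0]]
tags, converged = dual_decompose(chain_unary, trans, local)
print(tags, converged)
```

Each subproblem is solved exactly by its own algorithm; only the multipliers `u` couple them, which is what lets the combined model avoid a search over the full intersected space.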
The second part of this talk deals with task-specific optimisation of parsing models. We adopt the softmax-margin training objective, which minimises a bound on expected risk for a given loss function but requires the loss to decompose over the predicted structure; this is not true of F-measure. We present a novel dynamic programming algorithm that allows us to use softmax-margin with F-measure, leading to substantial gains in accuracy on CCGbank.
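For orientation, the softmax-margin objective can be written as below. This is the standard published form rather than anything stated in the abstract, and the symbols ($\theta$ for weights, $f$ for features, $\ell$ for the loss, $\mathcal{Y}(x_i)$ for the candidate parses of sentence $x_i$) are notation assumed here.

```latex
% Softmax-margin: the conditional log-likelihood objective with the loss
% added inside the partition function.
\min_{\theta} \sum_{i}
  \left[ -\theta^{\top} f(x_i, y_i)
         + \log \sum_{y \in \mathcal{Y}(x_i)}
           \exp\!\left( \theta^{\top} f(x_i, y) + \ell(y_i, y) \right) \right]

% F-measure over sets of predicted (y) and gold (y_i) dependencies:
F_1(y_i, y) = \frac{2\,\lvert y \cap y_i \rvert}{\lvert y \rvert + \lvert y_i \rvert}
```

The inner sum is computed by dynamic programming only if $\ell(y_i, y)$ is a sum of terms local to parts of $y$. The F-measure is a ratio of counts taken over the whole structure, so it does not split into such local terms.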
Each of the presented methods improves over the state of the art. Moreover, the improvements are additive, and combined they obtain the best reported results on this task. Our algorithms are general, and we expect them to apply to other parsing problems, including lexicalized tree adjoining grammar and context-free grammar.
This talk is part of the NLIP Seminar Series.