Latent TAG Derivations for Semantic Role Labeling
- Speaker: Anoop Sarkar, Simon Fraser University
- Date & Time: Friday 12 March 2010, 12:00 - 13:00
- Venue: SW01, Computer Laboratory
Abstract
(Joint work with Yudong Liu and Gholamreza Haffari)
Semantic Role Labeling (SRL) is a natural language processing task that aims to identify and label all the arguments for each predicate occurring in a sentence. SRL is difficult because arguments can appear in different syntactic positions relative to the predicate due to syntactic alternations. Furthermore, complex syntactic embedding can create long-distance dependencies between predicate and argument. As in other natural language learning tasks, identifying discriminative features plays an important role and all state-of-the-art SRL systems use high-quality statistical parsers as a source of features in order to identify and classify semantic roles.
In statistical parsing, the use of latent information (such as state-splitting of non-terminals in a context-free grammar) has led to substantial improvements in parsing accuracy. However, apart from the sentence simplification approach of Vickrey and Koller (2008), latent information has not been exploited for semantic role labeling. In our work, we take the output of a statistical parser and decompose the phrase structure tree into a large number of hidden tree-adjoining grammar (TAG) derivations. Each hidden, or latent, TAG derivation offers a different view of the structural dependency between the predicate and the argument.
We hypothesize that positive and negative examples of individual semantic roles can be reliably distinguished by possibly different latent TAG features. Motivated by this insight we show that latent support vector machines (LSVMs) can be used for the SRL task by exploiting these latent TAG features. In experiments on the PropBank-CoNLL 2005 data set, our method significantly outperforms the state of the art (even compared to models using global constraints or global inference over multiple parses). We show that latent SVMs offer an interesting new framework for NLP tasks, and using experimental analysis we examine how and why the method is effective at exploiting the latent TAG features in order to improve the precision of identifying and classifying semantic roles.
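As a rough illustration of the latent-variable idea described above, the sketch below scores a predicate-argument candidate by taking the maximum, over its latent derivations, of a linear score, and trains the weights with a simplified stochastic subgradient update on the hinge loss. This is a toy sketch under stated assumptions, not the training procedure used in the talk: the feature names, the function names (`latent_score`, `train_latent_svm`), the hyperparameters, and the data are all hypothetical, and each latent TAG derivation is assumed to have already been turned into a sparse feature dictionary.

```python
from typing import Dict, List, Tuple

Features = Dict[str, float]            # sparse features of one latent derivation
Example = Tuple[List[Features], int]   # (candidate derivations, label in {+1, -1})


def dot(w: Dict[str, float], feats: Features) -> float:
    """Sparse dot product between the weight vector and one feature dict."""
    return sum(w.get(name, 0.0) * value for name, value in feats.items())


def latent_score(w: Dict[str, float], candidates: List[Features]) -> Tuple[float, int]:
    """Score an example as the max over its latent derivations.

    Returns the best score and the index of the derivation that achieved it.
    """
    scores = [dot(w, feats) for feats in candidates]
    best = max(range(len(scores)), key=scores.__getitem__)
    return scores[best], best


def train_latent_svm(examples: List[Example], epochs: int = 20,
                     lr: float = 0.1, reg: float = 0.01) -> Dict[str, float]:
    """Simplified stochastic subgradient training (an assumption, not the talk's method).

    For each example, pick the currently best latent derivation, then take a
    hinge-loss step on that derivation's features if the margin is violated.
    """
    w: Dict[str, float] = {}
    for _ in range(epochs):
        for candidates, label in examples:
            score, best = latent_score(w, candidates)
            for name in list(w):               # L2 regularization shrink
                w[name] *= 1.0 - lr * reg
            if label * score < 1.0:            # margin violated: update
                for name, value in candidates[best].items():
                    w[name] = w.get(name, 0.0) + lr * label * value
    return w


# Hypothetical toy data: two examples, each with two latent derivations.
positive = [{"shared_path": 1.0}, {"direct_attachment": 1.0}]
negative = [{"shared_path": 1.0}, {"long_adjunction": 1.0}]
weights = train_latent_svm([(positive, +1), (negative, -1)])
pos_score, pos_choice = latent_score(weights, positive)
neg_score, neg_choice = latent_score(weights, negative)
```

In this toy setup the feature shared by both examples cannot separate them, so the positive example ends up being explained by its other derivation. That mirrors the hypothesis that positive and negative examples of a role may be distinguished by different latent features.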
Series
This talk is part of the NLIP Seminar Series.