Multiword Expressions and Compositionality Detection: Giving Word Embeddings a Hard Time
- đ¤ Speaker: Aline Villavicencio (Federal University of Rio Grande do Sul) đ Website
- đ Date & Time: Thursday 06 October 2016, 11:00 - 12:00
- đ Venue: GR04, English Faculty, 9 West Road (Sidgwick Site)
Abstract
In this talk I start with an overview of Multiword Expressions (MWEs) like compound nouns and verb particle constructions, which have proved a challenge for computational analysis. These expressions need to be treated as a unit at some level of linguistic description. In particular, they display a wide range of compositionality, from more compositional cases like police car to more idiomatic MWEs like kick the bucket. Models for representing words and MWEs in semantic space, and their ability to capture compositionality/idiomaticity will be compared for three languages: English, French and Portuguese. The impact of some factors like the degree of corpus pre-processing and the size of context for the performance of these models will be discussed. I discuss the findings of a large-scale multilingual evaluation of DSMs for predicting the degree of semantic compositionality of nominal compounds on 4 datasets for English and French.
Series This talk is part of the Language Technology Lab Seminars series.
Included in Lists
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- GR04, English Faculty, 9 West Road (Sidgwick Site)
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- Language Technology Lab Seminars
- ndk22's list
- ob366-ai4er
- rp587
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)



Thursday 06 October 2016, 11:00-12:00