Pitfalls in Evaluation of Multilingual Text Representations
- 👤 Speaker: Goran Glavaš
- 📅 Date & Time: Thursday 03 December 2020, 11:00 - 12:00
- 📍 Venue: https://teams.microsoft.com/l/meetup-join/19%3ameeting_YmQyY2ViNDgtZDE1MC00MzZhLWFjZGItOWFmMjM2OTI1ZDQy%40thread.v2/0?context=%7b%22Tid%22%3a%2249a50445-bdfa-4b79-ade3-547b4f3986e9%22%2c%22Oid%22%3a%2230bfe2fc-8896-487c-84f2-f4b8875a60b2%22%7d
Abstract
Multilingual representation spaces, spanned by multilingual word embeddings or massively multilingual transformers, conceptually enable modeling of meaning across a wide range of languages, as well as language transfer of task-specific NLP models from resource-rich to resource-lean languages. It is not yet clear, however, to what extent this conceptual promise holds in practice. Recent models, both cross-lingual word embedding models and multilingual transformers, have been praised for being able to induce multilingual representation spaces without any explicit supervision (i.e., without any word-level alignments or parallel corpora). In this talk, I will point to some prominent shortcomings and pitfalls of existing evaluations of multilingual representation spaces, which mask important limitations of state-of-the-art multilingual representation models. Remedying some of these evaluation shortcomings portrays the meaning representation and language transfer capabilities of current state-of-the-art multilingual representation spaces in a less favorable light.
Series
This talk is part of the Language Technology Lab Seminars series.