University of Cambridge > Talks.cam > NLIP Seminar Series > Typological Feature Prediction and Blinding for Cross-Lingual NLP

Typological Feature Prediction and Blinding for Cross-Lingual NLP

Add to your list(s) Download to your calendar using vCal

  • UserJohannes Bjerva (Aalborg University)
  • ClockFriday 21 May 2021, 12:00-13:00
  • HouseVirtual (Zoom).

If you have a question about this talk, please contact Huiyuan Xie.

Join Zoom Meeting https://cl-cam-ac-uk.zoom.us/j/99610057235?pwd=dGV0bENBa1lxTmFaa28rSmg2dFlldz09

Meeting ID: 996 1005 7235 Passcode: 525413

I will discuss the usefulness of typological features in NLP , particularly in cross-lingual settings. On the one hand, typological features from databases such as the World Atlas of Language Structures (WALS) seem promising for cross-lingual NLP , as annotations for useful aspects of language exist even for very low-resource languages. Furthermore, missing features in WALS can be predicted with relatively high success, and has been the focus of much recent work (e.g. in the SIGTYP 2020 shared task). When it comes to application of these features, however, previous work has only found minor benefits from using typological information in actual NLP modelling. In recent work (to appear at EACL 2021 ), we hypothesised that these minor gains might stem from that a model trained in a cross-lingual setting picks up on typological cues from the input data, thus overshadowing the utility of explicitly using such features. We verify this hypothesis by blinding a model to typological information, and investigate how cross-lingual sharing and performance is impacted. While this sheds some light on the matter, I want to further explore the question of the usefulness of typological feature prediction in general, and the use of such features in NLP .

This talk is part of the NLIP Seminar Series series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity