Zero-shot learning and out-of-distribution generalization: two sides of the same coin
If you have a question about this talk, please contact Haim Dubossarsky.

Recent advances in large pre-trained language models have shifted the NLP community’s attention to new challenges: (a) training models with zero, or very few, examples, and (b) generalizing to out-of-distribution examples. In this talk, I will argue that the two are intimately related, and describe ongoing (read: new!) work in those directions.

First, I will describe a new pre-training scheme for open-domain question answering that is based on the notion of “recurring spans” across different paragraphs. We show that this training scheme leads to a zero-shot retriever that is competitive with DPR (which trains on thousands of examples) and is more robust with respect to the test distribution.

Second, I will focus on compositional generalization, a particular type of out-of-distribution generalization setup where models need to generalize to structures that are unobserved at training time. I will show that the view that seq2seq models categorically do not generalize to new compositions is false, and present a more nuanced analysis that elucidates the conditions under which models struggle to generalize compositionally.

This talk is part of the Language Technology Lab Seminars series.