
Unsupervised Question Answering


If you have a question about this talk, please contact Edoardo Maria Ponti.

Obtaining training data for Question Answering (QA) is time-consuming and costly, and existing QA datasets are only available for limited domains and languages. In this talk, we'll explore to what extent high-quality training data is actually required for extractive QA, and investigate the possibility of unsupervised extractive QA. We approach this problem by first learning to generate context, question, and answer triples in an unsupervised manner, which we then use to synthesize extractive QA training data automatically. We find that modern QA models can learn to answer human questions surprisingly well using only synthetic training data. We demonstrate that, without using the SQuAD training data at all, our approach achieves 56.4 F1 on SQuAD v1 (64.5 F1 when the answer is a named-entity mention), outperforming early supervised models. We will also explore methods for building cross-lingual QA models that require no cross-lingual supervision (zero-shot language transfer), as well as the challenge of fairly evaluating their performance across many target languages.
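To make the triple-generation idea concrete, here is a minimal sketch of one common unsupervised recipe: pick candidate answer spans from a passage (here, a crude capitalized-token heuristic standing in for a named-entity tagger) and mask them to form cloze-style questions. This is an illustrative assumption, not the speaker's actual pipeline; all names (`make_cloze_examples`, `MASK`) are hypothetical.

```python
import re
import random

MASK = "[MASK]"  # hypothetical placeholder standing in for the answer span

def make_cloze_examples(context, seed=0):
    """Heuristically synthesize (context, question, answer) triples.

    Crude sketch: split the context into sentences, treat runs of
    capitalized words (a rough named-entity proxy) as candidate answers,
    and turn the containing sentence into a cloze-style question by
    masking the chosen answer span.
    """
    rng = random.Random(seed)
    sentences = re.split(r"(?<=[.!?])\s+", context)
    examples = []
    for sent in sentences:
        # Candidate answers: capitalized-word runs not at sentence start
        # (sentence-initial words are capitalized regardless of entity-hood).
        candidates = [m for m in re.finditer(r"\b[A-Z][a-z]+(?: [A-Z][a-z]+)*", sent)
                      if m.start() > 0]
        if not candidates:
            continue
        m = rng.choice(candidates)
        answer = m.group(0)
        # Cloze question: the sentence with the answer span masked out.
        question = sent[:m.start()] + MASK + sent[m.end():]
        examples.append({
            "context": context,
            "question": question,
            "answer": answer,
            "answer_start": context.find(answer),  # SQuAD-style char offset
        })
    return examples
```

A real system would replace the masking step with learned question generation (e.g. translating the cloze into natural language), but the output format above already matches what an extractive QA model expects at training time.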

This talk is part of the Language Technology Lab Seminars series.

