Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Document Summarisation: Modelling, Datasets and Verification of Content

Add to your list(s) Download to your calendar using vCal

Shay Cohen, University of Edinburgh
Thursday 11 November 2021, 11:00-12:00
https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09.

If you have a question about this talk, please contact Marinela Parovic.

Within Natural Language Processing, document summarisation is one of the central problems. It has both short-term societal implications and long-term implications in terms of the success of AI. I will describe advances made in this area with respect to three different aspects: methodology and modelling, dataset development and enforcing factuality of summaries. In relation to modelling, I will show how reinforcement learning can be used to directly maximise the metric by which the summaries are being evaluated. With regards to dataset development, I will describe a dataset that we released for summarisation, XSum, in which a single sentence is used to describe the content of a whole article. The dataset has become a standard benchmark for summarisation. Finally, in relation to factuality, I will show how one can improve the quantitative factuality of summaries by re-ranking them in a beam based on a “verification” model.

This talk is part of the Language Technology Lab Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Document Summarisation: Modelling, Datasets and Verification of Content

This talk is included in these lists:

Other lists

Other talks