University of Cambridge > Talks.cam > Language Technology Lab Seminars > Document Summarisation: Modelling, Datasets and Verification of Content

Log in

University Account

External (via Google)

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Document Summarisation: Modelling, Datasets and Verification of Content

Download to your calendar using vCal

Shay Cohen, University of Edinburgh
Thursday 11 November 2021, 11:00-12:00
https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09.

If you have a question about this talk, please contact Marinela Parovic .

Within Natural Language Processing, document summarisation is one of the central problems. It has both short-term societal implications and long-term implications in terms of the success of AI. I will describe advances made in this area with respect to three different aspects: methodology and modelling, dataset development and enforcing factuality of summaries. In relation to modelling, I will show how reinforcement learning can be used to directly maximise the metric by which the summaries are being evaluated. With regards to dataset development, I will describe a dataset that we released for summarisation, XSum, in which a single sentence is used to describe the content of a whole article. The dataset has become a standard benchmark for summarisation. Finally, in relation to factuality, I will show how one can improve the quantitative factuality of summaries by re-ranking them in a beam based on a “verification” model.

This talk is part of the Language Technology Lab Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Document Summarisation: Modelling, Datasets and Verification of Content

📅 Download to calendar (vCal)

👤 Speaker: Shay Cohen, University of Edinburgh
📅 Date & Time: Thursday 11 November 2021, 11:00 - 12:00
📍 Venue: https://cam-ac-uk.zoom.us/j/97599459216?pwd=QTRsOWZCOXRTREVnbTJBdXVpOXFvdz09

Questions? Contact Marinela Parovic

Abstract

Series This talk is part of the Language Technology Lab Seminars series.

Included in Lists

Note: Ex-directory lists are not shown.

Log in

🔐 Log In

Information on

ℹ️ Information

Document Summarisation: Modelling, Datasets and Verification of Content

This talk is included in these lists:

Document Summarisation: Modelling, Datasets and Verification of Content

Abstract

Included in Lists

Log in

🔐 Log In

Information on

ℹ️ Information

Document Summarisation: Modelling, Datasets and Verification of Content

This talk is included in these lists:

Other lists

Other talks

Document Summarisation: Modelling, Datasets and Verification of Content

Abstract

Included in Lists