Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

10 Slides on Human Feedback

Add to your list(s) Download to your calendar using vCal

Max Bartolo (Cohere)
Friday 08 November 2024, 12:00-13:00
Zoom link: https://cam-ac-uk.zoom.us/j/4751389294?pwd=Z2ZOSDk0eG1wZldVWG1GVVhrTzFIZz09.

If you have a question about this talk, please contact Suchir Salhan.

In this talk, Max Bartolo will share a brief overview of the critical role human feedback plays in enhancing Large Language Model (LLM) performance and aligning model behaviours to human expectations. We will delve into key aspects of human feedback, examining some of its requirements, benefits, and challenges. We will explore questions along the lines of how, where, and who human feedback collection does or should concern. Finally, we will dig deeper into what optimising for human feedback signals means, and raise important questions about what we can improve going forward.

Speaker Biography

Max Bartolo is a researcher at Cohere leading the post-training team (Command), and working group co-chair for Dynabench at MLCommons. He completed a PhD, under the supervision of Pontus Stenetorp and Sebastian Riedel, with the UCL NLP group focused on the adversarial robustness of Language Models with humans and models in the loop.

This talk is part of the NLIP Seminar Series series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

10 Slides on Human Feedback

This talk is included in these lists:

Other lists

Other talks