10 Slides on Human Feedback
If you have a question about this talk, please contact Suchir Salhan.

In this talk, Max Bartolo will give a brief overview of the critical role human feedback plays in enhancing Large Language Model (LLM) performance and aligning model behaviours with human expectations. We will delve into key aspects of human feedback, examining some of its requirements, benefits, and challenges. We will explore how and where human feedback is or should be collected, and whom it does or should concern. Finally, we will dig deeper into what optimising for human feedback signals means, and raise important questions about what we can improve going forward.

Speaker Biography

Max Bartolo is a researcher at Cohere leading the post-training team (Command), and working group co-chair for Dynabench at MLCommons. He completed a PhD with the UCL NLP group, under the supervision of Pontus Stenetorp and Sebastian Riedel, focused on the adversarial robustness of Language Models with humans and models in the loop.

This talk is part of the NLIP Seminar Series.