![]() |
COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. | ![]() |
University of Cambridge > Talks.cam > Language Technology Lab Seminars > Behavioral machine learning
![]() Behavioral machine learningAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact shun shao. Abstract: While benchmark datasets have been important for making progress in supervised learning, they cannot capture the diversity of uses afforded by large language models and other general-purpose technologies. How can we trust metrics that don’t capture our experience using these models? This talk presents alternative approaches to evaluation that incorporate behavioral models of how people use them. In the first part of the talk, we study a human generalization function that arises when people make inferences about what an LLM can do based on their interactions with it. This motivates a new form of alignment: the best LLM is the one that allows people to make the most accurate inferences about where it will or won’t succeed. In the second part of the talk, we describe a new common task for evaluating how well people can steer generative models toward desired outputs. We find that humans struggle to steer text-to-image models, and we propose a new method, reinforcement learning for human steering (RLHS), that empirically improves steerability. Bio: Keyon Vafa is a postdoctoral fellow at the Harvard Data Science Initiative. His research focuses on developing ML methods to address economic questions along with using insights from the behavioral sciences to improve ML methods. Keyon completed his PhD in computer science from Columbia University, where he was an NSF GRFP Fellow and the recipient of the Morton B. Friedman Memorial Prize for excellence in engineering. He was a co-organizer of the NeurIPS 2024 Workshop on Behavioral Machine Learning and is a member of the early career board of the the Harvard Data Science Review. This talk is part of the Language Technology Lab Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsMycenaean Seminar Lady Margaret Beaufort Commemoration Event St Edmund's College Political Forum SECPFOther talksModelling sea ice dynamics using brittle dynamics: impact in pack ice and marginal ice zones : Scratching Beneath the Surface of Pan Communication: Intentions, Representations and Joint Attention Towards Global Maps of Anthropogenic Threats to Biodiversity and Their Contributions to Species Extinctions RSC Prizewinner Seminar - "Linking structure and properties in complex quantum materials" Multiphase Transport Phenomena and Energy Process Intensification Rothschild Public Lecture: Title TBC |