Feedback Forensics: Measuring AI Personality By Comparing Observed Behaviour
If you have a question about this talk, please contact Luning Sun. If you are interested in attending the talk online, please email the organiser and ask for a Teams invite.

Many personality tests ask participants hypothetical questions that require them to predict their own behaviour. Yet, as with humans, self-predicted AI behaviour does not always match observed behaviour. In this talk, I will introduce Feedback Forensics: a toolkit for measuring personality-related AI traits directly from observed behaviour data. By comparing how different models respond to the same input, our toolkit can measure a diverse set of traits related to the underlying personality, manner, and style of AI responses. I will share results that describe the traits exhibited by popular AI models and identify the traits encouraged by human feedback. The talk will feature a live demo of our personality visualisation tool, and attendees are invited to follow along via our online platform https://feedbackforensics.com/ (laptops are encouraged).

Bio: Arduin is currently a PhD student in the Department of Computer Science in Cambridge working on AI model evaluation. His work focuses on understanding which desirable and undesirable model behaviours are reinforced by human and AI feedback. Prior to joining his current PhD programme, Arduin completed an MPhil in Machine Learning and Machine Intelligence in Cambridge’s Engineering Department. Recently, Arduin also worked on model evaluation within Apple’s Foundation Models team as an intern.

This talk is part of the Cambridge Psychometrics Centre Seminars series.