COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
Engineering Safe AI
Add to your list(s)
Send you e-mail reminders
Further detail
Presentations and discussions about possible solutions to the value alignment problem. If you have a question about this list, please contact: Adrià Garriga Alonso. If you have a question about a specific talk, click on that talk to find its organiser. 0 upcoming talks and 38 talks in the archive. Engineering Safe AI: Robert MilesRobert Miles. Pavillion Room, Hughes Hall, University of Cambridge, Cambridge, CB1 2EW. Wednesday 29 January 2020, 19:00-20:30 Can Machines Read our Minds?Starting time 30min later than usual Brier Rigby Dames (University of Cambridge). Engineering Department, CBL Seminar room BE4-38. Wednesday 12 June 2019, 17:30-19:00 How useful is quantilization for mitigating specification-gaming?Speaker to be confirmed. Engineering Department, CBL Seminar room BE4-38. Wednesday 22 May 2019, 17:00-18:30 Misleading meta-objectives and hidden incentives for distributional shiftPaolo Bova (University of Cambridge). Engineering Department, CBL Seminar room BE4-38. Wednesday 08 May 2019, 17:00-19:00 Causal Reasoning from Meta-reinforcement LearningJakub Perlin (University of Cambridge). Engineering Department, CBL Seminar room BE4-38. Wednesday 13 March 2019, 17:00-19:00 Inverse Game TheoryGabija Maršalka. Engineering Department, CBL Seminar room BE4-38. Wednesday 06 March 2019, 17:00-19:00 Goals vs Utility FunctionsAdrià Garriga Alonso (University of Cambridge). Engineering Department, CBL Seminar room BE4-38. Wednesday 27 February 2019, 17:00-19:00 Who do we want to control human-level AI?Jade Leung (Center for the Governance of AI, University of Oxford). Boys Smith Room, Fisher Building, St John's College. Friday 22 February 2019, 19:30-21:00 Bayesian Theory of Mind: Modeling Joint Belief-Desire AttributionEdward Young (University of Cambridge). Engineering Department, CBL Seminar room BE4-38. Wednesday 20 February 2019, 17:00-19:00 Ambitious Value LearningAdrià Garriga Alonso (University of Cambridge). Cambridge University Engineering Department, CBL Seminar room BE4-38. Wednesday 13 February 2019, 17:00-19:00 Machine Theory of MindPaolo Bova (University of Cambridge). Cambridge University Engineering Department, CBL Seminar room BE4-38. Wednesday 06 February 2019, 17:00-19:00 Embedded AgencyAdrià Garriga Alonso (University of Cambridge). Cambridge University Engineering Department, CBL Seminar room BE4-38. Wednesday 30 January 2019, 17:00-19:00 Comprehensive AI ServicesAdrià Garriga Alonso (University of Cambridge). Cambridge University Engineering Department, CBL Seminar room BE4-38. Wednesday 23 January 2019, 17:00-19:00 Incomplete Contracting and AI AlignmentPaolo Bova (University of Cambridge). Cambridge University Engineering Department, CBL Seminar room BE4-38. Wednesday 28 November 2018, 17:00-19:00 The Algorithmic Foundations of Differential Privacy (Chapters 1 and 2)James Bell, University of Cambridge. Cambridge University Engineering Department, CBL Seminar room BE4-38. Wednesday 21 November 2018, 17:00-19:00 Dynamic Safe Interruptibility for Decentralized Multi-Agent Reinforcement LearningAdrià Garriga Alonso (University of Cambridge). Cambridge University Engineering Department, CBL Seminar room BE4-38. Wednesday 14 November 2018, 17:00-19:00 Measuring and avoiding side effects using relative reachabilityAdrià Garriga Alonso (University of Cambridge). Cambridge University Engineering Department, CBL Seminar room BE4-38. Wednesday 07 November 2018, 17:00-19:00 Interpretable Machine LearningTameem Adel (University of Cambridge). Wednesday 31 October 2018, 17:00-18:30 Scaling inverse reinforcement learning for human-compatible AIAdam Gleave, UC Berkeley. Tuesday 23 October 2018, 17:00-18:30 Motivation for this group, Goodhart's LawJames Bell, University of Cambridge. Wednesday 17 October 2018, 17:00-18:30 Approaches to avoiding negative side effectsAdrià Garriga Alonso (University of Cambridge). Wednesday 30 May 2018, 17:00-18:30 AI Safety Gridworlds: Is my agent 'safe'?Jessica Yung (University of Cambridge). Wednesday 28 February 2018, 17:00-18:30 Logical Induction: a computable approach to logical non-omniscienceAdrià Garriga Alonso (University of Cambridge). Wednesday 21 February 2018, 17:00-18:30 Decision Boundary Geometries and Robustness of Neural NetworksSven Wang (University of Cambridge). Wednesday 14 February 2018, 17:00-18:30 Decision Theory for AI safetyRichard Ngo (University of Cambridge). Wednesday 07 February 2018, 17:00-18:30 Safe Exploration in Reinforcement LearningFrances Ding (University of Cambridge). Wednesday 31 January 2018, 17:00-18:30 Amplification and dialogue as mechanisms for safe advanced AIBeth Barnes, Computer Lab, University of Cambridge. Wednesday 24 January 2018, 17:00-18:30 Last term summary + discussion of topic importanceAdrià Garriga Alonso (University of Cambridge). Wednesday 17 January 2018, 17:00-18:30 Counterargument to CIRL, and Safely Interruptible AgentsAdrià Garriga Alonso (University of Cambridge). Wednesday 06 December 2017, 17:00-18:30 Reinforcement learning with a corrupted reward functionTom McGrath, Imperial College London. Wednesday 29 November 2017, 17:00-18:30 Solomonoff Induction and a Definition of IntelligenceJames Bell, Richard Ngo (University of Cambridge). Wednesday 22 November 2017, 17:00-18:30 Deep Reinforcement Learning from Human PreferencesJessica Yung (University of Cambridge). Wednesday 15 November 2017, 17:00-18:30 An introduction to adversarial attacks and defencesYingzhen Li (University of Cambridge). Wednesday 08 November 2017, 17:00-18:30 'Off-Switch Games' and CorrigibilityRichard Ngo (University of Cambridge). Wednesday 01 November 2017, 17:00-18:30 Cooperative Inverse Reinforcement LearningRobert Pinsler (University of Cambridge). Wednesday 25 October 2017, 17:00-18:30 Engineering Safe AI seminar groupBeth Barnes, Computer Lab, University of Cambridge. Wednesday 18 October 2017, 17:00-18:30 Please see above for contact details for this list. |
Other listsAmnesty - China CRASSH Meeting the Challenge of Healthy Ageing in the 21st CenturyOther talksNew Insights in Immunopsychiatry (Provisional Title) Flow Cytometry Rethinking African Studies: The Wisdom of the Elders A passion for pottery: a photographer’s dream job Bears, Bulls and Boers: Market Making and Southern African Mining Finance, 1894-1899 CANCELLED: Alex Goodall: The US Marine Empire in the Caribbean and Central America, c.1870-1920 |