Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

AI Safety Gridworlds: Is my agent 'safe'?

Add to your list(s) Download to your calendar using vCal

Jessica Yung (University of Cambridge)
Wednesday 28 February 2018, 17:00-18:30
Cambridge University Engineering Department, CBL Seminar room BE4-38. For directions see http://learning.eng.cam.ac.uk/Public/Directions.

If you have a question about this talk, please contact Adrià Garriga Alonso.

AI Safety Gridworlds are a suite of 2D reinforcement learning environments that test for desirable safety properties of an agent, such as correct objective specification and robustness.

We will first discuss the paper’s approach to formalising safety properties in environments. Next, we will demo some of the environments and discuss whether they are reasonable tests of desirable properties. Finally, we will discuss why certain algorithms (among variations of RAINBOW and A2C ) seem to have better safety performance than others.

This week’s talk is a good opportunity to get a big-picture view of AI safety from a practical perspective. No prior knowledge of AI safety is needed.

You can try the environments for yourself by cloning this git repo: https://github.com/deepmind/ai-safety-gridworlds/tree/master/ai_safety_gridworlds

Paper: AI Safety Gridworlds (Leike et. al., 2017) https://arxiv.org/abs/1711.09883

This talk is part of the Engineering Safe AI series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

AI Safety Gridworlds: Is my agent 'safe'?

This talk is included in these lists:

Other lists

Other talks