University of Cambridge > Talks.cam > Engineering Safe AI > Embedded Agency

Embedded Agency

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact AdriĆ  Garriga Alonso.

Most current theories of agency deal only with dualistic or Cartesian agents. That is, the agent that makes the decisions is itself outside of the world that it takes decisions in, and is not affected by it.

But technically this is not the case, as the algorithm that makes decisions is implemented using something (a brain, a computer) in the world, and modifications to that something can change the algorithm.

According to some views, understanding the theory behind embedded, non-Cartesian agency, is key to solving some important problems in AI safety (provably aligned self-modification, wireheading). Others think it’s not so important. We shall learn about current attempts to build embedded agency theories and discuss how important it is to continue work in that area.

Reading list:

- Embedded Agency sequence from MIRI : https://www.lesswrong.com/s/Rm6oQRJJmhGCcLvxh/p/i3BTagvt3HbPMx6PN

This talk is part of the Engineering Safe AI series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity