
Value Propagation: A Graphical Model for Bayesian Reinforcement Learning


If you have a question about this talk, please contact Carl Scheffler.

I will present Bayesian Reinforcement Learning methods based on the model-free approach used in the Temporal Difference family of algorithms. Our implementations incorporate prior knowledge in a principled way and automatically adapt their learning rate and backup depth. Because they track the uncertainty of their value estimates, in policy iteration settings they can guide exploration and converge on the optimal policy in an automated fashion.
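
To illustrate the general idea of an adaptive learning rate arising from tracked uncertainty, here is a minimal sketch of a Kalman-style Bayesian TD(0) update for a single tabular state. It assumes a Gaussian value model per state and a fixed observation-noise parameter; it is only an illustration of the flavour of such methods, not the speaker's actual Value Propagation algorithm.

```python
import numpy as np

def bayesian_td_update(mean, var, reward, next_mean, next_var,
                       gamma=0.95, obs_noise=1.0):
    """One illustrative Bayesian TD(0)-style update (hypothetical sketch).

    The value of the current state is modelled as a Gaussian N(mean, var).
    The bootstrapped target r + gamma * V(s') is treated as a noisy
    observation of that value; its variance combines the observation noise
    with the discounted uncertainty of the next state's estimate.
    The Kalman gain plays the role of an adaptive learning rate: it shrinks
    automatically as the estimate becomes more certain.
    """
    target = reward + gamma * next_mean
    target_var = obs_noise + gamma**2 * next_var
    gain = var / (var + target_var)           # adaptive "learning rate"
    new_mean = mean + gain * (target - mean)  # TD error scaled by the gain
    new_var = (1.0 - gain) * var              # posterior uncertainty shrinks
    return new_mean, new_var
```

In this toy update, the posterior variance of each state's value both sets the effective step size and could serve as an exploration signal, which is the kind of behaviour the abstract refers to when it mentions automatic adaptation and uncertainty-guided exploration.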

This talk is part of the Machine Learning Journal Club series.

