Partially Observable Markov Decision Processes (POMDPs)


Rowan McAllister (University of Cambridge), Alex Navarro
Thursday 02 June 2016, 14:30-16:00
Engineering Department, CBL Room 438.

If you have a question about this talk, please contact Yingzhen Li.

Partially observable Markov decision processes (POMDPs) are a general framework for sequential decision-making tasks in which we have uncertainty about the state of the world. Many important real-world problems can be characterised as POMDPs, including financial trading, spoken dialogue systems and autonomous car navigation. In recent years reinforcement learning has also been characterised as a POMDP, opening up an entire literature of POMDP solutions to the infamous exploration-exploitation dilemma. In this talk we begin with simpler models, including the Markov process and hidden Markov models, before moving on to their generalisation, the POMDP. We show an important property of POMDPs, namely that their finite-horizon optimal value functions are piecewise-linear and convex in the belief state, a property which many POMDP algorithms exploit. Alternatively, tree-based search methods can be suitable online-yet-approximate POMDP solutions. Overall we aim to convey some of the main ideas discovered over the past 50 years in POMDP research and to bring the audience up to the present-day POMDP literature.
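As a rough illustration of two ideas mentioned in the abstract, the sketch below shows a Bayes-filter belief update over hidden states and a value function written as a maximum over linear functions of the belief, which is what makes it piecewise-linear and convex. It is not taken from the talk: the two-state model, the matrices `T` and `O`, the alpha-vectors and the function names are all hypothetical, chosen only to keep the example small.

```python
# Minimal sketch of a two-state POMDP with a single fixed action.
# All numbers and names here are illustrative assumptions, not from the talk.

def belief_update(belief, transition, observation_prob, obs):
    """Bayes filter: b'(s2) is proportional to O(obs | s2) * sum_s T(s2 | s) * b(s)."""
    n = len(belief)
    # Prediction step: push the current belief through the transition model.
    predicted = [sum(transition[s][s2] * belief[s] for s in range(n))
                 for s2 in range(n)]
    # Correction step: weight each state by the likelihood of the observation.
    unnormalised = [observation_prob[s2][obs] * predicted[s2] for s2 in range(n)]
    total = sum(unnormalised)
    if total == 0.0:
        raise ValueError("observation has zero probability under this belief")
    return [p / total for p in unnormalised]


def value(belief, alpha_vectors):
    """Piecewise-linear convex value: max over alpha-vectors of dot(alpha, belief)."""
    return max(sum(a * b for a, b in zip(alpha, belief))
               for alpha in alpha_vectors)


T = [[0.9, 0.1],   # T[s][s2] = P(next state s2 | current state s)
     [0.2, 0.8]]
O = [[0.7, 0.3],   # O[s2][o] = P(observation o | state s2)
     [0.1, 0.9]]

b0 = [0.5, 0.5]                       # uniform prior over the two hidden states
b1 = belief_update(b0, T, O, obs=1)   # posterior after seeing observation 1

# Hypothetical alpha-vectors; each one is a linear function of the belief.
alphas = [[1.0, 0.0], [0.0, 1.0], [0.6, 0.6]]
v_mid = value([0.5, 0.5], alphas)
v_ends = 0.5 * value([1.0, 0.0], alphas) + 0.5 * value([0.0, 1.0], alphas)
```

Because `value` takes a maximum of linear functions, its value at a mixture of two beliefs never exceeds the same mixture of its values at those beliefs, so `v_mid <= v_ends` holds here.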

This talk is part of the Machine Learning Reading Group @ CUED series.
