COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > Signal Processing and Communications Lab Seminars > Efficient stochastic optimal control for navigation and motor planning
Efficient stochastic optimal control for navigation and motor planningAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Taylan Cemgil. This talk discusses a class of non-linear stochastic control problems that can be efficiently solved using a path integral. In this control formalism, the central concept of cost-to-go or value function becomes a free energy and methods and concepts from statistical physics can be readily applied, such as Monte Carlo sampling or the Laplace approximation. Qualitatively different optimal control strategies for different noise levels can be understood as a result of spontaneous symmetry breaking. When applied to a receding horizon problem in a stationary environment, the solution resembles the one obtained by traditional reinforcement learning with discounted reward. An advantage of the path integral control method over RL is that the control can be computed for the current state, without considering all other states and 2) that it can be easily generalized to time-dependent tasks. It is therefore a suitable approach for time dependent control. We further discuss exploration and an how agents can approximately compute their coordination using belief propagation. This talk is part of the Signal Processing and Communications Lab Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsCambridge Analysts' Knowledge Exchange Cancer Genetic Epidemiology Seminar Series CU Palestine SocietyOther talksThe role of the oculomotor system in visual attention and visual short-term memory Uncertainty Quantification of geochemical and mechanical compaction in layered sedimentary basins Environmental shocks and demographic consequences in England: 1280-1325 and 1580-1640 compared 5 selfish reasons to work reproducibly Solving the Reproducibility Crisis Vision Journal Club: feedforward vs back in figure ground segmentation Networks, resilience and complexity Graph Legendrians and SL2 local systems Lunchtime Talk: Helen's Bedroom Beyond truth-as-correspondence: realism for realistic people |