Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

BSU Seminar: "Efficient Sequential Experimentation: Bridging Model-Based Reinforcement Learning and Bayesian Optimal Experimental Design"

Add to your list(s) Download to your calendar using vCal

Alberto Caron, The Alan Turing Institute
Tuesday 04 February 2025, 14:00-15:00
Large Seminar Room, East Forvie Building, Forvie Site Robinson Way Cambridge CB2 0SR..

If you have a question about this talk, please contact Alison Quenault.

This will be a free hybrid seminar. To register to attend virtually, please click here: https://cam-ac-uk.zoom.us/webinar/register/WN_GS1_Dn3SQXydSuW9p3oNnQ

Sequential Experimental Design (SED) aims to adaptively collect data that most efficiently reduce uncertainty in a statistical model of interest in a multi-stage context, where each stage’s choice depends on the knowledge gained from previous ones. Although this is a classic task in many applied disciplines, recent advances in Reinforcement Learning (RL) offer interesting new perspectives for approaching this challenge. In this talk, I will first discuss how the Bayes-Adaptive Markov Decision Processes (BAMDPs) framework provides a principled way to balance exploration and exploitation in RL when key parameters are uncertain, and how it naturally lends itself to model multi-stage SED through a parametrized “sampling policy” that selects actions/experiments based on the current context. Building on ideas from Model-Based Reinforcement Learning (MBRL), I will then present a method for picking design choices in SED by “looking ahead” and maximizing a Cumulative Expected Information Gain (C-EIG) objective over H-steps, which can be approximated in the context of MBRL -based planning via a measure of disagreement among an ensemble of probabilistic dynamics models. By explicitly modelling how each design choice affects downstream data-collection, this approach avoids myopic behaviour and enables the selection of actions/experiments that are most informative in expectation not just immediately, but across an entire H-steps sequence of trials.

This talk is part of the MRC Biostatistics Unit Seminars series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

BSU Seminar: "Efficient Sequential Experimentation: Bridging Model-Based Reinforcement Learning and Bayesian Optimal Experimental Design"

This talk is included in these lists:

Other lists

Other talks