University of Cambridge > Talks.cam > MRC Biostatistics Unit Seminars > BSU Seminar: "Efficient Sequential Experimentation: Bridging Model-Based Reinforcement Learning and Bayesian Optimal Experimental Design"

BSU Seminar: "Efficient Sequential Experimentation: Bridging Model-Based Reinforcement Learning and Bayesian Optimal Experimental Design"

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Alison Quenault.

This will be a free hybrid seminar. To register to attend virtually, please click here: https://cam-ac-uk.zoom.us/webinar/register/WN_GS1_Dn3SQXydSuW9p3oNnQ

Sequential Experimental Design (SED) aims to adaptively collect data that most efficiently reduce uncertainty in a statistical model of interest in a multi-stage context, where each stage’s choice depends on the knowledge gained from previous ones. Although this is a classic task in many applied disciplines, recent advances in Reinforcement Learning (RL) offer interesting new perspectives for approaching this challenge. In this talk, I will first discuss how the Bayes-Adaptive Markov Decision Processes (BAMDPs) framework provides a principled way to balance exploration and exploitation in RL when key parameters are uncertain, and how it naturally lends itself to model multi-stage SED through a parametrized “sampling policy” that selects actions/experiments based on the current context. Building on ideas from Model-Based Reinforcement Learning (MBRL), I will then present a method for picking design choices in SED by “looking ahead” and maximizing a Cumulative Expected Information Gain (C-EIG) objective over H-steps, which can be approximated in the context of MBRL -based planning via a measure of disagreement among an ensemble of probabilistic dynamics models. By explicitly modelling how each design choice affects downstream data-collection, this approach avoids myopic behaviour and enables the selection of actions/experiments that are most informative in expectation not just immediately, but across an entire H-steps sequence of trials.

This talk is part of the MRC Biostatistics Unit Seminars series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2025 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity