COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > MRC Biostatistics Unit Seminars > BSU Seminar: "Efficient Sequential Experimentation: Bridging Model-Based Reinforcement Learning and Bayesian Optimal Experimental Design"
BSU Seminar: "Efficient Sequential Experimentation: Bridging Model-Based Reinforcement Learning and Bayesian Optimal Experimental Design"Add to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Alison Quenault. This will be a free hybrid seminar. To register to attend virtually, please click here: https://cam-ac-uk.zoom.us/webinar/register/WN_GS1_Dn3SQXydSuW9p3oNnQ Sequential Experimental Design (SED) aims to adaptively collect data that most efficiently reduce uncertainty in a statistical model of interest in a multi-stage context, where each stage’s choice depends on the knowledge gained from previous ones. Although this is a classic task in many applied disciplines, recent advances in Reinforcement Learning (RL) offer interesting new perspectives for approaching this challenge. In this talk, I will first discuss how the Bayes-Adaptive Markov Decision Processes (BAMDPs) framework provides a principled way to balance exploration and exploitation in RL when key parameters are uncertain, and how it naturally lends itself to model multi-stage SED through a parametrized “sampling policy” that selects actions/experiments based on the current context. Building on ideas from Model-Based Reinforcement Learning (MBRL), I will then present a method for picking design choices in SED by “looking ahead” and maximizing a Cumulative Expected Information Gain (C-EIG) objective over H-steps, which can be approximated in the context of MBRL -based planning via a measure of disagreement among an ensemble of probabilistic dynamics models. By explicitly modelling how each design choice affects downstream data-collection, this approach avoids myopic behaviour and enables the selection of actions/experiments that are most informative in expectation not just immediately, but across an entire H-steps sequence of trials. This talk is part of the MRC Biostatistics Unit Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsGraduate Development Lecture Series C.P. Snow Lectures Global Food Security at CambridgeOther talksTBD Where is Amazonia on display (today)? A global approach to understanding Amazonian collections Talk by Bushra Mohamed, MSOMA Pianos, guitars and double decays Synchronization in Navier-Stokes turbulence and it's role in data-driven modeling Mass Spectrometry: Structural Proteomics in the LMB |