COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > CUED Control Group Seminars > Non-Stationary Representation Learning in Sequential Linear Bandits
Non-Stationary Representation Learning in Sequential Linear BanditsAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Xiaodong Cheng. Humans are naturally endowed with the ability to learn and transfer experience to later unseen tasks. One of the key mechanisms enabling such versatility is the abstraction of past experience into a ‘basis set’ of simpler representations that can be used to construct new strategies much more efficiently in future complex environments. What can we learn from humans when we design decision-making strategies? In this talk, we will talk about representation learning for multi-task decision-making in nonstationary environments. We consider the framework of sequential linear bandits, where the agent performs a series of tasks drawn from different environments. The embeddings of tasks in each environment share a low-dimensional feature extractor called representation, and representations are different across environments. We propose an online algorithm that facilitates efficient decision-making by learning and transferring non-stationary representations in an adaptive fashion. This talk is part of the CUED Control Group Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsTackling Obesity with Big Data: methods & models - One Day Seminar The obesity epidemic: Discussing the global health crisis dh539Other talksToward deciding the AE-theory of the Sigma02-enumeration degrees Classification problem for effective structures Statistics Clinic Summer 2022 I Computational Inverse Design of Deployable Structures Statistics Clinic Summer 2022 II Session 2 of ECR Showcase |