COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
University of Cambridge > Talks.cam > MRC Biostatistics Unit Seminars > Virtual Seminar: 'Statistical Inference with M-Estimators on Bandit Data’
Virtual Seminar: 'Statistical Inference with M-Estimators on Bandit Data’Add to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact Alison Quenault. If you would like to join this virtual seminar, please email alison.quenault@mrc-bsu.cam.ac.uk for more information. Bandit algorithms are increasingly used in real world sequential decision making problems, from online advertising to mobile health. As a result, there are more datasets collected using bandit algorithms and with that an increased desire to be able to use these datasets to answer scientific questions like: Did one type of ad increase the click-through rate more or lead to more purchases? In which contexts is a mobile health intervention effective? However, it has been shown that classical statistical estimators, like the ordinary least squares estimator, fail to provide reliable confidence intervals when used with bandit data. Recently methods have been developed to conduct statistical inference using simple models fit to data collected with multi-arm bandits. However there is a lack of general methods for conducting statistical inference using more complex models. In this work, we develop theory justifying the use of M-estimation (Van der Vaart, 2000), traditionally used with i.i.d data, to provide inferential methods for a large class of estimators—including least squares and maximum likelihood estimators—but now with data collected with (contextual) bandit algorithms. To do this we generalize the use of adaptive weights pioneered by Hadad et al. (2019) and Deshpande et al. (2018). Specifically, in settings in which the data is collected via a (contextual) bandit algorithm, we prove that adaptively weighted M-estimators are uniformly asymptotically normal and demonstrate empirically that we can use their asymptotic distribution to construct reliable confidence regions for a variety of inferential targets. This talk is part of the MRC Biostatistics Unit Seminars series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsMicrosoft Research Computational Science Seminars Graduate Development Lecture Series Type the title of a new list hereOther talksTitle: Unlocking the link between the artery and outcomes in patients with lower limb peripheral arterial disease Sir William Petty’s Treatise of Taxes and Contributions: Politics and Political Economy in the Interregnum and Restoration State The Speed of Pathogen Adaptation to Vaccines Welcome back meeting - plant display and social evening Capital and labour: Theoretical foundations of the economics of slavery Human Immunity – one cell at a time |