COOKIES: By using this website you agree that we can place Google Analytics Cookies on your device for performance monitoring. |
Bayesian Best-Arm IdentificationAdd to your list(s) Download to your calendar using vCal
If you have a question about this talk, please contact . Zoom passcode: 934913 In multi-armed bandits, a learner repeatedly chooses an arm to play, and receives a reward from the associated unknown probability distribution. We study the task of best-arm identification (BAI), where the learner is not only asked to sample an arm at each stage, but is also asked to output a recommendation (i.e., a guess for the arm with the largest mean reward) after a certain period. Unlike in another well-studied bandit setting, the learner is not interested in maximising the sum of rewards gathered during the exploration (or minimising regret), but only cares about the quality of her recommendation. We investigate a Bayesian-flavoured sampling rule called Top-Two Thompson sampling (TTTS). In particular, we justify its use for fixed-confidence BAI . We further propose a variant of TTTS called Top-Two Transportation Cost (T3C), which disposes of the computational burden of TTTS . As our main contribution, we provide the first sample complexity analysis of TTTS and T3C when coupled with a very natural Bayesian stopping rule, for bandits with Gaussian rewards, solving one of the open questions raised by Russo (2016). We also provide new posterior convergence results for TTTS under two models that are commonly used in practice: bandits with Gaussian and Bernoulli rewards and conjugate priors. This talk is part of the ML@CL Seminar Series series. This talk is included in these lists:
Note that ex-directory lists are not shown. |
Other listsData Insights Cambridge Tools and Techniques for Speech Research Type the title of a new list herecirculOther talksStacking and twisting 2D materials for quantum nano-optoelectronics: Fundamentals and applications TMS Week 9 - Prof. Leonard Susskind of Stanford University Brake-actuated steering of heavy good vehicles ‘African Entrepreneurship: the Fetish of Personal Responsibility’ Title TBD |