Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

CCIMI Seminar: Kernel-based Methods for Bandit Convex Optimization

Add to your list(s) Download to your calendar using vCal

Sébastien Bubeck (Microsoft Research Redmond)
Wednesday 04 October 2017, 16:00-17:00
MR12.

If you have a question about this talk, please contact Quentin Berthet.

A lot of progress has been made in recent years on extending classical multi-armed bandit strategies to very large set of actions. A particularly challenging setting is the one where the action set is continuous and the underlying cost function is convex, this is the so-called bandit convex optimization (BCO) problem. I will tell the story of BCO and explain some of the new ideas that we recently developed to solve it. I will focus on three new ideas from our recent work http://arxiv.org/abs/1607.03084 with Yin Tat Lee and Ronen Eldan: (i) a new connection between kernel methods and the popular multiplicative weights strategy; (ii) a new connection between kernel methods and one of Erdos’ favorite mathematical object, the Bernoulli convolution, and (iii) a new adaptive (and increasing!) learning rate for multiplicative weights. These ideas could be of broader interest in learning/algorithm’s design.

This talk is part of the CCIMI Seminar Series

This talk is part of the Statistics series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

CCIMI Seminar: Kernel-based Methods for Bandit Convex Optimization

This talk is included in these lists:

Other lists

Other talks