Several structured thresholding bandit problems
In this talk we will discuss the thresholding bandit problem, a sequential learning setting in which the learner samples from K unknown distributions over T rounds and aims at outputting, at the end, the set of distributions whose means \mu_k are above a threshold \tau. We will study this problem under four structural assumptions (shape constraints) on the sequence of means: monotone, unimodal, concave, or unstructured (the vanilla case). In each case we will provide minimax lower bounds on the performance of any strategy, together with algorithms that match them. This will highlight the fact that, even more than in batch learning, structural assumptions have a huge impact in sequential learning.
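To make the protocol concrete, here is a minimal Python sketch of the vanilla (unstructured) setting: K Gaussian arms are sampled for T rounds and the arms with empirical means above \tau are returned. The allocation rule shown is an APT-style index (in the spirit of Locatelli et al., 2016), used here purely as one illustrative strategy; it is not necessarily the algorithm discussed in the talk, and the arm means, threshold, and budget in the example are made up.

```python
import numpy as np

def thresholding_bandit(means, tau, T, eps=0.0, rng=None):
    """Illustrative sketch of the vanilla thresholding bandit protocol.

    Arms are Gaussian with unit variance; the index rule below is an
    APT-style allocation, shown only as one possible strategy.
    """
    rng = np.random.default_rng() if rng is None else rng
    K = len(means)
    pulls = np.zeros(K, dtype=int)
    sums = np.zeros(K)

    # Initialisation: pull every arm once.
    for k in range(K):
        sums[k] += rng.normal(means[k], 1.0)
        pulls[k] += 1

    # Remaining budget: pull the arm whose empirical mean looks closest
    # to tau, rescaled by how often it has already been sampled.
    for _ in range(K, T):
        mu_hat = sums / pulls
        index = np.sqrt(pulls) * (np.abs(mu_hat - tau) + eps)
        k = int(np.argmin(index))
        sums[k] += rng.normal(means[k], 1.0)
        pulls[k] += 1

    # Output the estimated set of arms whose means exceed the threshold.
    mu_hat = sums / pulls
    return {k for k in range(K) if mu_hat[k] >= tau}

# Example with a monotone sequence of means and threshold tau = 0.5.
print(thresholding_bandit(means=[0.1, 0.3, 0.45, 0.55, 0.8], tau=0.5, T=2000))
```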