On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization
Toward the understanding of partial-monitoring games
Bounding the Gaussian Process Information Gain: Applications to PAC-Bayes and GP Bandit Optimization
Piecewise-Stationary Bandit Problems with Side Information
Piecewise-Stationary Bandit Problems with Side Observations
Mathematics of Machine Learning
Advanced Topics in RL
The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond
Forced-Exploration Based Algortihms for Playing in Stochastic Linear Bandits
Reinforcement Learning