GENALTI GIANMARCO | Cycle: XXXVIII |
Advisor: GATTI NICOLA
Tutor: CERI STEFANO
Major Research topic:
Stochastic multi-armed bandits with structure
Abstract:
This work aims to explore several research directions in stochastic sequential decision-making. The stochastic multi-armed bandit is a theoretical framework in which an agent must repeatedly decide among a predefined set of possible actions while receiving sequential feedback. The main challenge is balancing exploiting the most promising actions with exploring the whole set. An underlying mathematical model links different arms' rewards in bandits with structure. In real-world applications and theory, correctly exploiting this structure can remarkably boost an agent's performance. The focus of this work will be on bandits with temporal structure (i.e., time-dependent dynamics in the reward functions, such as the autoregressive bandits framework) and bandits with side-information feedback (i.e., settings in which the feedback is halfway between the full-information feedback and the bandit feedback, such as graph feedback or Lipschitz reward functions). The goal is to explore these settings and provide novel algorithms, together with theoretical results aimed at bridging the gap between different types of structures, presenting a unifying framework to analyze the theoretical guarantees of the provided approaches.
Cookies
We serve cookies. If you think that's ok, just click "Accept all". You can also choose what kind of cookies you want by clicking "Settings".
Read our cookie policy
Cookies
Choose what kind of cookies to accept. Your choice will be saved for one year.
Read our cookie policy
-
Necessary
These cookies are not optional. They are needed for the website to function. -
Statistics
In order for us to improve the website's functionality and structure, based on how the website is used. -
Experience
In order for our website to perform as well as possible during your visit. If you refuse these cookies, some functionality will disappear from the website. -
Marketing
By sharing your interests and behavior as you visit our site, you increase the chance of seeing personalized content and offers.