Dealing With Misspecification In ...
Document type :
Communication dans un congrès avec actes
Title :
Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Author(s) :
Réda, Clémence [Auteur]
Maladies neurodéveloppementales et neurovasculaires [NeuroDiderot (UMR_S_1141 / U1141)]
Tirinzoni, Andrea [Auteur]
Scool [Scool]
Degenne, Rémy [Auteur]
Scool [Scool]
Maladies neurodéveloppementales et neurovasculaires [NeuroDiderot (UMR_S_1141 / U1141)]
Tirinzoni, Andrea [Auteur]
Scool [Scool]
Degenne, Rémy [Auteur]
Scool [Scool]
Conference title :
35th Conference on Neural Information Processing Systems
City :
Virtual
Country :
France
Start date of the conference :
2021
Publication date :
2021
English keyword(s) :
misspecification
linear bandits
fixed-confidence top-m identification
pure exploration
recommendation systems
multi-armed bandits
linear bandits
fixed-confidence top-m identification
pure exploration
recommendation systems
multi-armed bandits
HAL domain(s) :
Informatique [cs]/Intelligence artificielle [cs.AI]
Informatique [cs]/Apprentissage [cs.LG]
Mathématiques [math]/Statistiques [math.ST]
Informatique [cs]/Apprentissage [cs.LG]
Mathématiques [math]/Statistiques [math.ST]
English abstract : [en]
We study the problem of the identification of m arms with largest means under a fixed error rate δ (fixed-confidence Top-m identification), for misspecified linear bandit models. This problem is motivated by practical ...
Show more >We study the problem of the identification of m arms with largest means under a fixed error rate δ (fixed-confidence Top-m identification), for misspecified linear bandit models. This problem is motivated by practical applications, especially in medicine and recommendation systems, where linear models are popular due to their simplicity and the existence of efficient algorithms, but in which data inevitably deviates from linearity. In this work, we first derive a tractable lower bound on the sample complexity of any δ-correct algorithm for the general Top-m identification problem. We show that knowing the scale of the deviation from linearity is necessary to exploit the structure of the problem. We then describe the first algorithm for this setting, which is both practical and adapts to the amount of misspecification. We derive an upper bound to its sample complexity which confirms this adaptivity and that matches the lower bound when δ → 0. Finally, we evaluate our algorithm on both synthetic and real-world data, showing competitive performance with respect to existing baselines.Show less >
Show more >We study the problem of the identification of m arms with largest means under a fixed error rate δ (fixed-confidence Top-m identification), for misspecified linear bandit models. This problem is motivated by practical applications, especially in medicine and recommendation systems, where linear models are popular due to their simplicity and the existence of efficient algorithms, but in which data inevitably deviates from linearity. In this work, we first derive a tractable lower bound on the sample complexity of any δ-correct algorithm for the general Top-m identification problem. We show that knowing the scale of the deviation from linearity is necessary to exploit the structure of the problem. We then describe the first algorithm for this setting, which is both practical and adapts to the amount of misspecification. We derive an upper bound to its sample complexity which confirms this adaptivity and that matches the lower bound when δ → 0. Finally, we evaluate our algorithm on both synthetic and real-world data, showing competitive performance with respect to existing baselines.Show less >
Language :
Anglais
Peer reviewed article :
Oui
Audience :
Internationale
Popular science :
Non
Comment :
Virtual conference
Collections :
Source :
Files
- https://hal.archives-ouvertes.fr/hal-03409205/document
- Open access
- Access the document
- http://arxiv.org/pdf/2111.01479
- Open access
- Access the document
- https://hal.archives-ouvertes.fr/hal-03409205/document
- Open access
- Access the document
- https://hal.archives-ouvertes.fr/hal-03409205/document
- Open access
- Access the document
- reda2021dealing.pdf
- Open access
- Access the document
- 2111.01479
- Open access
- Access the document
- document
- Open access
- Access the document
- document
- Open access
- Access the document
- reda2021dealing.pdf
- Open access
- Access the document