Analyse non asymptotique d'un test séquentiel de détection de rupture et application aux bandits non stationnaires

Besson, Lilian; Kaufmann, Emilie

Document type :

Communication dans un congrès avec actes

Title :

Analyse non asymptotique d'un test séquentiel de détection de rupture et application aux bandits non stationnaires

Author(s) :

Besson, Lilian [Auteur]
CentraleSupélec
Institut d'Électronique et des Technologies du numéRique [IETR]
Sequential Learning [SEQUEL]
Signal, Communication et Electronique Embarquée [SCEE]
SUPELEC-Campus Rennes
Kaufmann, Emilie [Auteur]

Centre National de la Recherche Scientifique [CNRS]
Sequential Learning [SEQUEL]
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]

Conference title :

GRETSI 2019 - XXVIIème Colloque francophone de traitement du signal et des images

City :

Lille

Country :

France

Start date of the conference :

2019-08-26

HAL domain(s) :

Informatique [cs]/Apprentissage [cs.LG]
Mathématiques [math]/Statistiques [math.ST]
Statistiques [stat]/Machine Learning [stat.ML]

French abstract :

Nous étudions un test pour la détection séquentielle de rupture, basé sur le rapport de vraisemblance généralisé (GLR) et qui s'exprime en fonction de l'entropie relative binaire. Il s'applique à la détection de rupture ...
Show more >Nous étudions un test pour la détection séquentielle de rupture, basé sur le rapport de vraisemblance généralisé (GLR) et qui s'exprime en fonction de l'entropie relative binaire. Il s'applique à la détection de rupture sur la moyenne d'une distribution bornée, et nous obtenons un contrôle non-asymptotique de sa probabilité de fausse alarme et de son délai de détection. Nous expliquons son utilisation pour la prise de décision séquentielle en proposant la stratégie de bandit GLR-klUCB, efficace dans des modèles de bandit stationnaires par morceaux.Show less >

English abstract : [en]

We study a strategy for online change-point detection based on generalized likelihood ratios (GLR) and that can be expressed with the binary relative entropy. This test is used to detect a change in the mean of a bounded ...
Show more >We study a strategy for online change-point detection based on generalized likelihood ratios (GLR) and that can be expressed with the binary relative entropy. This test is used to detect a change in the mean of a bounded distribution, and we propose a non-asymptotic control of its false alarm probability and detection delay. We then explain how it can be useful for sequential decision making by proposing the GLR-klUCB bandit strategy, which is efficient in piece-wise stationary multi-armed bandit models.Show less >

Language :

Français

Peer reviewed article :

Oui

Audience :

Nationale

Popular science :

Non

Collections :

Centre de Recherche en Informatique, Signal et Automatique de Lille (CRIStAL) - UMR 9189

Source :

Harvested from HAL

Files

https://hal.inria.fr/hal-02152243/document
Open access
Access the document

https://hal.inria.fr/hal-02152243/document
Open access
Access the document

https://hal.inria.fr/hal-02152243/document
Open access
Access the document

document
Open access
Access the document

GRESTI__2019__Paper__Lilian_Besson__03-19.pdf
Open access
Access the document

document
Open access
Access the document

GRESTI__2019__Paper__Lilian_Besson__03-19.pdf
Open access
Access the document

Analyse non asymptotique d'un test séquentiel ... BibTeX CSV Excel RIS

Files

Analyse non asymptotique d'un test séquentiel ...

BibTeX

CSV

Excel

RIS