Minimax Excess Risk of First-Order Methods for Statistical Learning with Data-Dependent Oracles

Scaman, Kevin; Even, Mathieu; Bars, Batiste Le; Massoulié, Laurent

Document type :

Communication dans un congrès avec actes

Title :

Minimax Excess Risk of First-Order Methods for Statistical Learning with Data-Dependent Oracles

Author(s) :

Scaman, Kevin [Auteur]
Département d'informatique - ENS-PSL [DI-ENS]
Apprentissage, graphes et optimisation distribuée [ARGO]
Even, Mathieu [Auteur]
Département d'informatique - ENS-PSL [DI-ENS]
Bars, Batiste Le [Auteur]
Machine Learning in Information Networks [MAGNET]
Massoulié, Laurent [Auteur]
Département d'informatique - ENS-PSL [DI-ENS]
Apprentissage, graphes et optimisation distribuée [ARGO]

Conference title :

AISTATS 2024 - International Conference on Artificial Intelligence and Statistics

City :

Valencia

Country :

Espagne

Start date of the conference :

2024-05-02

Publication date :

2024

HAL domain(s) :

Statistiques [stat]/Machine Learning [stat.ML]
Informatique [cs]/Intelligence artificielle [cs.AI]

English abstract : [en]

In this paper, our aim is to analyse the generalization capabilities of first-order methods for statistical learning in multiple, different yet related, scenarios including supervised learning, transfer learning, robust ...
Show more >In this paper, our aim is to analyse the generalization capabilities of first-order methods for statistical learning in multiple, different yet related, scenarios including supervised learning, transfer learning, robust learning and federated learning. To do so, we provide sharp upper and lower bounds for the minimax excess risk of strongly convex and smooth statistical learning when the gradient is accessed through partial observations given by a data-dependent oracle. This novel class of oracles can query the gradient with any given data distribution, and is thus well suited to scenarios in which the training data distribution does not match the target (or test) distribution. In particular, our upper and lower bounds are proportional to the smallest mean square error achievable by gradient estimators, thus allowing us to easily derive multiple sharp bounds in the aforementioned scenarios using the extensive literature on parameter estimation.Show less >

Language :

Anglais

Peer reviewed article :

Oui

Audience :

Internationale

Popular science :

Non

ANR Project :

Resilient, Decentralized and Privacy-Preserving Machine Learning
PaRis Artificial Intelligence Research InstitutE

Comment :

22 pages, 0 figures

Collections :

Centre de Recherche en Informatique, Signal et Automatique de Lille (CRIStAL) - UMR 9189

Source :

Harvested from HAL

Files

document
Open access
Access the document

scaman24a.pdf
Open access
Access the document

2307.04679
Open access
Access the document

Minimax Excess Risk of First-Order Methods ... BibTeX CSV Excel RIS

Files

Minimax Excess Risk of First-Order Methods ...

BibTeX

CSV

Excel

RIS