Leveraging Data Geometry to Mitigate CSM in Steganalysis

Abecidan, Rony; Itier, Vincent; Boulanger, Jérémie; Bas, Patrick; Pevný, Tomáš

Type de document :

Autre communication scientifique (congrès sans actes - poster - séminaire...): Communication dans un congrès avec actes

Titre :

Leveraging Data Geometry to Mitigate CSM in Steganalysis

Auteur(s) :

Abecidan, Rony [Auteur]
Centre National de la Recherche Scientifique [CNRS]
Université de Lille
Centrale Lille
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Itier, Vincent [Auteur]
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Ecole nationale supérieure Mines-Télécom Lille Douai [IMT Nord Europe]
Boulanger, Jérémie [Auteur]
Université de Lille
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Bas, Patrick [Auteur]

Centrale Lille
Centre National de la Recherche Scientifique [CNRS]
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Pevný, Tomáš [Auteur]
Czech Technical University in Prague [CTU]

Titre de la manifestation scientifique :

IEEE International Workshop on Information Forensics and Security (WIFS 2023)

Ville :

Nuremberg

Pays :

Allemagne

Date de début de la manifestation scientifique :

2023-12-04

Date de publication :

2023-12-04

Mot(s)-clé(s) en anglais :

steganalysis
steganography
forensics
cover source mismatch
domain generalization
machine learning
domain adaptation
data adaptation

Discipline(s) HAL :

Informatique [cs]/Traitement du signal et de l'image [eess.SP]
Informatique [cs]/Intelligence artificielle [cs.AI]
Informatique [cs]/Cryptographie et sécurité [cs.CR]
Informatique [cs]/Vision par ordinateur et reconnaissance de formes [cs.CV]
Informatique [cs]/Multimédia [cs.MM]
Statistiques [stat]/Machine Learning [stat.ML]

Résumé en anglais : [en]

In operational scenarios, steganographers use sets of covers from various sensors and processing pipelines that differ significantly from those used by researchers to train steganalysis models. This leads to an inevitable ...
Lire la suite >In operational scenarios, steganographers use sets of covers from various sensors and processing pipelines that differ significantly from those used by researchers to train steganalysis models. This leads to an inevitable performance gap when dealing with out-of-distribution covers, commonly referred to as Cover Source Mismatch (CSM). In this study, we consider the scenario where test images are processed using the same pipeline. However, knowledge regarding both the labels and the balance between cover and stego is missing. Our objective is to identify a training dataset that allows for maximum generalization to our target. By exploring a grid of processing pipelines fostering CSM, we discovered a geometrical metric based on the chordal distance between subspaces spanned by DCTr features, that exhibits high correlation with operational regret while being not affected by the cover-stego balance. Our contribution lies in the development of a strategy that enables the selection or derivation of customized training datasets, enhancing the overall generalization performance for a given target. Experimental validation highlights that our geometry-based optimization strategy outperforms traditional atomistic methods given reasonable assumptions. Additional resources are available at github.com/RonyAbecidan/LeveragingGeometrytoMitigateCSM.Lire moins >

Langue :

Anglais

Comité de lecture :

Oui

Audience :

Internationale

Vulgarisation :

Non

Collections :

Centre de Recherche en Informatique, Signal et Automatique de Lille (CRIStAL) - UMR 9189

Source :

Harvested from HAL

Fichiers

document
Accès libre
Accéder au document

2023_wifs.pdf
Accès libre
Accéder au document

2310.04479
Accès libre
Accéder au document

Leveraging Data Geometry to Mitigate CSM ... BibTeX CSV Excel RIS

Fichiers

Leveraging Data Geometry to Mitigate CSM ...

BibTeX

CSV

Excel

RIS