The trilinear constraint adapted to solve ...
Document type :
Article dans une revue scientifique: Article original
Permalink :
Title :
The trilinear constraint adapted to solve data with strong patterns of outlying observations or missing values
Author(s) :
Gomez-Sanchez, Adrian [Auteur]
Laboratoire Avancé de Spectroscopie pour les Intéractions la Réactivité et l'Environnement - UMR 8516 [LASIRE]
Alburquerque, I. [Auteur]
Loza-Alvarez, P. [Auteur]
Ruckebusch, Cyril [Auteur]
Laboratoire Avancé de Spectroscopie pour les Intéractions la Réactivité et l'Environnement (LASIRE) - UMR 8516
De Juan, A. [Auteur]
Laboratoire Avancé de Spectroscopie pour les Intéractions la Réactivité et l'Environnement - UMR 8516 [LASIRE]
Alburquerque, I. [Auteur]
Loza-Alvarez, P. [Auteur]
Ruckebusch, Cyril [Auteur]
Laboratoire Avancé de Spectroscopie pour les Intéractions la Réactivité et l'Environnement (LASIRE) - UMR 8516
De Juan, A. [Auteur]
Journal title :
Chemometrics and Intelligent Laboratory Systems
Abbreviated title :
Chemometrics Intell. Lab. Syst.
Volume number :
231
Pages :
-
Publication date :
2022-11-12
ISSN :
0169-7439
English keyword(s) :
Excitation -emission fluorescence
Constraints
Multivariate curve resolution
Missing values
Trilinearity
Constraints
Multivariate curve resolution
Missing values
Trilinearity
HAL domain(s) :
Chimie/Chimie théorique et/ou physique
English abstract : [en]
The possibility to perform trilinear decompositions of data sets has the clear advantage of providing unique solutions. Excitation-emission fluorescence matrices (EEM) are the best known paradigm of chemical measurements ...
Show more >The possibility to perform trilinear decompositions of data sets has the clear advantage of providing unique solutions. Excitation-emission fluorescence matrices (EEM) are the best known paradigm of chemical measurements providing a trilinear structure associated with the configuration of excitation, emission and sample modes. Chemometric tools, such as Parallel Factor Analysis (PARAFAC) and Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS) with trilinear constraint, assist in solving the mixture analysis problem by exploiting the trilinear behavior of the EEM measurements. However, the spectroscopic nature of EEM measurements makes that no emission signal can be recorded below the current excitation wavelength, generating a strong and systematic pattern of outlier (zero observations) in EEM data that challenges the classical analysis by MCR-ALS or PARAFAC. Several approaches have been proposed to deal with this problem, such as the identification of outlying values below the excitation wavelength and, thus, the use of data imputation in PARAFAC, but they show severe limitations when systematic outlying data patterns occur. In this paper, we propose a new implementation of the trilinear constraint in MCR-ALS algorithm to cope with EEM measurements where a strongly patterned of outlying data is present. This approach preserves the trilinear property and does not require any data imputation step to replace the outlying observations. Its performance is tested on simulated data, controlled pharmaceutical mixtures and hyperspectral images of a plant tissue (HSI). It should be noted that the approach proposed is applicable to EEM data, where a systematic pattern of outlying observations exist, but can be generalized to the treatment of any trilinear data set with a strong pattern of missing values.Show less >
Show more >The possibility to perform trilinear decompositions of data sets has the clear advantage of providing unique solutions. Excitation-emission fluorescence matrices (EEM) are the best known paradigm of chemical measurements providing a trilinear structure associated with the configuration of excitation, emission and sample modes. Chemometric tools, such as Parallel Factor Analysis (PARAFAC) and Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS) with trilinear constraint, assist in solving the mixture analysis problem by exploiting the trilinear behavior of the EEM measurements. However, the spectroscopic nature of EEM measurements makes that no emission signal can be recorded below the current excitation wavelength, generating a strong and systematic pattern of outlier (zero observations) in EEM data that challenges the classical analysis by MCR-ALS or PARAFAC. Several approaches have been proposed to deal with this problem, such as the identification of outlying values below the excitation wavelength and, thus, the use of data imputation in PARAFAC, but they show severe limitations when systematic outlying data patterns occur. In this paper, we propose a new implementation of the trilinear constraint in MCR-ALS algorithm to cope with EEM measurements where a strongly patterned of outlying data is present. This approach preserves the trilinear property and does not require any data imputation step to replace the outlying observations. Its performance is tested on simulated data, controlled pharmaceutical mixtures and hyperspectral images of a plant tissue (HSI). It should be noted that the approach proposed is applicable to EEM data, where a systematic pattern of outlying observations exist, but can be generalized to the treatment of any trilinear data set with a strong pattern of missing values.Show less >
Language :
Anglais
Peer reviewed article :
Oui
Audience :
Internationale
Popular science :
Non
Administrative institution(s) :
Université de Lille
CNRS
CNRS
Collections :
Submission date :
2024-02-28T23:45:10Z
2024-03-20T09:25:45Z
2024-03-20T09:25:45Z
Files
- 1-s2.0-S0169743922002039-main.pdf
- Version éditeur
- Open access
- Access the document