Relationships between PCA and PLS-regression

Autores
Godoy, José Luis; Vega, Jorge Ruben; Marchetti, Jacinto Luis
Año de publicación
2014
Idioma
inglés
Tipo de recurso
artículo
Estado
versión publicada
Descripción
This work aims at comparing several features of Principal Component Analysis (PCA) and Partial Least Squares Regression (PLSR), as techniques typically utilized for modeling, output prediction, and monitoring of multivariate processes. First, geometric properties of the decomposition induced by PLSR are described in relation to the PCA of the separated input and output data (X-PCA and Y-PCA, respectively). Then, analogies between the models derived with PLSR and YX-PCA (i.e., PCA of the joint input-output variables) are presented; and regarding to process monitoring applications, the specific PLSR and YX-PCA fault detection indices are compared. Numerical examples are used to illustrate the relationships between latent models, output predictive models, and fault detection indices. The three alternative approaches (PLSR, YX-PCA and Y-PCA plus X-PCA) are compared with regard to their use for statistical modeling. In particular, a case study is simulated and the results are used for enhancing the comprehension of the PLSR properties and for evaluating the discriminatory capacity of the fault detection indices based on the PLSR and YX-PCA modeling alternatives. Some recommendations are given in order to choose the more appropriate approach for a specific application: 1) PLSR and YX-PCA have similar capacity for fault detection, but PLSR is recommended for process monitoring because present a better diagnosing capability; 2) PLSR is more reliable for output prediction purposes (e.g., for soft sensor development); and 3) YX-PCA is recommended for the analysis of latent patterns imbedded in datasets.
Fil: Godoy, José Luis. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Santa Fe. Instituto de Desarrollo Tecnológico Para la Industria Química (i); Argentina. Universidad Tecnológica Nacional. Facultad Regional Paraná; Argentina
Fil: Vega, Jorge Ruben. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Santa Fe. Instituto de Desarrollo Tecnológico Para la Industria Química (i); Argentina. Universidad Tecnologica Nacional; Argentina
Fil: Marchetti, Jacinto Luis. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Santa Fe. Instituto de Desarrollo Tecnológico Para la Industria Química (i); Argentina
Materia
Pls-Regression
Pca
Latent Models
Prediction Models
Fault Detection Indices
Nivel de accesibilidad
acceso abierto
Condiciones de uso
https://creativecommons.org/licenses/by-nc-nd/2.5/ar/
Repositorio
CONICET Digital (CONICET)
Institución
Consejo Nacional de Investigaciones Científicas y Técnicas
OAI Identificador
oai:ri.conicet.gov.ar:11336/9285

id CONICETDig_b0ceda0d974dfe063d8c53941c3d15b3
oai_identifier_str oai:ri.conicet.gov.ar:11336/9285
network_acronym_str CONICETDig
repository_id_str 3498
network_name_str CONICET Digital (CONICET)
spelling Relationships between PCA and PLS-regressionGodoy, José LuisVega, Jorge RubenMarchetti, Jacinto LuisPls-RegressionPcaLatent ModelsPrediction ModelsFault Detection Indiceshttps://purl.org/becyt/ford/2.11https://purl.org/becyt/ford/2This work aims at comparing several features of Principal Component Analysis (PCA) and Partial Least Squares Regression (PLSR), as techniques typically utilized for modeling, output prediction, and monitoring of multivariate processes. First, geometric properties of the decomposition induced by PLSR are described in relation to the PCA of the separated input and output data (X-PCA and Y-PCA, respectively). Then, analogies between the models derived with PLSR and YX-PCA (i.e., PCA of the joint input-output variables) are presented; and regarding to process monitoring applications, the specific PLSR and YX-PCA fault detection indices are compared. Numerical examples are used to illustrate the relationships between latent models, output predictive models, and fault detection indices. The three alternative approaches (PLSR, YX-PCA and Y-PCA plus X-PCA) are compared with regard to their use for statistical modeling. In particular, a case study is simulated and the results are used for enhancing the comprehension of the PLSR properties and for evaluating the discriminatory capacity of the fault detection indices based on the PLSR and YX-PCA modeling alternatives. Some recommendations are given in order to choose the more appropriate approach for a specific application: 1) PLSR and YX-PCA have similar capacity for fault detection, but PLSR is recommended for process monitoring because present a better diagnosing capability; 2) PLSR is more reliable for output prediction purposes (e.g., for soft sensor development); and 3) YX-PCA is recommended for the analysis of latent patterns imbedded in datasets.Fil: Godoy, José Luis. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Santa Fe. Instituto de Desarrollo Tecnológico Para la Industria Química (i); Argentina. Universidad Tecnológica Nacional. Facultad Regional Paraná; ArgentinaFil: Vega, Jorge Ruben. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Santa Fe. Instituto de Desarrollo Tecnológico Para la Industria Química (i); Argentina. Universidad Tecnologica Nacional; ArgentinaFil: Marchetti, Jacinto Luis. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Santa Fe. Instituto de Desarrollo Tecnológico Para la Industria Química (i); ArgentinaElsevier Science2014-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionhttp://purl.org/coar/resource_type/c_6501info:ar-repo/semantics/articuloapplication/pdfapplication/pdfapplication/pdfhttp://hdl.handle.net/11336/9285Godoy, José Luis; Vega, Jorge Ruben; Marchetti, Jacinto Luis; Relationships between PCA and PLS-regression; Elsevier Science; Chemometrics And Intelligent Laboratory Systems; 130; 1-2014; 182-1910169-7439enginfo:eu-repo/semantics/altIdentifier/doi/10.1016/j.chemolab.2013.11.008info:eu-repo/semantics/altIdentifier/url/http://www.sciencedirect.com/science/article/pii/S0169743913002189info:eu-repo/semantics/openAccesshttps://creativecommons.org/licenses/by-nc-nd/2.5/ar/reponame:CONICET Digital (CONICET)instname:Consejo Nacional de Investigaciones Científicas y Técnicas2025-09-03T09:56:45Zoai:ri.conicet.gov.ar:11336/9285instacron:CONICETInstitucionalhttp://ri.conicet.gov.ar/Organismo científico-tecnológicoNo correspondehttp://ri.conicet.gov.ar/oai/requestdasensio@conicet.gov.ar; lcarlino@conicet.gov.arArgentinaNo correspondeNo correspondeNo correspondeopendoar:34982025-09-03 09:56:46.005CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicasfalse
dc.title.none.fl_str_mv Relationships between PCA and PLS-regression
title Relationships between PCA and PLS-regression
spellingShingle Relationships between PCA and PLS-regression
Godoy, José Luis
Pls-Regression
Pca
Latent Models
Prediction Models
Fault Detection Indices
title_short Relationships between PCA and PLS-regression
title_full Relationships between PCA and PLS-regression
title_fullStr Relationships between PCA and PLS-regression
title_full_unstemmed Relationships between PCA and PLS-regression
title_sort Relationships between PCA and PLS-regression
dc.creator.none.fl_str_mv Godoy, José Luis
Vega, Jorge Ruben
Marchetti, Jacinto Luis
author Godoy, José Luis
author_facet Godoy, José Luis
Vega, Jorge Ruben
Marchetti, Jacinto Luis
author_role author
author2 Vega, Jorge Ruben
Marchetti, Jacinto Luis
author2_role author
author
dc.subject.none.fl_str_mv Pls-Regression
Pca
Latent Models
Prediction Models
Fault Detection Indices
topic Pls-Regression
Pca
Latent Models
Prediction Models
Fault Detection Indices
purl_subject.fl_str_mv https://purl.org/becyt/ford/2.11
https://purl.org/becyt/ford/2
dc.description.none.fl_txt_mv This work aims at comparing several features of Principal Component Analysis (PCA) and Partial Least Squares Regression (PLSR), as techniques typically utilized for modeling, output prediction, and monitoring of multivariate processes. First, geometric properties of the decomposition induced by PLSR are described in relation to the PCA of the separated input and output data (X-PCA and Y-PCA, respectively). Then, analogies between the models derived with PLSR and YX-PCA (i.e., PCA of the joint input-output variables) are presented; and regarding to process monitoring applications, the specific PLSR and YX-PCA fault detection indices are compared. Numerical examples are used to illustrate the relationships between latent models, output predictive models, and fault detection indices. The three alternative approaches (PLSR, YX-PCA and Y-PCA plus X-PCA) are compared with regard to their use for statistical modeling. In particular, a case study is simulated and the results are used for enhancing the comprehension of the PLSR properties and for evaluating the discriminatory capacity of the fault detection indices based on the PLSR and YX-PCA modeling alternatives. Some recommendations are given in order to choose the more appropriate approach for a specific application: 1) PLSR and YX-PCA have similar capacity for fault detection, but PLSR is recommended for process monitoring because present a better diagnosing capability; 2) PLSR is more reliable for output prediction purposes (e.g., for soft sensor development); and 3) YX-PCA is recommended for the analysis of latent patterns imbedded in datasets.
Fil: Godoy, José Luis. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Santa Fe. Instituto de Desarrollo Tecnológico Para la Industria Química (i); Argentina. Universidad Tecnológica Nacional. Facultad Regional Paraná; Argentina
Fil: Vega, Jorge Ruben. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Santa Fe. Instituto de Desarrollo Tecnológico Para la Industria Química (i); Argentina. Universidad Tecnologica Nacional; Argentina
Fil: Marchetti, Jacinto Luis. Consejo Nacional de Investigaciones Científicas y Técnicas. Centro Científico Tecnológico Santa Fe. Instituto de Desarrollo Tecnológico Para la Industria Química (i); Argentina
description This work aims at comparing several features of Principal Component Analysis (PCA) and Partial Least Squares Regression (PLSR), as techniques typically utilized for modeling, output prediction, and monitoring of multivariate processes. First, geometric properties of the decomposition induced by PLSR are described in relation to the PCA of the separated input and output data (X-PCA and Y-PCA, respectively). Then, analogies between the models derived with PLSR and YX-PCA (i.e., PCA of the joint input-output variables) are presented; and regarding to process monitoring applications, the specific PLSR and YX-PCA fault detection indices are compared. Numerical examples are used to illustrate the relationships between latent models, output predictive models, and fault detection indices. The three alternative approaches (PLSR, YX-PCA and Y-PCA plus X-PCA) are compared with regard to their use for statistical modeling. In particular, a case study is simulated and the results are used for enhancing the comprehension of the PLSR properties and for evaluating the discriminatory capacity of the fault detection indices based on the PLSR and YX-PCA modeling alternatives. Some recommendations are given in order to choose the more appropriate approach for a specific application: 1) PLSR and YX-PCA have similar capacity for fault detection, but PLSR is recommended for process monitoring because present a better diagnosing capability; 2) PLSR is more reliable for output prediction purposes (e.g., for soft sensor development); and 3) YX-PCA is recommended for the analysis of latent patterns imbedded in datasets.
publishDate 2014
dc.date.none.fl_str_mv 2014-01
dc.type.none.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
http://purl.org/coar/resource_type/c_6501
info:ar-repo/semantics/articulo
format article
status_str publishedVersion
dc.identifier.none.fl_str_mv http://hdl.handle.net/11336/9285
Godoy, José Luis; Vega, Jorge Ruben; Marchetti, Jacinto Luis; Relationships between PCA and PLS-regression; Elsevier Science; Chemometrics And Intelligent Laboratory Systems; 130; 1-2014; 182-191
0169-7439
url http://hdl.handle.net/11336/9285
identifier_str_mv Godoy, José Luis; Vega, Jorge Ruben; Marchetti, Jacinto Luis; Relationships between PCA and PLS-regression; Elsevier Science; Chemometrics And Intelligent Laboratory Systems; 130; 1-2014; 182-191
0169-7439
dc.language.none.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv info:eu-repo/semantics/altIdentifier/doi/10.1016/j.chemolab.2013.11.008
info:eu-repo/semantics/altIdentifier/url/http://www.sciencedirect.com/science/article/pii/S0169743913002189
dc.rights.none.fl_str_mv info:eu-repo/semantics/openAccess
https://creativecommons.org/licenses/by-nc-nd/2.5/ar/
eu_rights_str_mv openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-nd/2.5/ar/
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv Elsevier Science
publisher.none.fl_str_mv Elsevier Science
dc.source.none.fl_str_mv reponame:CONICET Digital (CONICET)
instname:Consejo Nacional de Investigaciones Científicas y Técnicas
reponame_str CONICET Digital (CONICET)
collection CONICET Digital (CONICET)
instname_str Consejo Nacional de Investigaciones Científicas y Técnicas
repository.name.fl_str_mv CONICET Digital (CONICET) - Consejo Nacional de Investigaciones Científicas y Técnicas
repository.mail.fl_str_mv dasensio@conicet.gov.ar; lcarlino@conicet.gov.ar
_version_ 1842269422180368384
score 13.13397