Partial least-squares regression on design variables as an alternative to analysis of variance

https://doi.org/10.1016/S0003-2670(00)86303-5Get rights and content

Abstract

The partial least-squares (PLS) algorithm has become popular for explorative multivariate data analysis and for multivariate calibration. The same PLS algorithm can also be used for confirmatory data analysis. The discussion is limited to analysis of a single response variable. A close correspondence of PLS1 regression to classical analysis of variance (ANOVA) is demonstrated. The design of an experiment is described in terms of discrete design variables for main effects and simple interactions (dummy variables). These are used as regressors X = (x1, x2,…,) for modelling the response variable of the experiment, y. As in conventional use of PLS1 regression, the algorithm gives a concentrated model or diagram of the most important, y-relevant variability types in the X-data. In the present case, this gives the combination of design variables that models the variations in y. A simple plot of the resulting factor loadings immediately reveals the important design variables. Statistical tests and confidence regions in the PLS solution give additional safeguards against interpretation of spurious effects. The method is applied to two data sets. One concerns assessment of personal preference for blackcurrent juice, studied in a 25 factorial experiment; these data are also studied with missing values and as fractional factorials. The other ceoncers spectrophotometric absorbance-based colour assessments of pigment in strawberry jam in a 3-factor design with 2, 2 and 3 levels in the respective factors.

References (20)

  • H. Martens et al.

    Trends Anal. Chem.

    (1984)
  • T. Naes et al.

    Trends Anal. Chem.

    (1984)
  • H. Wold
  • S. Wold et al.
    (1983)
  • S. Wold et al.
  • H. Martens et al.
  • I. Helland, On the Structure of Partial Least Squares Regression, Report No. 21/86, Dept. of Mathematics and...
  • H. Martens et al.
  • T. Naes et al.

    Commun. Statist.

    (1985)
  • T. Naes et al.

    Appl. Statist.

    (1986)
There are more references available in the full text version of this article.

Cited by (21)

  • Optimising drone flight planning for measuring horticultural tree crop structure

    2020, ISPRS Journal of Photogrammetry and Remote Sensing
    Citation Excerpt :

    Also, there was multi-collinearity between some variables, e.g. between the average pitch angle and flying speed, and between pixel GSD, average image area, and image forward overlap. PLS is better equipped to analyse variables with multi-collinearity by projecting both the independent and dependent datasets to principal component spaces, while still maximising the covariance between independent and dependant datasets at the same time (Martens et al., 1986; Wold et al., 1984). We used both the original five-variable set (variable set 1) and the extended eight-variable set (variable set 2) to run the PLS regression against each quality indicator individually (Table 2) and generated a prediction model for each variable set.

  • Analysis of designed experiments by stabilised PLS regression and jack-knifing

    2001, Chemometrics and Intelligent Laboratory Systems
  • PARAFAC. Tutorial and applications

    1997, Chemometrics and Intelligent Laboratory Systems
View all citing articles on Scopus
1

Present address: Norwegian Computing Center, Blindern, Oslo 3 (Norway).

2

Present address: Instituto Agroquimica y Tecnologia de Alimentos, 46010 Valencia (Spain).

View full text