Like in a regular PCA, you can also display the plot of eigenvalues. You can choose to plot some summary statistics (minimum, maximum, mean, standard deviation) for quantitative variables, but also for the qualitative ones (counts and frequencies). All missing quantitative values are replaced by the mean of the variable while a "missing" category will be created for qualitative variables. If you have missing values in your dataset, ou can choose to not accept, remove or replace missing data. Several options ranging from the selection of data to the display of results are available such as introducing observation weights, filtering factors based on inertia, adding supplementary observations/variables and customizing charts.įor example, you can choose the best number of components by setting a maximum number or by setting a minimum percentage of variance to be explained by each component. Options for factorial analysis of mixed data in XLSTAT Sometimes, only the first and second components are necessary to explain a large percentage of the variance and so you will be able analyze a two-dimensional projection of what was initially dozens of variables. The observations and the variables end up being represented as points in orthogonal two-dimensional spaces. This is when the dimensional reduction occurs because a small number of components will be enough to explain a high percentage of the variance. The number of principal components is chosen depending on the explained percentage of variance of the model by each component.
RUN FACTORIAL APSIM SERIES
To do so, it does a series of statistical transformations, including calculations of the correlation matrix, eigenvalues and eigenvectors, on a set of qualitative and/or quantitative variables in order to project them on a vector space generated by orthogonal components. Similarly to other factorial analysis methods, PCAmix aims to reduce data dimensionality as well as to identify nearness between variables but also proximity between the observations. This method can be seen as a mixture of two popular methods of factorial analysis: Principal Component Analysis (PCA) which allows to study an observations/quantitative variables table and Multiple Correspondence Analysis (MCA) which allows to study an observations/qualitative variables table. The method used in Xlstat is called PCAmix and was developed by Chavent et al (2014). A few variants of this method have been developed since then (Escofier 1979, Pagès 2004). What is Factorial analysis of mixed data?įactorial analysis of mixed data is a method initially developed by Hill and Smith (1972).