DOI: 10.14718/revfinanzpolitecon.v13.n2.2021.9 | Research article
Rogelio Ladrón de Guevara Cortés*
Salvador Torra Porras**
Enric Monte Moreno***
* Ph.D. in Business Studies. Professor-Researcher. Institute of Research and Graduate Studies in Administrative Sciences (IIESCA). Universidad Veracruzana. Mexico.
roladron@uv.mx
0000-0001-9365-2080
** Ph.D. in Economic Sciences. Professor-Researcher. Department of Econometrics, Statistics, and Applied Economy. Faculty of Economics and Business. University of Barcelona. Spain.
storra@ub.edu
0000-0002-8786-8800
*** Ph.D. in Digital Signal Processing. Professor-Researcher. Department of Signal Theory and Communications. Barcelona School of Telecommunications Engineering. Polytechnic University of Catalonia. Spain.
enric.monte@upc.edu
0000-0002-4907-0494
Received: November 18, 2020
Evaluated: March 31, 2021
Approved: July 30, 2021
Abstract
This paper compares four dimension reduction or feature extraction techniques -namely, Principal Component Analysis, Factor Analysis, Independent Component Analysis, and Neural Networks Principal Component Analysis- used for extracting the underlying systematic risk factors driving the returns on equities of the Mexican Stock Exchange, under a statistical approach to the Arbitrage Pricing Theory. The research is carried out from two different perspectives. First, the techniques are evaluated from a theoretical and matrix standpoint, drawing a parallelism among their particular mixing and demixing processes, as well as the attributes of the factors extracted by each method. Second, an empirical study measures their level of accuracy in the reconstruction of the original variables. In general, the results of this research point to Neural Networks Principal Component Analysis as the best technique from both the theoretical and empirical standpoints.
Keywords: Neural Networks Principal Component Analysis, Independent Component Analysis, Factor Analysis, Principal Component Analysis, Mexican Stock Exchange.
JEL Classification: G12, G15, C45
Resumen
Este artículo compara las técnicas de reducción de dimensionalidad o de extracción de características: Análisis de Componentes Principales, Análisis Factorial, Análisis de Componentes Independientes y Análisis de Componentes Principales basado en Redes Neuronales, las cuales son usadas para extraer los factores de riesgo sistemático subyacentes que generan los rendimientos de las acciones de la Bolsa Mexicana de Valores, bajo un enfoque estadístico de la Teoría de Valoración por Arbitraje. Llevamos a cabo nuestra investigación de acuerdo a dos diferentes perspectivas. Primero, las evaluamos desde una perspectiva teórica y matricial, haciendo un paralelismo entre los particulares procesos de mezcla y separación de cada método. En segundo lugar, efectuamos un estudio empírico con el fin de medir el nivel de precisión en la reconstrucción de las variables originales.
Palabras clave: Análisis de Componentes Principales basado en Redes Neuronales, Análisis de Componentes Independientes, Análisis Factorial, Análisis de Componentes Principales, Bolsa Mexicana de Valores.
INTRODUCTION
Classic multivariate dimension reduction techniques have been widely used to extract the underlying factors from large sets of data, or to build synthetic indicators, in fields that range from the natural to the social sciences, such as Physics, Chemistry, Biology, Medicine, Astronomy, Psychology, Education, Management, and Marketing. There is a large amount of literature focused on the application, mainly, of Principal Component Analysis and Factor Analysis in different fields of knowledge.1
In previous research, different dimension reduction or feature extraction techniques have been presented for extracting the underlying systematic risk factors driving the returns on equities under a statistical approach to the Arbitrage Pricing Theory (Ross, 1976). This approach assumes a priori neither the number nor the nature of the systematic risk factors, nor the sensitivities to them; therefore, in this paper, both are estimated by using extraction and econometric techniques in two different stages. The effort to extract underlying factors with better characteristics led us to advance from classical multivariate techniques, such as Principal Component Analysis (PCA) and Factor Analysis (FA), to more advanced and sophisticated techniques -usually applied in fields like Engineering, Telecommunications, Astronomy, Biochemistry, Bioinformatics, Artificial Intelligence, and Robotics- such as Independent Component Analysis (ICA) and Neural Networks Principal Component Analysis (NNPCA).
Although the main objective of each technique is similar -to reduce the dimension or to extract the main features from a set of variables-, they differ in nature, assumptions, principles, and internal algorithms; this makes it difficult and impractical to compare their results directly, i.e., the matrices used in the processes of extraction and generation, and the underlying factors extracted. To solve this problem, the main objective of this paper is to propose a methodology to compare the four techniques, based on the degree of accuracy in the reconstruction of the observed variables using the underlying systematic factors extracted by each technique.
To accomplish this objective, first, a theoretical and matrix comparison among the techniques is proposed, remarking the parallelism among their particular mixing and demixing processes. Then, this paper provides empirical evidence of their reconstruction accuracy using a set of measures usually implemented for forecasting evaluation, such as the Mean Absolute Error (MAE), the Mean Absolute Percentage Error (MAPE), the Root Mean Square Error (RMSE), Theil's U statistic (U-Theil), the Confusion Matrix (CM), the Confusion Rate (CR), the chi-squared contrast of independence (χ²), and Pesaran & Timmermann's directional accuracy statistic (DA).
Comparative studies of all four techniques are scarce in the literature, so the main contribution of this paper is precisely to fill this gap, providing a theoretical and empirical comparative study among PCA, FA, ICA, and NNPCA in the field of finance, by way of a matrix parallelism among the four techniques and the analysis of the reconstruction accuracy of the observed returns on equities, using the different components or factors extracted through each technique.
In addition, this study contributes by providing evidence in two particularly relevant contexts. On the one hand, the financial market studied, which in this case is an important emerging Latin American equity market -the Mexican Stock Exchange- for which studies applying these kinds of techniques are also uncommon. On the other hand, the period analyzed corresponds to the years preceding the last recognized financial bubble: the subprime crisis. In the current situation, where it is very likely that another financial and economic crisis will strike, derived from the effects of the COVID-19 pandemic, it is necessary to test the performance of these techniques in a similar period to gain explanatory insights into these kinds of situations.
Although these types of statistical and computational techniques have both explanatory and forecasting attributes, the aim and scope of this paper are focused only on their explanatory power. The forecasting properties are out of the scope of this paper, since they are considered in other, complementary research. Likewise, the performance of these techniques during crisis and post-crisis periods is being studied in other extensions of this research.
The main results of this research reveal, on the one hand, that from a theoretical perspective NNPCA seems to offer the most suitable attributes for the underlying factors in a statistical context of the Arbitrage Pricing Theory. On the other hand, from an empirical standpoint, although there is no clear supremacy of any of the four techniques, evidence points to NNPCA as the one with the best performance in the reconstruction of the returns on equities.
The structure of this paper is as follows: section 2 reviews the literature focused on comparative studies of these techniques; section 3 proposes the matrix parallelism among the techniques used, explaining the attributes of the factors extracted with each one of them; section 4 describes the methodology carried out in the study, and section 5 shows the results of the empirical comparative study. Finally, section 6 draws some conclusions and section 7 presents the references.
REVIEW OF THE LITERATURE
To the best of our knowledge, only Scholz (2006a) uses and compares three of the aforementioned techniques in a single study, i.e., PCA, ICA, and NNPCA, applying them to molecular data in biochemistry to extract biologically meaningful components. The author explains the benefits and drawbacks of each type of analysis for understanding biological issues, concluding that, depending on the characteristics of the data and the purpose of the research, one specific technique is more suitable than the others. For the sake of saving space, and considering that one of the main contributions of this paper is the application of these four techniques in the financial context, comparative studies in fields different from Finance and Economics are out of the scope of this research. Nevertheless, the interested reader can easily find references to comparative studies between some of these techniques in the literature of the Natural and Social Sciences.
Some recent and relevant studies focused on comparisons of these types of techniques are, for example, Corominas, Garrido-Baserba, Villez, Olsson, Cortés & Poch (2018), who describe the state-of-the-art computer-based techniques for data analysis in the context of wastewater treatment plants; and Ayesha, Hanif & Talib (2020), who present the state-of-the-art dimensionality reduction techniques for high-dimensional data and their suitability for different application areas such as biomedicine, audio, image and video, genetics, signal processing, pattern recognition, etc. Comparative studies in Economics and Finance are not very frequent in the literature, and those available have dealt with only two of these techniques in the same study. Some relevant references in these fields are the following.2
Regarding PCA and FA, Ince & Trafalis (2007) use the components and factors extracted through PCA and FA as the input variables for two different forecasting models to compare their performance for stock price prediction on the NASDAQ, finding that the factors extracted through FA performed better than the components extracted through PCA. More recently, Ibraimova (2019) compares the performance of PCA and FA as dimensionality reduction techniques, using the factors extracted by each technique as inputs in a machine-learning model that tried to predict financial distress in companies; the best predictions were those based on the PCA extraction.
Concerning ICA, Bellini & Salinelli (2003) find that immunization strategies against movements of the US Treasury spot rate curve based on ICA perform better than those based on PCA. Lesch, Caille & Lowe (1999) apply PCA and ICA to perform feature extraction from currency exchange data of the British Pound against the US Dollar, showing that both techniques are capable of detecting deterministic structure in the data, but that the independent components are much closer in their morphology to the signals.
Back & Weigend (1997) apply ICA and PCA to the Tokyo Stock Exchange, showing that while the reconstruction of the observed stock prices derived from the extracted independent components is outstanding, the reproduction resulting from the principal components is not. Yip & Xu (2000) apply ICA and PCA to stocks from the S&P 500, finding that ICA gives a better indication of the underlying structure of the US stock market, in terms of the linear relationship between the components extracted through both techniques and some predefined macroeconomic factors.
Rojas & Moody (2001) compare ICA and PCA by investigating the term structure and the interactions between the returns of iShares MSCI Index Funds and the returns of the S&P Depositary Receipts Index, demonstrating that ICA has more influence on the average mutual information. Lizieri, Satchell & Zhang (2007) compare the capability of the ICA and PCA components to capture the kurtosis of Real Estate Investment Trusts (REIT) in the USA, showing that ICA outperforms PCA. Nevertheless, Wei, Jin & Jin (2005) find that, although both techniques produce similar results, PCA outperforms ICA in the reconstruction of mutual funds in the Chinese financial market.
On the other hand, Coli, Di Nisio & Ippoliti (2005), in an application of ICA and PCA to a stock portfolio of the Milan Stock Exchange, find that, although the principal components present a minimum reprojection error when they are used to reconstruct the data, the independent components make it easier to distinguish the trend and the cyclical components. More recently, Sayah (2016) compares PCA-GARCH versus ICA-GARCH models in the context of the Basel III sensitivity-based approach for the interest rate risk in the trading book of banks, finding that, in general, the ICA model produced more restrictive results in the Value at Risk (VaR) computation.
Regarding NNPCA, Weigang, Rodrigues, Lihua & Yukuhiro (2007) compare NNPCA and PCA in terms of their dimensional reduction capability to extract the main feature explaining the trends of withdrawals from an employment time guarantee fund, showing that NNPCA is more suitable than PCA for dimension reduction in this context.3 On the other hand, Liu & Wang (2011) integrate ICA and PCA with Neural Networks to predict the Chinese stock market, finding suitable results. In addition, interesting surveys of related intelligent computational techniques applied to financial markets can be found in Cavalcante, Brasileiro, Souza, Nobrega & Oliveira (2016), and of machine learning techniques applied to financial market prediction in Miranda-Henrique, Amorim-Sobreiro & Kimura (2019).
On the other hand, there is a precedent work that has undertaken a systematic exploration of dimensionality reduction methods, such as Anowar, Sadaoui & Selim (2021). However, the current work aims to find basic factors that can explain the underlying risk in the Mexican market. This is why a selection of dimensionality reduction techniques has been made that allows a local approximation to explain the risk; in particular, not all dimensionality reduction techniques allow for such explanations. For example, following Scikit-Learn (2021), Multi-dimensional Scaling (MDS) is based on finding a 2D distribution that maintains the metric relationships of the original space, but it loses the proportionality relationship needed to explain risk, as well as the dynamic aspect that is of interest in this article.
Finally, techniques such as ISOMAP and Local Linear Embedding (LLE) are based on locally generating the manifold using the observed training points. These are techniques related to differential geometry, which generate a graph that allows surfaces to be modelled in a higher-dimensional space, without crossings or mixtures of regions in different parts of the surface. Although this type of technique may be of interest for understanding neighborhood relationships in the trajectories of values, it does not allow expressing risk through a linear relationship. Something similar happens with t-distributed stochastic neighbor embedding (t-SNE); in this case, it is an embedding, not a projection, so a mesh relating the input points is found from a similarity measure based on the proximity of probability distributions, using the Kullback-Leibler divergence. Although this method allows a reliable low-dimensional representation of that mesh, it does not allow us to capture the dynamic risk relationships through a matrix, which is the objective of our work.
MATRIX PARALLELISM AMONG PCA, FA, ICA, AND NNPCA
The four techniques used in this study, PCA, FA, ICA, and NNPCA,4 can be classified as latent variable analysis, dimension reduction, or feature extraction techniques, whose main objective is to obtain some new underlying synthetic variables - from a set of observed data - capable of reproducing the behavior of the original data, in this context, the returns on equities. Strictly speaking, a latent variable analysis technique tries to infer some unobservable artificial variables from a set of observable ones by using some mathematical model. On the other hand, the objective of a dimension reduction technique is only to reduce the dimensionality of the problem by selecting a smaller number of new artificial variables, created by the combination of the original ones via some mathematical or geometric transformation of the observed variables. Finally, a feature extraction technique seeks to ensure that the new variables extracted represent the most relevant and meaningful components or factors resulting from specific combinations of the observed ones.
Nevertheless, the purpose of this paper is to obtain a set of factors -hidden in the observed variables- to explain, in the best manner, why the returns on equities in the sample behave as they do. Consequently, any of the three approaches to classifying these techniques fits properly as a method for extracting the main factors explaining the behavior of the returns on the equities of the sample. The four classes of analysis include two different processes: the extraction of the underlying factors and the generation of the original variables. Table 1 presents the matrix parallelism among the extraction and generation processes employed in each technique and the main attributes of their extracted components or factors.
Table 1. Matrix parallelism among techniques to extract the underlying factors of systematic risk.
Source: Author's elaboration.
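In schematic terms, and under the matrix notation used in the following paragraph (a sketch of the parallelism, not a verbatim reproduction of Table 1; the NNPCA weight matrices and activation function are notational assumptions), the two processes can be summarized as:

$$\begin{array}{lll}
\text{PCA:} & Z = XA, & \hat{X} = ZA' \\
\text{FA:} & F = XC, & \hat{X} = 1\mu + F\Lambda' \\
\text{ICA:} & S = XW, & \hat{X} = SA, \quad A \approx W^{+} \\
\text{NNPCA:} & F = \sigma(XW_1)\,W_2, & \hat{X} = \sigma(FW_3)\,W_4
\end{array}$$

where $X$ denotes the matrix of observed returns; $Z$, $F$, and $S$, the matrices of extracted components or factors; $A$, $C$, and $W$, the extraction (demixing) weights; $A'$ and $\Lambda'$, the generation (mixing) loadings; $W^{+}$, a (pseudo-)inverse of $W$; and $\sigma(\cdot)$, the non-linear activation of the NNPCA autoencoder.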
The same kind of analogy can be made to include PCA and FA in the former parallelism as well, taking the matrices of weights in the extraction process (A and C), the matrices of the extracted components or factors (Z and F), and the factor loading matrices in the generation process (A' and Λ'), respectively. It is important to remark that, although there is matrix parallelism among the elements of these techniques, in this context, the direct comparison of their values is not homogeneous among all of them; e.g., the generation processes in PCA, FA, and ICA include only a linear mixing of the original data matrix and the demixing matrices; however, in NNPCA the process includes a non-linear combination of two matrices of weights and the original data matrix; thus, this technique does not have a single demixing matrix which, when multiplied directly by the data matrix, might produce the extracted factors. A similar situation occurs with the generation process, so it is necessary to use other methods to compare the four techniques, such as the reconstruction accuracy of the observed variables.
On the other hand, strictly speaking, FA should not be compared directly with the rest of these techniques, since FA includes an independent term corresponding to the specific factors (U), which is not considered in the rest of them.5 FA should be compared with the equivalent versions of the other techniques that also consider an independent term in the model, e.g., the Noisy ICA (N-ICA) or Independent Factor Analysis (IFA), and the Non-linear Factor Analysis (NLFA). Nevertheless, PCA and FA have always been compared and in some cases even confused, since PCA is sometimes incorrectly considered as a method of estimation within FA; thus, FA results were included in this comparison, too. The next step in further research will be to compare FA with the equivalent versions of the independent and non-linear models.
Finally, in the financial context, the most important differences among the four techniques are perhaps the attributes of the components or factors extracted, because they imply a progression from merely linearly uncorrelated components in PCA, to linearly uncorrelated common factors in FA, then to statistically independent components in ICA, and lastly to non-linearly uncorrelated components in NNPCA. From a theoretical standpoint, this progression implies uncovering a more realistic latent systematic risk factor structure as one advances to more sophisticated techniques. The nature of the components or factors extracted through each technique derives mainly from the following conditions. First, while the orthogonal components extracted by using PCA explain the total amount of variance in the observed variables, the orthogonal factors produced by FA explain only the amount of variance attributable to the common factors, i.e., the covariance among the variables.
Nevertheless, both PCA and FA consider only the absence of linear correlation up to the second moment; ICA, on the other hand, imposes the absence of correlation at higher moments as well, which produces not only linearly uncorrelated components but also statistically independent ones. Finally, while the three former techniques only consider a linear mixing in the extraction and generation processes, NNPCA includes a non-linear transformation in both processes, which generates not only linearly uncorrelated components but also non-linearly uncorrelated ones.
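In compact form (a standard statement of these conditions, added here for clarity), PCA and FA impose only second-moment decorrelation on the extracted components,

$$\mathrm{Cov}(z_i, z_j) = 0, \quad i \neq j,$$

whereas ICA requires full statistical independence,

$$p(s_i, s_j) = p(s_i)\,p(s_j) \quad \Rightarrow \quad E[g(s_i)\,h(s_j)] = E[g(s_i)]\,E[h(s_j)]$$

for all (integrable) functions $g$ and $h$, i.e., absence of correlation at every moment and not only at the second one. NNPCA, in turn, also removes correlations that appear only after a non-linear transformation of the data, within the reach of the network's activation functions.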
METHODOLOGY
The data.
The data used in the empirical study correspond to stocks of the Price and Quotation Index (IPC) of the Mexican Stock Exchange (BMV); Table 2 presents the entire sample used in this study. Both the period analyzed and the shares selected respond to the following criteria: 1) the sample used in the cited former studies, which allows this comparative study of the results produced by each of the four techniques; 2) the interest in a worldwide recognized pre-crisis period, where stock prices were free of the effects of the formation of the subprime crisis; and 3) the availability of data among the diverse information sources consulted.
Table 2. Stocks used in the study.
Source: Author's elaboration.
In this context, the basic aim was to build a homogeneous and sufficiently broad database, capable of being processed with the feature extraction techniques used in this study. Four different databases were built to test different expressions and periodicities of the returns on equities. On the one hand, two databases are expressed in returns and the other two in returns in excess of the riskless interest rate; on the other hand, two of them have a weekly periodicity and the other two a daily one. The weekly databases range from July 7, 2000, to January 27, 2006, and include 20 stocks and 291 observations, whereas the daily databases, from July 3, 2000, to January 27, 2006, contain 22 assets and 1491 quotations.
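As an illustration of how such databases can be assembled (a sketch only; the exact return definition and the riskless-rate proxy are not specified here, so both are assumptions), in Python:

```python
# Illustrative sketch; 'prices' is a hypothetical (dates x stocks) DataFrame of
# closing prices and 'riskless' a hypothetical series of per-period riskless
# rates aligned on the same dates (both are assumptions, not the paper's data).
import pandas as pd

def build_databases(prices: pd.DataFrame, riskless: pd.Series):
    returns = prices.pct_change().dropna()           # simple returns (assumed)
    excess = returns.sub(riskless, axis=0).dropna()  # returns in excess of the riskless rate
    return returns, excess

# A weekly counterpart can be built from weekly-sampled prices, e.g.:
# weekly_prices = prices.resample("W-FRI").last()
```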
Extraction of underlying factors and reconstruction of the observed returns.
According to the models in Table 1, the first step was the extraction of the underlying factors by using Matlab® scripts,6 also obtaining the matrices of weights of the extraction process, or demixing matrices, and the matrices of loadings of the generation process, or mixing matrices. For the estimation of the models, this paper used the following specifications: in PCA, the classic linear version; in FA, the Maximum Likelihood method (MLFA); in ICA, the ICASSO software based on the FastICA algorithm; and in NNPCA, a hierarchical auto-associative neural network or autoencoder.7 Second, the observed variables were reconstructed employing the extracted factors and the mixing matrices. This paper includes the experiments for the four techniques, the four databases, and a test window ranging from two to nine extracted factors.8
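As an illustration of this two-stage pipeline, the following Python sketch extracts k factors with PCA, maximum-likelihood FA, and FastICA, and then reconstructs the observed returns (an illustrative re-implementation under assumed data shapes, not the authors' Matlab® code; the NNPCA step, estimated in the paper with Scholz's (2006b) non-linear PCA toolbox, is omitted):

```python
# Illustrative sketch in Python; the study itself used Matlab scripts.
import numpy as np
from sklearn.decomposition import PCA, FactorAnalysis, FastICA

rng = np.random.default_rng(0)
X = rng.standard_normal((291, 20))  # placeholder for a weekly returns database

k = 9  # number of underlying factors (the test window ranges from 2 to 9)

models = {
    "PCA": PCA(n_components=k),
    "MLFA": FactorAnalysis(n_components=k),  # maximum-likelihood factor analysis
    "ICA": FastICA(n_components=k, whiten="unit-variance", random_state=0),
}

reconstructed = {}
for name, model in models.items():
    F = model.fit_transform(X)               # extraction (demixing) step
    if hasattr(model, "inverse_transform"):  # PCA and FastICA
        reconstructed[name] = model.inverse_transform(F)
    else:                                    # FA: rebuild from loadings and mean
        reconstructed[name] = F @ model.components_ + model.mean_
```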
Measures of reconstruction accuracy.
To obtain a more objective measure of the accuracy of the reconstruction using the systematic risk factors obtained with each technique, some statistics widely employed to evaluate the accuracy of forecasting models in economics and finance were used, which in this context represent measures of reconstruction accuracy. These measures, taken from Pérez & Torra (2001) and Diebold & Lopez (1996), are the following: mean absolute error (MAE), mean absolute percentage error (MAPE), root mean square error (RMSE), Theil's U statistic (U-Theil), confusion matrix (CM), confusion rate (CR), chi-squared contrast of independence (χ²), and Pesaran & Timmermann's directional accuracy statistic (DA).
The first four are measures of reconstruction accuracy, which represent different expressions to compute the error in the reconstruction of the observed returns; in their standard formulations:

$$\mathrm{MAE} = \frac{1}{H}\sum_{h=1}^{H}\left|r_h - \hat{r}_h\right|, \qquad \mathrm{MAPE} = \frac{100}{H}\sum_{h=1}^{H}\left|\frac{r_h - \hat{r}_h}{r_h}\right|,$$

$$\mathrm{RMSE} = \sqrt{\frac{1}{H}\sum_{h=1}^{H}\left(r_h - \hat{r}_h\right)^2}, \qquad \text{U-Theil} = \frac{\sqrt{\frac{1}{H}\sum_{h=1}^{H}\left(r_h - \hat{r}_h\right)^2}}{\sqrt{\frac{1}{H}\sum_{h=1}^{H} r_h^2} + \sqrt{\frac{1}{H}\sum_{h=1}^{H} \hat{r}_h^2}},$$

where H denotes the total number of observations; h = 1, ..., H; $r_h$ are the observed returns, and $\hat{r}_h$, the reconstructed returns.
The confusion matrix is a contingency table necessary to compute the contrasts for evaluating the direction-of-change reconstruction measures, namely, the confusion rate and the chi-squared contrast; it is constructed in this manner (rows classify the sign of the observed returns, columns the sign of the reconstructed ones):

$$\mathrm{CM} = \begin{pmatrix} n_{11} & n_{12}\\ n_{21} & n_{22} \end{pmatrix}, \qquad \begin{matrix} n_{11}:\ r_h > 0,\ \hat{r}_h > 0 & \quad n_{12}:\ r_h > 0,\ \hat{r}_h \le 0\\ n_{21}:\ r_h \le 0,\ \hat{r}_h > 0 & \quad n_{22}:\ r_h \le 0,\ \hat{r}_h \le 0 \end{matrix}$$

where $n_{ij}$ indicates the absolute frequency of occurrence of each condition.
The confusion rate shows the percentage of incorrect reconstructions, i.e., the proportion of off-diagonal cases, and is calculated by:

$$\mathrm{CR} = \frac{n_{12} + n_{21}}{H}$$
The chi-squared contrast assumes a null hypothesis of independence between the signs of the reconstructions and their real values; therefore, the rejection of the null hypothesis and high values of the statistic imply a good performance based on the direction-of-change reconstruction; its formulation is as follows:9

$$\chi^2 = \sum_{i=1}^{2}\sum_{j=1}^{2}\frac{\left(n_{ij} - \dfrac{n_{i\cdot}\, n_{\cdot j}}{H}\right)^2}{\dfrac{n_{i\cdot}\, n_{\cdot j}}{H}},$$

where $n_{i\cdot}$ and $n_{\cdot j}$ are the marginal frequencies.
Finally, the DA statistic is another directional accuracy reconstruction measure, with asymptotic distribution N(0,1), which poses a null hypothesis of independence between the observed and the reconstructed values; its interpretation is similar to the former contrast and, in its standard Pesaran and Timmermann construction, it is built as follows:

$$\mathrm{DA} = \frac{SR - SRI}{\sqrt{\widehat{\mathrm{var}}(SR) - \widehat{\mathrm{var}}(SRI)}},$$

with

$$SR = \frac{1}{H}\sum_{h=1}^{H} I\left[r_h \hat{r}_h > 0\right], \qquad SRI = P\hat{P} + (1 - P)(1 - \hat{P}),$$

$$P = \frac{1}{H}\sum_{h=1}^{H} I\left[r_h > 0\right], \qquad \hat{P} = \frac{1}{H}\sum_{h=1}^{H} I\left[\hat{r}_h > 0\right],$$

$$\widehat{\mathrm{var}}(SR) = \frac{SRI(1 - SRI)}{H}, \qquad \widehat{\mathrm{var}}(SRI) = \frac{(2\hat{P} - 1)^2 P(1 - P)}{H} + \frac{(2P - 1)^2 \hat{P}(1 - \hat{P})}{H} + \frac{4 P \hat{P}(1 - P)(1 - \hat{P})}{H^2},$$

where SR denotes the success ratio; SRI, the success ratio in the case of independence between the observed and reconstructed values under the null hypothesis; and I is an indicator function denoting the occurrence of the condition imposed inside the square brackets.10
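The following Python sketch mirrors the formulas above for a single stock (illustrative only, not the authors' Matlab® code); the synthetic global measures reported below are then the mean, median, and standard deviation of these per-stock values across all the assets:

```python
import numpy as np

def reconstruction_measures(r, r_hat):
    """Accuracy measures for observed returns r and reconstructions r_hat."""
    H = len(r)
    e = r - r_hat
    mae = np.mean(np.abs(e))
    mape = 100.0 * np.mean(np.abs(e / r))  # assumes no exactly-zero observed return
    rmse = np.sqrt(np.mean(e ** 2))
    u_theil = rmse / (np.sqrt(np.mean(r ** 2)) + np.sqrt(np.mean(r_hat ** 2)))

    # Confusion matrix of the signs: rows = sign of r, columns = sign of r_hat
    up, up_hat = r > 0, r_hat > 0
    cm = np.array([[np.sum(up & up_hat), np.sum(up & ~up_hat)],
                   [np.sum(~up & up_hat), np.sum(~up & ~up_hat)]])
    cr = (cm[0, 1] + cm[1, 0]) / H  # confusion rate: share of sign mismatches

    # Chi-squared contrast of independence on the confusion matrix
    expected = np.outer(cm.sum(axis=1), cm.sum(axis=0)) / H
    chi2 = np.sum((cm - expected) ** 2 / expected)

    # Pesaran & Timmermann directional accuracy (DA) statistic
    sr = (cm[0, 0] + cm[1, 1]) / H           # success ratio
    p, p_hat = up.mean(), up_hat.mean()
    sri = p * p_hat + (1 - p) * (1 - p_hat)  # success ratio under independence
    var_sr = sri * (1 - sri) / H
    var_sri = ((2 * p_hat - 1) ** 2 * p * (1 - p) / H
               + (2 * p - 1) ** 2 * p_hat * (1 - p_hat) / H
               + 4 * p * p_hat * (1 - p) * (1 - p_hat) / H ** 2)
    da = (sr - sri) / np.sqrt(var_sr - var_sri)

    return dict(MAE=mae, MAPE=mape, RMSE=rmse, UTheil=u_theil,
                CM=cm, CR=cr, chi2=chi2, DA=da)
```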
RESULTS
Graphic analysis.
The results obtained in the reconstruction of the observed returns using the four techniques were outstanding at first sight for all of them, making it difficult to determine which one was the best. Figures 1 and 2 present the observed versus the reconstructed returns produced by the four techniques, for the first eight stocks of the database of weekly returns when nine factors were extracted.11 The line plots include all the observations, showing that, in general, all the techniques reproduce the real values successfully for the entire period; nevertheless, if the stem plots are zoomed in on the first 50 observations, it can be seen that FA and ICA present greater errors in the reconstruction.
Figure 1. Observed vs. reconstructed returns. Database of weekly returns. Nine underlying factors were extracted.
Source: Author's elaboration.
Figure 2. Observed vs. reconstructed returns. Database of weekly returns. Nine underlying factors were extracted. Stem plots.
Source: Author's elaboration.
From the visual plot analysis, it can be detected that, given the number of factors extracted, the four techniques fail to reproduce the highest and lowest peaks in the observations; but, if the number of factors extracted is increased in all the techniques, this problem disappears.12 In addition, it can be observed that in some cases the best reconstruction of each asset is not produced by the same technique, i.e., while some stocks are reconstructed better by one technique, other shares are better reproduced through another method. All the former results are similar across all the cases of the experiments.
Measures of reconstruction analysis.
All the foregoing measures of accuracy were computed for each stock, as well as the arithmetic mean, median, and standard deviation of the MAE, MAPE, RMSE, U-Theil, and CR, proposed as synthetic global measures to evaluate the reconstruction errors over all the assets. In addition, this paper also analyzes the results of the directional accuracy statistics χ² and DA individually for each stock, to test the null hypothesis of independence in the reconstruction process. All these calculations were replicated for the four extraction techniques, the four databases, and the entire testing window.
Tables 3 to 6 present the results of the foregoing experiments applied to the database of weekly returns, when nine factors were extracted, for PCA, FA, ICA, and NNPCA, respectively. First of all, it is important to remark that the results for all the techniques are outstanding and reflect a high-quality reconstruction of the returns; however, in trying to find the best of these methods, the following distinctions are important. In general, regarding the measures of reconstruction accuracy MAE, MAPE, RMSE, and U-Theil, the smaller errors in the reconstruction - in terms of their arithmetic mean - point to PCA and NNPCA as the best ones. Strictly speaking, PCA scored better results in all the foregoing measures except the U-Theil statistic, but the difference between both techniques in the computed error is really small. However, NNPCA presents a smaller standard deviation of the former statistics, which means less sensitivity to variations around the mean values of the proposed synthetic measures.
Table 3. Measures of reconstruction accuracy: PCA.
Source: Author's elaboration.
Table 4. Measures of reconstruction accuracy: FA.
Source: Author's elaboration.
Table 5. Measures of reconstruction accuracy: ICA.
Source: Author's elaboration.
Table 6. Measures of reconstruction accuracy: NNPCA.
Source: Author's elaboration.
In addition, considering that the observed variables are not normally distributed, and that the median is therefore a more suitable synthetic measure of reconstruction accuracy, in this case NNPCA beats PCA in all the foregoing measures except the MAPE. Regarding the CR, the results are similar: PCA obtained the lowest percentage of incorrect reconstructions in terms of the mean, and NNPCA in terms of the median. Concerning the directional accuracy contrasts χ² and DA for each stock, the findings of this research show that in almost all the cases the null hypothesis of independence is rejected at the 5% level of statistical significance in both tests; therefore, an association can be established between the signs of the reconstructions and the real values of the returns.13
In summary, in almost all cases of the study, the results point to NNPCA as the best technique for the reconstruction in terms of the mean when a smaller number of factors is retained; and to PCA, when a larger number of them are extracted, which leads us to think that NNPCA performs better than the other techniques as a dimensional reduction or feature extraction technique. In terms of the median, NNPCA surpasses the rest of the techniques in almost all the cases; besides, in the daily databases, NNPCA shows clearer supremacy over the other techniques in almost all the measures in terms of mean, median, and standard deviation. Nevertheless, this is not a rule, and for some databases, a particular number of factors, and specific measures of accuracy, the results point to other techniques as the best ones.
Additionally, to analyze the performance of each technique in the individual reproduction of the observed variables, the results of the MAE, MAPE, RMSE, U-Theil, and CR obtained with PCA were taken as benchmarks. Then, this set of benchmarks was compared with the results of the same measures obtained with the rest of the techniques, by subtracting the former from the latter. Tables 7 to 9 present these results. The findings reveal that, in terms of the individual reconstruction of the observed returns, in the comparisons FA vs. PCA, ICA vs. PCA, and NNPCA vs. PCA, around 50% of the stocks performed equally across techniques; FA surpasses PCA in only about 20% of the reproductions -in almost all the measures-, ICA in about 5%-10%, and NNPCA in around 10%-30%. These results were similar across all the cases and samples in the study.
Table 7. Factor Analysis (FA) vs. Principal Component Analysis (PCA).
Source: Author's elaboration.
Table 8. Independent Component Analysis (ICA) vs. Principal Component Analysis (PCA).
Source: Author's elaboration.
Table 9. Neural Networks Principal Component Analysis (NNPCA) vs. Principal Component Analysis (PCA).
Source: Author's elaboration.
CONCLUSIONS
From the theoretical standpoint, NNPCA constitutes the best technique, since the underlying factors extracted present better attributes: they are non-linearly uncorrelated, guaranteeing not only linearly uncorrelated systematic risk factors for the Arbitrage Pricing Theory (APT) model but also non-linearly uncorrelated ones.
However, the findings in the empirical study do not demonstrate clear supremacy of one technique over the others, since all the techniques successfully reproduce the observed returns; nevertheless, broadly speaking, and based on its theoretical supremacy and the evidence uncovered, NNPCA can be pointed out as the best technique to reconstruct the observed returns on the equities of the sample.
MLFA was the technique with the worst performance in the reconstruction; although its results were good enough, the other techniques simply performed better. However, the clarification stated in the section on matrix parallelism about the direct comparison of FA with the other kinds of analysis used in this study must not be forgotten. A future step in the research will be to compare FA to its equivalent versions for the independent and non-linear models.
According to the attributes of the components or factors produced by each technique, it could be expected that the results of the reconstruction would improve as one moves from basic techniques such as PCA and FA to advanced methods like ICA and NNPCA. However, in general, the ICA reconstruction was worse than that of PCA in terms of the first four measures of reconstruction accuracy in almost all cases. Further research will be necessary to find out the reasons for these results.
Additionally, it can be concluded that the four techniques performed a successful reconstruction of the observed returns; nevertheless, the supremacy of one of them over the others is very sensitive to the number of components or factors retained, the expression of the model, and the specific asset analyzed. Consequently, it might be stated that the selection of one technique or another will depend mainly on the number of dimensions to retain and the specific stock under study; nevertheless, further research concerning this issue will be necessary.
Finally, some natural expansions of this work would be the search for other measures to evaluate the accuracy of the reproduction - both in univariate and in multivariate terms - and other methodologies to compare the results of the four techniques; a deeper study of the univariate and multivariate statistics and of the morphology of the components and factors extracted; and the interpretation of the underlying factors of systematic risk, namely, the risk attribution process. Likewise, after having tested these techniques in a pre-crisis period free of almost any price distortion originated by the speculative movements natural to a crisis period, other extensions of this research would be to test the accuracy of the reproduction of the observed returns produced by the multifactor generative models of returns derived from each technique in both a crisis and a post-crisis period.
ACKNOWLEDGEMENTS
The authors thank Cristina Ubago at Gaesco, Spain, for the financial data provided, as well as the referees of this paper for their valuable comments.
DECLARATION OF INTEREST CONFLICTS
There is no conflict of interest in the elaboration of this paper, and all the ethical principles generally accepted in the scientific community have been observed.
Notes
1 For the sake of saving space, a review of the literature on the application of multivariate techniques in different fields of knowledge is out of the scope of this paper. However, the interested reader can consult Ladrón de Guevara-Cortés & Torra-Porras (2014) and Ladrón de Guevara-Cortés, Torra-Porras & Monte-Moreno (2018, 2019), where deeper reviews of the literature focused on each one of the techniques compared in this paper can be found.
2 Although in the following papers the authors made a theoretical comparison of the techniques utilized, this paper will focus on the comparison of their empirical results. For detailed information about both the theoretical and the empirical comparisons made in those works, the interested reader can consult the sources.
3 Neither other techniques to produce non-linear components nor other methods to obtain non-linear principal component analysis (NLPCA) different from NNPCA are within the scope of this research. Nevertheless, the interested reader can find in the literature works where techniques such as the quantum-inspired evolutionary algorithm (QIA) to extract non-linear principal components, kernel principal component analysis (KPCA), or curvilinear component analysis (CCA) are compared with some of the techniques used in this study.
4 The explanation of each class of analysis is out of the scope of this research since they have been discussed in former studies; this paper will focus only on the comparison among the four techniques. Nevertheless, the interested reader can find a general explanation of PCA, FA, ICA, and NNPCA in Ladrón de Guevara-Cortés & Torra-Porras (2014) and Ladrón de Guevara-Cortés, Torra-Porras & Monte-Moreno (2018, 2019).
5 The complete factor analysis model specification includes the matrix of specific factors U: X = 1μ + FΛ' + U; however, this paper cannot use this matrix in the generation process because it represents the error in the reconstruction of the original variables, which will be known only after the reproduction of the variables, by: U = X - (1μ + FΛ').
6 The PCA and FA scripts were elaborated using the functions included in the software; the ICA scripts were adapted from Himberg & Hyvärinen (2005); and the NNPCA ones, from Scholz (2006b).
7 For details about the estimation models, see Ladrón de Guevara-Cortés & Torra-Porras (2014) and Ladrón de Guevara-Cortés, Torra-Porras & Monte-Moreno (2018, 2019).
8 Since there is no definite, widespread criterion to determine the best number of components to extract across all the techniques, nine criteria usually accepted in the PCA and FA literature were used: the eigenvalues' arithmetic mean, the percentage of explained variance, the exclusion of the components or factors explaining a small amount of variance, the scree plot, the unretained eigenvalue contrast (Q statistic), the likelihood ratio contrast, Akaike's information criterion (AIC), the Bayesian information criterion (BIC), and the maximum number of components feasible to estimate in each technique. The comparable window across the four techniques, indicated by the results of these criteria, ranged from two to nine factors.
9 The degrees of freedom for this contrast are calculated by v = (r - 1)(k - 1), where v denotes the degrees of freedom; r, the number of rows of the confusion matrix; and k, the number of columns.
10 If the condition is fulfilled, the indicator takes the value of 1.
11 For the sake of saving space, only the results for the database of weekly returns when nine factors were extracted are presented explicitly; nevertheless, the results and conclusions reported include all the cases. The rest of the plots are available upon request.
12 The results of those additional experiments are not presented in this study; they were done only to test the reproduction capacity of the techniques when considering all the factors feasible to compute in each one of them.
13 The null hypothesis of independence of the χ² and DA contrasts is rejected in almost all cases; nevertheless, for some specific stocks, the null hypothesis could not be rejected. These few cases do not significantly affect the overall results and conclusions derived from these statistics.
REFERENCES
1. Anowar, F., Sadaoui, S., & Selim, B. (2021). A conceptual and empirical comparison of dimensionality reduction algorithms (PCA, KPCA, LDA, MDS, SVD, LLE, ISOMAP, LE, ICA, t-SNE). Computer Science Review, 40, 100378. https://doi.org/10.1016/j.cosrev.2021.100378
2. Ayesha, S., Hanif, M. K., & Talib, R. (2020). Overview and comparative study of dimensionality reduction techniques for high dimensional data. Information Fusion, 59, pp. 44-58. https://doi.org/10.1016/j.inffus.2020.01.005
3. Back, A. & Weigend, A. (1997). A first application of independent component analysis to extracting structure from stock returns. International Journal of Neural Systems, 8(4), pp. 473-484. https://doi.org/10.1142/S0129065797000458
4. Bellini, F. & Salinelli, E. (2003). Independent Component Analysis and immunization: An exploratory study. International Journal of Theoretical and Applied Finance, 6(7), pp. 721-738. https://doi.org/10.1142/S0219024903002201
5. Cavalcante, R. C., Brasileiro, R. C., Souza, L. F., Nobrega, J. P., & Oliveira, A. L. I. (2016). Computational Intelligence and Financial Markets: A Survey and Future Directions. Expert Systems with Applications, 55, pp. 194-211. https://doi.org/10.1016/j.eswa.2016.02.006
6. Coli, M., Di Nisio, R., & Ippoliti, L. (2005). Exploratory analysis of financial time series using independent component analysis. In Proceedings of the 27th International Conference on Information Technology Interfaces, pp. 169-174. Zagreb: IEEE. https://doi.org/10.1109/ITI.2005.1491117
7. Corominas, Ll., Garrido-Baserba, M., Villez, K., Olsson, G., Cortés, U., & Poch, M. (2018). Transforming data into knowledge for improved wastewater treatment operation: A critical review of techniques. Environmental Modelling & Software, 106, pp. 89-103. https://doi.org/10.1016/j.envsoft.2017.11.023
8. Diebold, F. X. & Lopez, J. A. (1996). Forecast evaluation and combination. In G. S. Maddala & C. R. Rao (Eds.), Handbook of Statistics, Vol. 14: Statistical Methods in Finance, pp. 241-268. Amsterdam: Elsevier. https://doi.org/10.3386/t0192
9. Himberg, J. & Hyvärinen, A. (2005). Icasso: Software for investigating the reliability of ICA estimates by clustering and visualization. Retrieved from: http://www.cis.hut.fi/projects/ica/icasso/about+download.shtml [2 February 2009].
10. Ibraimova, M. (2019). Predicting financial distress through machine learning (Publication No. 139967) [Unpublished master's thesis]. Universitat Politècnica de Catalunya. Retrieved from: http://hdl.handle.net/2117/131355
11. Ince, H. & Trafalis, T. B. (2007). Kernel principal component analysis and support vector machines for stock price prediction. IIE Transactions, 39(6), pp. 629-637. https://doi.org/10.1109/IJCNN.2004.1380933
12. Ladrón de Guevara-Cortés, R., Torra-Porras, S., & Monte-Moreno, E. (2019). Neural Networks Principal Component Analysis for estimating the generative multifactor model of returns under a statistical approach to the Arbitrage Pricing Theory. Evidence from the Mexican Stock Exchange. Computación y Sistemas, 23(2), pp. 281-298. https://doi.org/10.13053/CyS-23-2-3193
13. Ladrón de Guevara-Cortés, R., Torra-Porras, S., & Monte-Moreno, E. (2018). Extraction of the underlying structure of systematic risk from non-Gaussian multivariate financial time series using Independent Component Analysis. Evidence from the Mexican Stock Exchange. Computación y Sistemas, 22(4), pp. 1049-1064. https://doi.org/10.13053/CyS-22-4-3083
14. Ladrón de Guevara Cortés, R. & Torra Porras, S. (2014). Estimation of the underlying structure of systematic risk using Principal Component Analysis and Factor Analysis. Contaduría y Administración, 59(3), pp. 197-234. https://doi.org/10.1016/S0186-1042(14)71270-7
15. Lesch, R., Caille, Y., & Lowe, D. (1999). Component analysis in financial time series. In Proceedings of the 1999 Conference on Computational Intelligence for Financial Engineering, pp. 183-190. New York: IEEE/IAFE. https://doi.org/10.1109/CIFER.1999.771118
16. Liu, H. & Wang, J. (2011). Integrating Independent Component Analysis and Principal Component Analysis with Neural Network to Predict Chinese Stock Market. Mathematical Problems in Engineering, 2011, pp. 1-15. https://doi.org/10.1155/2011/382659
17. Lizieri, C., Satchell, S., & Zhang, Q. (2007). The underlying return-generating factors for REIT returns: An application of independent component analysis. Real Estate Economics, 35(4), pp. 569-598. https://doi.org/10.1111/j.1540-6229.2007.00201.x
18. Miranda-Henrique, B., Amorim-Sobreiro, V., & Kimura, H. (2019). Literature review: Machine learning techniques applied to financial market prediction. Expert Systems with Applications, 124, pp. 226-251. https://doi.org/10.1016/j.eswa.2019.01.012
19. Pérez, J. V. & Torra, S. (2001). Diversas formas de dependencia no lineal y contrastes de selección de modelos en la predicción de los rendimientos del Ibex35. Estudios sobre la Economía Española, 94, pp. 1-42. Retrieved from: http://documentos.fedea.net/pubs/eee/eee94.pdf
20. Rojas, S. & Moody, J. (2001). Cross-sectional analysis of the returns of iShares MSCI index funds using Independent Component Analysis. CSE610 Internal Report. Oregon Graduate Institute of Science and Technology. Retrieved from: http://www.geocities.ws/rr_sergio/Projects/cse610_report.pdf
21. Ross, S. A. (1976). The arbitrage theory of capital asset pricing. Journal of Economic Theory, 13(3), pp. 341-360. https://doi.org/10.1016/0022-0531(76)90046-6
22. Sayah, M. (2016). Analyzing and Comparing Basel III Sensitivity Based Approach for the Interest Rate Risk in the Trading Book. Applied Finance and Accounting, 2(1), pp. 101-118. https://doi.org/10.11114/afa.v2i1.1300
23. Scholz, M. (2006a). Approaches to analyse and interpret biological profile data [Unpublished doctoral dissertation]. Potsdam University. Retrieved from: https://publishup.uni-potsdam.de/opus4-ubp/frontdoor/deliver/index/docId/696/file/scholz_diss.pdf
24. Scholz, M. (2006b). Nonlinear PCA toolbox for Matlab®. Retrieved from: http://www.nlpca.org/matlab [8 September 2008].
25. Scikit-Learn (2021, July 12). Manifold Learning. https://scikit-learn.org/stable/modules/manifold.html
26. Wei, Z., Jin, L., & Jin, Y. (2005). Independent Component Analysis. Working paper. Department of Statistics, Stanford University.
27. Weigang, L., Rodrigues, A., Lihua, S., & Yukuhiro, R. (2007). Nonlinear Principal Component Analysis for withdrawal from the employment time guarantee fund. In S. Chen, P. Wang & T. Kuo (Eds.), Computational Intelligence in Economics and Finance, Vol. II, pp. 75-92. Berlin: Springer-Verlag. https://doi.org/10.1007/978-3-540-72821-4_4
28. Yip, F. & Xu, L. (2000). An application of independent component analysis in the arbitrage pricing theory. In S. Amari et al. (Eds.), Proceedings of the International Joint Conference on Neural Networks, pp. 279-284. Los Alamitos: IEEE. https://doi.org/10.1109/IJCNN.2000.861471