A Review on Machine Learning for Asset Management
Abstract
:1. Introduction
2. Related Works
2.1. General Surveys
2.2. Price Forecasting
2.3. Value/Factor Investing
2.4. Portfolio Management
2.5. Our Proposal in Context
- Finance is an extremely diverse field in Economics, that includes such diverse disciplines as Asset/Portfolio Management, Risk Assessment, Fraud Detection or Financial Regulation. The use of ML techniques in all these fields, in the recent years, has been increasingly relevant. Most of the literature reviews have tried to span every single discipline within this extensive and heterogeneous group, so that the analysis has been revealed, in some occasions, as relatively superficial. In some other cases, the classification has been quite arguable because of the dominance of price forecasting fields, and the mixture of financial and banking disciplines. Maybe due to this extensive strategy adopted by their peers, some authors have adopted the reverse approach, focusing the revision on a single ML methodology, achieving an arguable result extensive in financial disciplines but excessively focused on a single ML technique.
- Within the financial disciplines related to asset management, price forecasting has been the favourite field of review papers in the last five years. Favored by this wide coverage, researchers and practitioners that are interested in this topic can decide which path they should take according to currently existing literature.
- Similarly, portfolio management reviews are clearly underrated. Just one paper Emerson et al. (2019) in the last five years can be considered as a general survey about this topic.
- It covers a very understated field in the surveys’ literature, despite the growing relevance that this financial discipline has reached in recent years in the banking industry. The range of fields of our paper spans all the topics related to asset/portfolio management, from asset pricing and factor investing, more linked to economic and financial variables, to price forecasting and algorithmic trading, more concentrated on price and volume data.
- Our paper tries to find an equilibrium point between finance, statistics and computational fields, enhancing the analysis of the recent literature from the perspective of financial economics, as well as ML methods, so that researchers and practitioners can find their areas of interest without gaps. We dedicate a section to analyzing the quality and homogeneity of datasets, a somewhat neglected aspect in finance, unlike other scientific fields. Similarly, we make an analysis of the different approaches adopted in the recent literature regarding the performance criteria.
- Beyond the gathering, processing, and classification of papers and information, our work gives responses to several research questions, such as areas of interest to the financial and ML community, degree of maturity of the existing research in each of the application areas, areas with more promising potential from academic and industrial perspectives, and suggestions about future directions for ML research in asset/portfolio management.
3. Theoretical Background
3.1. Financial Background
- Value/factor investing. Investment strategies which use asset pricing models to select the most valuable assets to invest in.
- Price Forecasting. Investment strategies focused on the best prediction of asset prices. Algorithmic trading can be considered as a special case of price strategy.
- Portfolio Management. Mathematical and statistical techniques which solve optimization and simulation problems in investment management.
3.1.1. Value/Factor Investing
SDF as General Source of Asset Pricing Models
Theoretical Factor Models
- Static CAPM. The CAPM of Sharpe (1964) and Lintner (1965) is an equilibrium model in which the excess return on the market portfolio is the only pricing component. As a result, the model predicts that every asset’s projected excess return is proportionate to its market beta.being the risk-free rate, the return on the market portfolio, and the beta of the asset i with respect to the market.
- Intertemporal CAPM. By allowing for various time horizons and preferences among investors, Intertemporal Capital Asset Pricing Model (ICAPM) of Merton (1973) relaxes some assumptions of the static CAPM. Asset risk premia are linear functions of the market beta and other betas in terms of factors. As a result, the market factor is really not the exclusive determinant of pricing any longer.
- Consumption-Based CAPM. The ICAPM in its seminal form calls for determining the variables that influence the opportunity set’s evolution. In Breeden (1979), the author introduced a CCAPM that substitutes the multiple betas in the decomposition of expected returns by a single beta, which reflects changes in aggregate consumption. The assumptions are the same as in Merton’s ICAPM.
- Arbitrage Pricing Theory. APT from Ross (1976), along with CAPM, is one of the most influential theories on asset pricing. The APT varies from the CAPM in that its assumptions are less restrictive. The APT concentrates on return factor decomposition: statistical description of asset returns as linear combinations of K common factors and a random disturbance serves as its foundation.If indicates the pay-off of an asset, then we have:
Empirical Factor Models
- Size and value factors. The empirical evidence that small-cap equities perform better than large-cap equities is known as the size effect. Van Dijk (2011) made an extensive survey of 30 years of research in equity returns. As the author recognizes, this additional factor in CAPM was primarily introduced by Fama and French (1992) with their three-factor model, and since then, “there has been a vigorous debate on whether the size premium is a compensation for systematic risk”.The size factor is represented as the excess return of small caps over large caps. Fama and French (1992) introduced a “Small–Minus–Big” (SMB) portfolio, which is a zero-investment portfolio built as the difference between the average return on three small-cap portfolios and that on three large-cap portfolios, which has been ordered, previously, according to the book-to-market ratio (Value, Neutral and Growth). This previous filter is defining the third factor, “High-Minus-Low” (HML) of their model, which is another zero-investment portfolio built as the difference between the average return on two value stock portfolios and that on two growth stock portfolios, according to the size quantiles (Big and Small).being the risk-free rate, the return on the market portfolio, the beta of the asset i with respect to the market factor, the beta of the asset i with respect to the size factor, and the beta of the asset i with respect to the value factor.
- Momentum factor. Beyond the size and value factors, the momentum factor is the most prevalent factor in the literature. Momentum can be defined as the rate of acceleration of a security’s price, and simply, it refers to the inertia of a price trend to continue either rising or falling for a particular length of time. The trading strategies related to this effect, as we will see in the next Section 3.1.2, also called “trend following”, seek to capitalize on momentum to enter a trend as it is picking up steam. In statistical terms, the momentum effect characterizes by the existence of serial autocorrelation.The paper by Carhart (1997) can be considered as a study about this matter which, in the last few years, has received more acknowledgement from the investment industry. The author provides evidence in this research which focuses on the mutual fund business, that strong previous performance does not necessarily imply future returns, but that the contrary might be true (if the performance is based on loading up on specific risk factors). This paper presents a four-factor model with the three factors from Fama and French (1992), plus a new factor which represents the momentum effect:being the beta of the asset i with respect to the momentum factor. The WML factor is defined as the excess return of an equally-weighted portfolio for 30% of past winners over an identical portfolio of the 30% past losers (“Winners-Minus-Losers”).
- Profitability and Investment factors. Following the release of the five-factor model from Fama and French (2015), these two components have lately gained a lot of traction in stock investment techniques. This model was expressed as follows:being the beta of asset i with respect to the profitability factor, and the beta of asset i with respect to the investment factor.The procedure to estimate these two new factors is similar to previous factors in Fama and French (1992). Stocks are first sorted according to a measure of profitability or investment. The profitability factor is the excess return of robust profitability stocks over weak profitability ones (“Robust-Minus-Weak” or RMW factor), while in the case of the investment factor, it is defined as the excess return of high-investment stocks over low-investment ones (“Conservative-Minus-Aggressive” or CMA factor). The authors choose as measures the operating profit after interest expenses and the growth of total assets, respectively.
3.1.2. Price Forecasting
3.1.3. Portfolio Management
3.2. Machine Learning Background
3.2.1. Supervised Learning
- LASSO Tibshirani (1996): A form of linear regression that is characterized for using shrinkage. This means that LASSO performs L1 regularization to penalize the absolute value of the magnitude of the coefficients. As a result, typically a sparse set of coefficients is produced by helping reduce overfitting and model complexity. Ridge regression works in a similar fashion but by enforcing L2 penalties, which does not produce sparse models.
- Regression Trees Elith et al. (2008): A decision tree is an ML architecture that uses a flowchart-like structure to arrive to infer a result by taking tests over input variables. Each node of the tree is a test on an input variable and depending on the outcome, the flow continues in one branch of the tree or another until the flow reaches the leaves where final outputs are given. Regression trees are just an extension of decision trees where the target value to predict takes the form of a continuous value.
- Random Forest Breiman (2001): A classification or regression method that works by constructing multiple decision trees at training times. That multitude of trees constitutes an ensemble to produce a final prediction (e.g., in the case of classification by voting and in the case of regression by averaging the outputs of all trees).
- Support Vector Machines Cortes and Vapnik (1995): Binary classifiers that map the training samples to points in another space to maximize the gap between the two categories. They can also perform non-linear classification using specific kernels which map those inputs to high-dimensional feature spaces where non-linear decision boundaries can be tackled. Intuitively, an SVM finds a hyperplane that optimally separates the decision boundary by maximizing the distance between one class and another. They can also be used for regression with the appropriate modifications (namely, Support Vector Regressions (SVRs)).
- Neural Networks (NNs) are the most basic architecture, usually composed of individual perceptrons which are arranged into multiple layers—usually with non-linear activation functions interleaved—of varying width. A perceptron Rosenblatt (1958) is a function f that maps an input x to generate an output z in the following way:In its most simple form, the activation function is just a threshold and the perceptron is just a binary classifier. Note that the bias simply shifts the decision boundary away from the origin. Single-layer perceptrons can be combined together to form a Multi-layer Perceptron (MLP). This architecture is usually composed of three layers: the input layer as before, a hidden layer, and an output one. The input layer remains as before, but the hidden and output ones can be composed by an arbitrary number of nodes (also named neurons). Each of those nodes is a single-layer perceptron that uses a non-linear activation function. Deep Neural Network (NN) (also called fully connected networks) often refer to MLPs with more hidden layers l. In this general case, the output of a certain neuron i of a layer l can be defined as follows:Vanilla NNs are capable of learning any non-linear function (they are universal approximators) given enough network complexity, but they face a number of challenges: (1) due to their fully connected nature, they require a huge number of parameters, (2) are usually harder to train, (3) they lose spatial information of the input, and (4) there is no built-in mechanism for capturing sequential data.
- Convolutional Neural Networks (CNNs) LeCun et al. (1998) uses learnable kernel filters to extract relevant features from the inputs by applying the convolution operation with them. They are especially useful with structured data and in those cases where spatial information is important. Typically, they are currently applied to process 2D images (although a convolution can be applied to any dimensionality). For instance, for the 1D case, we can formulate the output of a single neuron in a CNN as follows:With regards to fully connected NNs, they sport some advantages: (1) as we mentioned, by convolving the input with filters of predefined size instead of being fully connected, they capture spatial features, and (2) by not being fully connected, but instead sharing kernel weights across the whole input, they require way less parameters and thus are easier to train and less prone to overfitting.
- Recurrent Neural Networks (RNNs) are specifically designed to deal with sequence data and learn from temporal information. Although internally they can be shaped either as traditional NNs or CNNs, they usually add recurrent connections in their layers, which helps take into account the state from previous sequence elements or temporal instants. They therefore can capture sequential information and share parameters across different timesteps (in a similar fashion as CNNs do spatially).Typically, the most general topology is a fully recurrent RNN where the outputs of all neurons are connected to the inputs of all for them. Each one multiplies the current inputs and previous outputs through an activation function. Other relevant topologies are Gated Recurrent Unit Network (GRU) and the widely spread Long-short Term Memory (LSTM).GRUs Cho et al. (2014) features two gating mechanisms: update and reset. The update gate is responsible for determining the amount of previous information that will flow to the next step. The reset gate decides which information from the past timestep to neglect for the current state.LSTMs Hochreiter and Schmidhuber (1997) features three gating mechanisms: input, output, and forget. This triple gate system allows the architecture to model long- and short-term dependencies properly.As a matter of fact, all vanilla RNNs, GRUs, or LSTMs are able to model arbitrary time dependencies. The problem is, however, computational and numerical: due to the nature of the training process, the required gradients to learn can easily explode (turn to infinity) or vanish (go to zero) preventing any learning. GRUs are a step forward in comparison with vanilla RNNs and the additional gates from LSTMs help even more to control the information flow to avoid those problems.
3.2.2. Unsupervised Learning
- k-Means Clustering: Usually employed as a pre-processing technique to reduce the number of data points by summarizing them according to their mean expectations. In other words, it takes a number of samples (n) and aims to partition them in some sets (k, where ) so that the variance within each cluster is minimized. The most common algorithm is the iterative or naïve k-means Lloyd (1982).
3.2.3. Reinforcement Learning
3.3. Performance Criteria
3.3.1. Returns
- As summary measure of return we can define the annualized rate of return as the Compounded Annual Growth Rate (CAGR) of the portfolio value between two periods separated n years:being the investment value in period t and n the number of years between the two periods we want to compare.
- Sometimes, the returns are measured in terms of Excess Returns. That means that the portfolio return is measured in terms of comparison with the risk-free asset or, in general, a benchmark asset that is used as reference. The arithmetic excess return can be expressed as follows:
3.3.2. Risk/Return Ratios
- The most popular ratio to measure portfolio performance is the Sharpe Ratio (SR). Conceived by Bill Sharpe, this measure closely follows his work on the CAPM and, by extension, uses total risk to compare portfolios to the Capital Market Line (CML). It compares the portfolio return with the risk involved in achieving this return, in form of total risk, measured through the return standard deviation, as follows:being the standard deviation of portfolio returns.
- Differential Return (DR), by contrast, results in an excess return for the portfolio manager that considers risk in the form of standard deviation (the variability of past returns). It is a sort of a modified Sharpe ratio. Here is the formula:
- When we used the systematic risk measured by the CAPM, instead of total risk, we are referring to Treynor Ratio (TR):
- We can find another very popular ratio, the Calmar Ratio (CR), which can be defined as the ratio between the CAGR and its Maximum Drawdown (MDD) which, at the same time, measures the maximum observed loss from a peak to a trough of a portfolio, before a new peak is attained, and can be considered as an indicator of a downside risk over a specified time period.A very similar approach is achieved by the Sterling Ratio (STR).
- Finally, the Certainty Equivalent Return (CEQ) considers the risk-free return for an investor with quadratic utility and risk aversion parameter compared to the risky portfolio and is given by the following equation:
3.3.3. Goodness of Fit/Prediction
- For linear regression models, R-squared is a goodness-of-fit metric. This statistic shows the percentage of variance in the dependent variable that the independent factors account for when taken jointly. The strength of the link between your model and the dependent variable is measured by R-squared, which is defined between 0 and 1. It can be calculated as follows:
- Mean Absolute Percentage Error (MAPE) is one of the most commonly used performance indicators to measure forecast accuracy. It can be defined as the sum of the individual absolute errors divided by the observed value (each period separately). It is the average of the percentage errors.
- Mean Absolute Error (MAE) is a very useful performance indicator to measure forecast accuracy. As the name implies, it is the mean of the absolute error.It solves the problem of skewness of the previous indicator but, in return, it is not scaled, so it depends on the magnitude of the dependent variable.
- Root Mean Squared Error (RMSE) is a frequently used measure of the differences between values (sample or population values) predicted by a model or an estimator and the values observed.The RMSE serves to aggregate the magnitudes of the errors in predictions for various data points into a single measure of predictive power. RMSE is a measure of accuracy, to compare forecasting errors of different models for a particular dataset and not between datasets, as it is scale-dependent. Actually, many algorithms (especially for ML) are based on the Mean Squared Error (MSE), which is directly related to RMSE.
3.3.4. Risk of Loss Measures
- Value at Risk (VaR) is a metric for calculating investment risk. It calculates how much a set of assets would lose (with a specified probability) in a specific time period, such as a day, under typical market conditions. According to Abad et al. (2014), the VaR is thus a conditional quantile of the asset return loss distribution. Let , , , …, be identically distributed independent random variables representing the financial returns. Use to denote the cumulative distribution function, conditionally on the information set that is available at time . Assume that follows the stochastic process:This quantile can be estimated in two different ways: (1) inverting the distribution function of financial returns, and (2) inverting the distribution function of innovations , in which case is also necessary to estimate .
- Conditional Value at Risk (CVaR), also known as Expected Shortfall (ES), is a risk measure derived from the previous one. The ES at the % level is the expected return on the portfolio in the worst % of cases. ES is an alternative to VaR that is more sensitive to the shape of the tail of the loss distribution.
3.3.5. Statistical Significance
- Most times, hypothesis testing of statistical significance can be run using a t-student distribution. The t-statistic can be expressed this way:
- The p-value in hypothesis significance testing is the probability of getting test findings that are at least as extreme as the actual results, assuming that the null hypothesis is valid. A tiny p-value indicates that under the null hypothesis, such an extreme observed result would be very implausible. p-values of statistical tests are commonly reported in academic articles in a variety of quantitative domains.
3.3.6. Accuracy of Predictions
- Accuracy is a classification metric for evaluating classifiers and can be expressed as:
- Precision is the skill of the model to classify samples accurately and can be calculated as follows:
- Recall shows the skill of the model to classify the maximum possible samples, and is represented by the following equation:
- F-measure describes both precision and recall and can be represented as follows:
4. Datasets
4.1. Structured Data
4.1.1. Stock Values
4.1.2. Macroeconomic Indicators and Financial Information
4.1.3. Technical Indicators
4.2. Unstructured Data
4.3. Analysis
4.3.1. Value/Factor Investing
- Kozak et al. (2018). The methodology used is a very good example of how this type of datasets can be a good way to generate results that can be globally interpreted. In this work, the authors use, firstly, the 5 × 5 size and book-to-market (B/M) sorted portfolios of Fama and French (1993). Secondly, they use 15 anomaly long-short strategies defined as in Novy-Marx and Velikov (2016) and the underlying 30 portfolios from the long and short sides of these strategies. These two datasets are available in two websites fed by the own authors, with the aim to contribute to future and reproducible research. In the first case, datasets are provided by the Kenneth French’s website, which provides downloadable and updated files of Fama/French factors from 1926. In the second case, the author shares the datasets used in his 2016 article. The French’s webpage can be considered as one of the best examples of publicly available databases that has become as meeting point of asset pricing researchers.
- Feng et al. (2020). In this paper we can find another excellent example of how using publicly available datasets from other authors. In their article, the authors firstly download all workhorse factors in the U.S. equity market from Ken French’s data library. Then they add several published factors directly from the authors’ websites, including liquidity from Pastor and Stambaugh (2003) (Stambaugh’s website), the q-factors from Hou and Zhang (2015), and the intermediary asset pricing factors from He et al. (2016). In addition to these 15 publicly available factors, they follow Fama and French (1993) to construct 135 long-short value-weighted portfolios as factor proxies, using firm characteristics surveyed in Hou et al. (2017) and Green et al. (2016).
- Gu et al. (2021). Another example of academic database, but in this case accessible by subscription, is Center for Research in Security Prices (CRSP) US Stock Databases, an affiliate of University of Chicago, which contain daily and monthly market and corporate action data for over 32,000 active and inactive securities with primary listings on the NYSE, NYSE American and NASDAQ. The research-quality data created by this transformational project spawned a vast amount of scholarly research from several generations of academics. Today, nearly 500 leading academic institutions in 35 countries rely on CRSP data for academic research and to support classroom instructions. The CRSP value-weighted index is one of the most usual equity market benchmarks used in financial research.
4.3.2. Portfolio Management
- Heaton et al. (2017). Weekly returns data for the components of the biotechnology IBB index in the period 2012–2016. They train the learner without knowledge of the actual component weights. Their goal is to find a selection of investments that outperforms the official index.
- Krauss et al. (2017). Monthly and daily returns data for the components of S&P 500 in the period 1989–2015. The data source is the Thomson Reuters Datastream. The goal was to build portfolios following a statistical arbitrage strategy with better performance than the benchmark index.
- Ban et al. (2018). Ken French’s website mentioned in the section of asset pricing. They collect monthly excess returns for three different data sets, composed of 5, 10 and 49 industry portfolios, in the period 1994–2013.
- Almahdi and Yang (2017). A five-asset portfolio using five of the most commonly traded ETFs from different asset categories. They extract the weekly closing prices for each of the five assets from Yahoo Finance website, for the period 2011–2015.
- Lee et al. (2019). Data over a period of 22 years from 1995 to 2016, sourced from Thomson Reuters Datastream database. The dataset is composed by weekly closing prices of 10 global equity indices.
- Paiva et al. (2019). Opening, closing, maximum and minimum daily prices of the components of the Brazilian index Ibovespa, from 2001 to 2016. They were sourced from the Bloomberg terminal.
4.3.3. Price Forecasting
- Nikou et al. (2019). In some cases, historical closing prices are the only reference, where the data used include the daily closing price of iShares MSCI UK ETF, also collected from the Yahoo Finance site.
- Zhong and Enke (2019). In other cases we can find financial and economic factors -as in asset pricing models-. In this paper, the dataset includes the daily direction (up or down) of the closing price of the SPDR S&P 500 ETF as the output, along with 60 financial and economic factors as input features. The daily data is collected from 2518 trading days from June 2003. The data sources are public and free (e.g., finance.yahoo.com).
- Khan et al. (2020). Lastly, we can find some examples of financial news and social media data, perfect examples of unstructured data. The source of stock historical daily prices is the same, Yahoo Finance, but the downloaded data have seven features, from date to closing price, passing by traded volume. Given the methodology used in the article, financial news data are also needed, as well as social media data. In the first case, the authors have used Business Insider because it contains a collection of stock market related news from the most famous world news websites, such as Reuters, Financial Times, and so forth. In the second case, they have utilized Twitter API, implemented in Python, to download desired tweets.
5. Methods
- What financial application areas, within the asset management discipline, are of interest to the financial and ML community?
- In each of these application areas, which ML models/methods are preferred (and more successful)?
- Which are the most used performance metrics by the researchers?
5.1. Methodology
5.2. Value/Factor Investing
5.3. Portfolio Management
5.4. Price Forecasting
5.5. Algorithmic Trading
6. Discussion
6.1. Overview
- Researchers are sometimes compelled to present incomplete results that are often refuted by additional studies due to the publication bias towards successful results (see Harvey 2017). As a result, replication is critical, and many academic findings have a very short expiration date, especially if transaction costs are taken into account Cakici and Zaremba (2021).
- One of the main pitfalls of the traditional econometric approach has to do with the p-hacking. As it was demonstrated by Chen (2019), p-hacking alone cannot account for all the anomalies documented in the literature. One way to reduce the risk of spurious detection is to increase the hurdles (often, the t-statistics) but the debate whose title might be “the factor zoo” is still ongoing Harvey and Liu (2019).
- Because of its easy understanding, the decomposition of returns into linear factor models is extremely useful. Nonetheless, there is an eternal dispute in the academic literature as to whether business returns are explained by exposure to macroeconomic variables or merely by firm characteristics. Until the new century the factor-based explanation for risk premium was the favourite, but after the seminal work by Daniel and Titman (1997), the characteristics-based explanation has become a great competitor of the traditional outlook.
- Some researchers have observed fading anomalies as a result of publication: once an anomaly is made public, agents invest in it, driving up prices and causing the anomaly to vanish. David McLean and Pontiff (2016) documents this impact in the United States, while Jacobs and Mülle (2020) finds that post-publication factor returns are sustained in other relevant markets. Herding may be destroying factor premia Krkoska and Schenk-Hoppé (2019), and the democratization of so-called smart-beta products (particularly the ETFs) that enable investors to actively invest in specific styles (value, low volatility, etc.) may speed the process up.
- Researchers have developed more sophisticated techniques to organize the so-called factor zoo and, more significantly, to detect false anomalies. Feng et al. (2020), for example, uses LASSO selection and Fama–MacBeth regressions to see if new factor models are worthwhile. They calculate the benefit of adding one new factor to a set of preset factors, demonstrating that many of the factors described in papers published in the 2010 decade do not provide much extra value.
- There is no such thing as a flawless approach, but the sheer volume of contributions in the field emphasizes the importance of robustness. The notion that factors are likely to change over time is a key obstacle for short-term strategies. We refer for instance to Cooper and Maio (2019).
- As we have seen in Section 5.4 about price forecasting, the difficulty to test consistently, using traditional approaches, the EMH, leaves a huge space to alternative techniques.
- In the case of MVO, as Cochrane (2011) points out, even though it is not a particularly useful guide to computation, classic one-period mean-variance analysis is a brilliantly useful characterization of an optimal portfolio, useful for final investors to understand and think hard about risk allocations. Even when investors are considering highly non-normal payoffs, traditional mean-variance analysis continues to dominate portfolio applications. Nevertheless, many researchers have tried to improve the suitability of this model from different perspectives.
6.2. Discipline Focus
- Value/Factor investing: The landscape is not specifically dominated by any particular technique. PCA is successful in most works as a pre-processing technique whilst other classical ML methods like RFs, SVMs, or shallow NNs are present in almost all the analyzed works. RNNs does not have much presence in this discipline. The paradigms are mainly Supervised Learning and Unsupervised Learning.
- Portfolio Management: In this discipline, we observed a trend of favoring RNNs architectures to model long-term dependencies of financial time-series data. In particular, most of the reviewed methods make use of variations of LSTMs (usually combining them with other techniques like MVO). RL methods appear in this discipline coupled with RNNs in the form of LSTMs in the most recent works. The dominant paradigms are Supervised Learning (SL) and Reinforcement Learning (RL).
- Price Forecasting: The most heterogeneous discipline where all sorts of ML methods have been applied to either refine the output of other algorithms, to generate predictions on its own or even as a technique to process alternative data sources. A few works make use of social media, financial news, and sentiment analysis to increase prediction accuracy. There is no dominant paradigm in this discipline.
- Algorithmic Trading: In this case, most reviewed papers make use of SL to train architectures more typically suited to target other domains. For instance, CNNs (which are common in image processing scenarios) are applied to specially pre-processed financial data with success. Oftentimes, they are also coupled with RNNs techniques to model time dependencies, usually applying LSTMs.
6.3. Challenges and Future Research
- Standard Datasets: The whole field is characterized by a lack of curated datasets to be reused by the community. Although some datasets are built upon the portfolio database of Fama and French (1993), most of them deviate from this standard. Furthermore, even those which reuse that database end up diverging in terms of the final data available for investigation. Therefore, creating a standard database (complete and broad enough) to be reused by the research community is a need for further works.
- Reproducibility: No common methodology or framework for method training and benchmarking has been established. This hurts reproducibility since most of the analyzed methods are difficult, if not impossible, to compare against each other (unless reimplemented specifically for each scenario). In addition, almost no paper includes codes or data to be accessed by other researchers. Establishing a reproducibility framework for asset management ML research is a high-impact workstream for improving the quality of life and pace of the research community.
- Multimodal Data: Most methods are focused on analyzing numerical financial data to generate predictions. Analyzing alternative sources of information like news, social media, sentiment, and user-generated content can provide useful cues for financial decisions. Few works make use of those data sources at the moment. The challenge of combining all those multimodal sources and multiple architectures might unlock new levels of prediction accuracy.
- Heterogeneous Architectures: Arguably due to the state of immaturity in which financial ML sits nowadays (with regard to other more established synergies like image processing or NLP), no clear architectures for processing financial data have been established yet. There is a broad range of papers that spawn new models, and few that build upon solid groundwork to improve them. Finding the common patterns and unifying those diverse architectures could have a beneficial effect to the community for broad adoption in industry (in a similar way as other networks, such as UNet or ResNet, have done for image processing by becoming the de facto standard for many applications).
- Algorithmic trading: This application field is characterized by a very interesting trade-off. From an academic perspective, this area is relatively disconnected from the theoretical background about asset pricing and value investing, which has had a central role in financial economics during the last five decades, and it has been summarized in Section 3. However, in return, and precisely because of this characteristics—exclusive dependence on price data—it is the financial discipline that can maximize the contributions of ML applications. We can find a future challenge in the possibility of combining both issues, deepening trading algorithms with a higher relevance of financial fundamentals.
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Conflicts of Interest
1 | The term “stochastic discount factor” is used because m generalizes standard discount factor ideas. If there is no uncertainty, we may use the conventional present value formula to describe prices
|
2 | An investor’s first order conditions give the basic consumption-based model, in which the pricing kernel or SDF can be expressed as:
|
3 | In the consumption-based model already described, it means that investors are risk neutral, i.e., is linear or there is no variation in consumption, and we are in short time horizons where is close to one. |
4 | The alpha component is, according with the different factor models we have exposed, the independent term which is not associated with any factor of risk and, supposedly, can be associated with the skill of the investors to find extra returns in the securities they invest in. |
References
- Abad, Pilar, Sonia Benito, and Carmen López. 2014. A comprehensive review of value at risk methodologies. The Spanish Review of Financial Economics 12: 15–32. [Google Scholar] [CrossRef]
- Aboussalah, Amine Mohamed, and Chi Guhn Lee. 2020. Continuous control with stacked deep dynamic recurrent reinforcement learning for portfolio optimization. Expert Systems with Applications 140: 112891. [Google Scholar] [CrossRef]
- Almahdi, Saud, and Steve Y. Yang. 2017. An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown. Expert Systems with Applications 87: 267–79. [Google Scholar] [CrossRef]
- Arnott, Rob, Campbell R. Harvey, and Harry Markowitz. 2019. A backtesting protocol in the era of machine learning. The Journal of Financial Data Science 1: 64–74. [Google Scholar] [CrossRef]
- Asness, Clifford, Andrea Frazzini, Ronen Israel, and Tobias Moskowitz. 2014. Fact, fiction, and momentum investing. The Journal of Portfolio Management 40: 75–92. [Google Scholar] [CrossRef]
- Avramov, D., Si Cheng, and Lior Metzker. 2021. Machine Learning Versus Economic Restrictions: Evidence from Stock Return Predictability. Available online: https://ssrn.com/abstract=3450322 (accessed on 30 September 2021).
- Aw, E. N., Joshua Jiang, and John Q. Jiang. 2019. Rise of the machines: Factor investing with artificial neural networks and the cross–section of expected stock returns. The Journal of Investing 29: 6–17. [Google Scholar] [CrossRef]
- Ban, Gah Yi, Noureddine El Karoui, and Andrew E. B. Lim. 2018. Machine learning and portfolio optimization. Management Science 64: 1136–54. [Google Scholar] [CrossRef] [Green Version]
- Belloni, A., V. Chernozhukov, and C. Hansen. 2014. Inference on treatment effects after selection among high-dimensional controls. The Review of Economic Studies 81: 608–50. [Google Scholar] [CrossRef]
- Black, Fischer, Michael C. Jensen, and Myron Scholes. 1972. The Capital Asset Pricing Model: Some Empirical Tests. Westport: Praeger Publishers Inc., pp. 79–121. [Google Scholar]
- Breeden, Douglas T. 1979. An intertemporal asset pricing model with stochastic consumption and investment opportunities. Journal of Financial Economics 7: 265–96. [Google Scholar] [CrossRef]
- Breiman, Leo. 2001. Random forests. Machine Learning 45: 5–32. [Google Scholar] [CrossRef] [Green Version]
- Bustos, Oscar, and Alexandra Pomares-Quimbaya. 2020. Stock market movement forecast: A systematic review. Expert Systems with Applications 156: 113464. [Google Scholar] [CrossRef]
- Cakici, Nusret, and Adam Zaremba. 2021. Size, value, profitability, and investment effects in international stock returns: Are they really there? The Journal of Investing 30: 65–86. [Google Scholar] [CrossRef]
- Carhart, M. M. 1997. On persistence in mutual fund performance. The Journal of Finance 52: 57–82. [Google Scholar] [CrossRef]
- Cavalcante, Rodolfo C., Rodrigo C. Brasileiro, Victor L. F. Souza, Jarley P. Nobrega, and Adriano L. I. Oliveira. 2016. Computational intelligence and financial markets: A survey and future directions. Expert Systems with Applications 55: 194–211. [Google Scholar] [CrossRef]
- Cerniglia, Joseph A., and Frank J. Fabozzi. 2020. Selecting computational models for asset management: Financial econometrics versus machine learning. Is there a conflict? The Journal of Portfolio Management 47: 107–18. [Google Scholar] [CrossRef]
- Cervelló-Royo, Roberto, and Francisco Guijarro. 2020. Forecasting stock market trend: A comparison of machine learning algorithms. Finance, Markets and Valuation 6: 37–49. [Google Scholar] [CrossRef]
- Chen, Andrew Y. 2019. The limits of p-hacking: A thought experiment. In Finance and Economics Discussion Series; Washington, DC: Divisions of Research & Statistics and Monetary Affairs, Federal Reserve Board. [Google Scholar]
- Chen, Luyang, Markus Pelger, and Jason Zhu. 2020. Deep Learning in Asset Pricing. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3350138 (accessed on 4 April 2019).
- Cho, Kyunghyun, Bart Van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using rnn encoder-decoder for statistical machine translation. arXiv arXiv:1406.1078. [Google Scholar]
- Chordia, Tarun, Amit Goyal, and Jay Shanken. 2019. Cross-Sectional Asset Pricing with Individual Stocks: Betas Versus Characteristics. Available online: https://ssrn.com/abstract=2549578 (accessed on 1 November 2017).
- Cochrane, John H. 2000. Asset Pricing. Princeton: Princeton University Press. [Google Scholar]
- Cochrane, John H. 2011. Discount rates. NBER Working Paper No. w16972. Available online: https://ssrn.com/abstract=1820084 (accessed on 1 April 2011).
- Cooper, Ilan, and Paulo F. Maio. 2019. New evidence on conditional factor models. Journal of Financial and Quantitative Analysis 54: 1975–2016. [Google Scholar] [CrossRef] [Green Version]
- Cortes, Corinna, and Vladimir Vapnik. 1995. Support-vector networks. Machine Learning 20: 273–97. [Google Scholar] [CrossRef]
- Cox, John C., and Stephen A. Ross. 1976. The valuation of oprions for alternative stochastic processes. Journal of Financial Economics 3: 145–66. [Google Scholar] [CrossRef]
- Creswell, Antonia, Tom White, Vincent Dumoulin, Kai Arulkumaran, Biswa Sengupta, and Anil A. Bharath. 2018. Generative adversarial networks: An overview. IEEE Signal Processing Magazine 35: 53–65. [Google Scholar] [CrossRef] [Green Version]
- Daniel, Kent, and Sheridan Titman. 1997. Evidence on the characteristics of cross sectional variation in stock returns. Journal of Finance 52: 1–33. [Google Scholar] [CrossRef]
- David McLean, R., and Jeffrey Pontiff. 2016. Does academic research destroy stock return predictability? Journal of Finance 71: 5–32. [Google Scholar] [CrossRef]
- DeMiguel, V., L. Garlappi, and R. Uppal. 2009. Optimal versus naive diversification: How inefficient is the 1/n portfolio strategy? The Review of Financial Studies 22: 1915–53. [Google Scholar] [CrossRef] [Green Version]
- Ding, Xiao, Yue Zhang, Ting Liu, and Junwen Duan. 2015. Deep learning for event-driven stock prediction. Paper presented at Twenty-Fourth International Joint Conference on Artificial Intelligence, Buenos Aires, Argentina, July 25–31. [Google Scholar]
- Durairaj, M., and B. H. Krishna Mohan. 2019. A review of two decades of deep learning hybrids for financial time series prediction. International Journal on Emerging Technologies 10: 324–331. [Google Scholar]
- Elith, Jane, John R. Leathwick, and Trevor Hastie. 2008. A working guide to boosted regression trees. Journal of Animal Ecology 77: 802–13. [Google Scholar] [CrossRef]
- Emerson, S., R. Kennedy, L. O’Shea, and J. O’Brien. 2019. Trends and applications of machine learning in quantitative finance. Paper presented at 8th International Conference on Economics and Finance Research (ICEFR 2019), Lyon, France, June 18–21. [Google Scholar]
- Fama, Eugene, and Kenneth French. 1992. The cross section of expected stock returns. The Journal of Finance XLVII: 427–65. [Google Scholar] [CrossRef]
- Fama, Eugene, and Kenneth French. 1993. Common risk factors in the returns on stocks and bonds. Journal of Financial Economics 33: 3–56. [Google Scholar] [CrossRef]
- Fama, Eugene F., and Kenneth R. French. 2015. A five-factor asset pricing model. Journal of Financial Economics 116: 1–22. [Google Scholar] [CrossRef] [Green Version]
- Fama, Eugene F., and Kenneth R. French. 2018. Choosing factors. Journal of Financial Economics 128: 234–52. [Google Scholar] [CrossRef]
- Fama, Eugene F., and James D. Macbeth. 1973. Risk, return, and equilibrium: Empirical tests. Journal of Political Economy 81: 607–36. [Google Scholar] [CrossRef]
- Feng, Guanhao, Stefano Giglio, and D. Xiu. 2020. Taming the factor zoo: A test of new factors. The Journal of Finance 75: 1327–70. [Google Scholar] [CrossRef]
- Feng, Guanhao, and Jingyu He. 2019. Factor Investing: Hierarchical Ensemble Learning. Available online: https://ssrn.com/abstract=3326617 (accessed on 31 January 2019).
- Feng, Guanhao, Nicholas G. Polson, and J. Xu. 2018a. Deep Factor Alpha. Available online: https://www.arxiv-vanity.com/papers/1805.01104/ (accessed on 1 March 2018).
- Feng, Guanhao, Nicholas G. Polson, and Jianeng Xu. 2018b. Deep Learning in Characteristics-Sorted Factor Models. Available online: https://ssrn.com/abstract=3243683 (accessed on 3 May 2018).
- Feng, Guanhao, Nicholas G. Polson, and Jianeng Xu. 2019. Deep Learning in Asset Pricing. Available online: https://www.semanticscholar.org/paper/Deep-Learning-in-Asset-Pricing%E2%88%97-Feng-Kong/d0404ccdd0598f5ac6abee0ae97741323190aaf2 (accessed on 15 March 2019).
- Freyberger, Joachim, Andreas Neuhierl, and M. Weber. 2020. Dissecting characteristics nonparametrically. Review of Financial Studies 33: 2326–77. [Google Scholar] [CrossRef]
- Giglio, Stefano, and Dacheng Xiu. 2019. Asset Pricing with Omitted Factors. Available online: https://ssrn.com/abstract=2865922 (accessed on 14 September 2019).
- Goetzmann, W. N., and A. Kumar. 2008. Equity portfolio diversification. Review of Finance 12: 433–63. [Google Scholar] [CrossRef] [Green Version]
- Gogas, P., Theofilos Papadimitriou, and Dimitrios Karagkiozis. 2018. The Fama 3 and Fama 5 Factor Models under a Machine Learning Framework. Publisher=Rimini Centre for Economic Analysis. Available online: https://ideas.repec.org/p/rim/rimwps/18-05.html (accessed on 1 May 2018).
- Green, Jeremiah, John R. M. Hand, and F. Zhang. 2016. The characteristics that provide independent information about average u.s. monthly stock returns. Review of Financial Studies 30: 4389–36. [Google Scholar] [CrossRef]
- Gu, Shihao, Bryan Kelly, and Dacheng Xiu. 2020. Empirical asset pricing via machine learning. The Review of Financial Studies 33: 2223–73. [Google Scholar] [CrossRef] [Green Version]
- Gu, Shihao, Bryan T. Kelly, and Dacheng Xiu. 2021. Autoencoder asset pricing models. Journal of Econometrics 222: 429–50. [Google Scholar] [CrossRef]
- Harvey, C., and Y. Liu. 2019. Lucky Factors. Available online: https://ideas.repec.org/a/eee/jfinec/v141y2021i2p413-435.html (accessed on 8 April 2021).
- Harvey, Campbell R. 2017. Presidential address: The scientific outlook in financial economics. Journal of Finance 72: 1399–40. [Google Scholar] [CrossRef]
- He, Zhiguo, Bryan T. Kelly, and Asaf Manela. 2016. Intermediary asset pricing: New evidence from many asset classes. Journal of Financial Economics 126: 1–35. [Google Scholar] [CrossRef] [Green Version]
- Heaton, J. B., N. G. Polson, and J. H. Witte. 2016. Deep Portfolio Theory. Available online: https://arxiv.org/abs/1605.07230 (accessed on 23 May 2016).
- Heaton, J. B., N. G. Polson, and J. H. Witte. 2017. Deep learning for finance: Deep portfolios. Applied Stochastic Models in Business and Industry 33: 3–12. [Google Scholar] [CrossRef]
- Henrique, Bruno Miranda, Vinicius Amorim Sobreiro, and Herbert Kimura. 2019. Literature review: Machine learning techniques applied to financial market prediction. Expert Systems with Applications 124: 226–51. [Google Scholar] [CrossRef]
- Hester, Todd, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Ian Osband, and et al. 2018. Deep q-learning from demonstrations. Paper presented at AAAI Conference on Artificial Intelligence, New Orleans, IL, USA, February 2–7; vol. 32. [Google Scholar]
- Hochreiter, Sepp, and Jürgen Schmidhuber. 1997. Lstm can solve hard long time lag problems. Advances in Neural Information Processing Systems 9: 473–479. [Google Scholar]
- Hou, Kewei, Chen Xue, and Lu Zhang. 2017. Replicating Anomalies. Available online: https://ssrn.com/abstract=2961979 (accessed on 12 June 2017).
- Hou, K., C. Xue, and L. Zhang. 2015. Digesting anomalies: An investment approach. Review of Financial Studies 28: 650–705. [Google Scholar] [CrossRef] [Green Version]
- Huang, Jian, Junyi Chai, and Stella Cho. 2020. Deep learning in finance and banking: A literature review and classification. Frontiers of Business Research in China 14: art. n. 13. [Google Scholar] [CrossRef]
- Huck, Nicolas. 2019. Large data sets and machine learning: Applications to statistical arbitrage. European Journal of Operational Research 278: 330–42. [Google Scholar] [CrossRef]
- Huotari, Tommi, Jyrki Savolainen, and Mikael Collan. 2020. Deep reinforcement learning agent for s&p 500 stock selection. Axioms 9: 130. [Google Scholar]
- Jacobs, Heiko, and Sebastian Mülle. 2020. Anomalies across the globe: Once public, no longer existent? Journal of Financial Economics 135: 213–30. [Google Scholar] [CrossRef]
- Jain, Prayut, and Shashi Jain. 2019. Can machine learning-based portfolios outperform traditional risk-based portfolios? The need to account for covariance misspecification. Risks 7: 74. [Google Scholar] [CrossRef] [Green Version]
- Jegadeesh, N. 1990. Evidence of predictable behavior of security returns. The Journal of Finance 45: 881–98. [Google Scholar] [CrossRef]
- Jiang, Weiwei. 2021. Applications of deep learning in stock market prediction: Recent progress. Expert Systems with Applications 184: 115537. [Google Scholar] [CrossRef]
- Jiang, Zhengyao, Dixing Xu, and Jinjun Liang. 2017. A deep reinforcement learning framework for the financial portfolio management problem. arXiv arXiv:1706.10059. [Google Scholar]
- Kamley, Sachin, Shailesh Jaloree, and R. S. Thakur. 2016. Performance forecasting of share market using machine learning techniques: A review. International Journal of Electrical and Computer Engineering 6: 3196–204. [Google Scholar]
- Kelly, Bryan T., Seth Pruitt, and Yinan Su. 2018. Characteristics Are Covariances: A Unified Model of Risk and Return. Available online: https://ssrn.com/abstract=3032013 (accessed on 15 October 2018).
- Khan, Wasiat, Mustansar Ali Ghazanfar, M. Azam, A. Karami, K. H. Alyoubi, and A. S. Alfakeeh. 2020. Stock market prediction using machine learning classifiers and social media, news. Journal of Ambient Intelligence and Humanized Computing 1: 1–24. [Google Scholar] [CrossRef]
- Khan, Wasiat, Usman Malik, Mustansar Ali Ghazanfar, Muhammad Awais Azam, Khaled H. Alyoubi, and Ahmed S. Alfakeeh. 2020. Predicting stock market trends using machine learning algorithms via public sentiment and political situation analysis. Soft Computing 24: 11019–43. [Google Scholar] [CrossRef]
- Konstantinov, Gueorgui S., Andreas Chorus, and Jonas Rebmann. 2020. A network and machine learning approach to factor, asset, and blended allocation. The Journal of Portfolio Management 46: 54–71. [Google Scholar] [CrossRef]
- Kozak, Serhiy, Stefan Nagel, and Shrihari Santosh. 2018. Interpreting factor models. Journal of Finance 73: 1183–223. [Google Scholar] [CrossRef]
- Kozak, Serhiy, Stefan Nagel, and Shrihari Santosh. 2019. Shrinking the cross-section. Journal of Financial Economics 135: 271–92. [Google Scholar] [CrossRef]
- Krauss, Christopher, Xuan Anh Do, and Nicolas Huck. 2017. Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500. European Journal of Operational Research 259: 689–702. [Google Scholar]
- Krkoska, Eduard, and Klaus Reiner Schenk-Hoppé. 2019. Herding in smart-beta investment products. Journal of Risk and Financial Management 12: 47. [Google Scholar] [CrossRef] [Green Version]
- Kumar, Indu, Kiran Dogra, Chetna Utreja, and Premlata Yadav. 2018. A comparative study of supervised machine learning algorithms for stock market trend prediction. Paper presented at International Conference on Inventive Communication and Computational Technologies, ICICCT 2018, Lalitpur, Nepal, July 20–22. [Google Scholar]
- Lai, Tze Leung, and Haipeng Xing. 2008. Statistical Models and Methods for Financial Markets. New York: Springer. [Google Scholar]
- LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature 521: 436–44. [Google Scholar] [CrossRef]
- LeCun, Yann, Léon Bottou, Yoshua Bengio, and Patrick Haffner. 1998. Gradient-based learning applied to document recognition. Proceedings of the IEEE 86: 2278–324. [Google Scholar] [CrossRef] [Green Version]
- Lee, Jinho, and Jaewoo Kang. 2020. Effectively training neural networks for stock index prediction: Predicting the s&p 500 index without using its index data. PLoS ONE 15: e0230635. [Google Scholar]
- Lee, Tae Kyun, Joon Hyung Cho, Deuk Sin Kwon, and So Young Sohn. 2019. Global stock market investment strategies based on financial network indicators using machine learning techniques. Expert Systems with Applications 117: 228–42. [Google Scholar] [CrossRef]
- Levenberg, Kenneth. 1944. A method for the solution of certain non-linear problems in least squares. Quarterly of Applied Mathematics 2: 164–68. [Google Scholar] [CrossRef] [Green Version]
- Li, Bin, and Steven C. H. Hoi. 2014. Online portfolio selection: A survey. ACM Computing Surveys 46: 1–33. [Google Scholar] [CrossRef]
- Li, Xiujun, Lihong Li, Jianfeng Gao, Xiaodong He, Jianshu Chen, Li Deng, and Ji He. 2015. Recurrent reinforcement learning: A hybrid approach. arXiv arXiv:1509.03044. [Google Scholar]
- Lintner, John. 1965. The valuation of risk assets and the selection of risky investments in stock portfolios and capital budgets. The Review of Economics and Statistics 47: 13–37. [Google Scholar] [CrossRef]
- Lloyd, Stuart. 1982. Least squares quantization in pcm. IEEE Transactions on Information Theory 28: 129–37. [Google Scholar] [CrossRef] [Green Version]
- Lo, A. W., and A. C. MacKinlay. 1988. Stock market prices do not follow random walks: Evidence from a simple specification test. Review of Financial Studies 1: 41. [Google Scholar] [CrossRef]
- Lu, Zhichen, Wen Long, Jiashuai Zhang, and Yingjie Tian. 2019. Factor integration based on neural networks for factor investing. Paper presented at Computational Science—ICCS 2019, 19th International Conference, Faro, Portugal, June 11–14; vol. 11538 LNCS, pp. 286–92. [Google Scholar]
- López de Prado, M. 2016. Building diversified portfolios that outperform out of sample. The Journal of Portfolio Management 42: 59–69. [Google Scholar] [CrossRef]
- Ma, Y., R. Han, and W. Wang. 2020. Prediction-based portfolio optimization models using deep neural networks. IEEE Access 8: 115393–405. [Google Scholar] [CrossRef]
- Ma, Yilin, Ruizhu Han, and Weizhong Wang. 2021. Portfolio optimization with return prediction using deep learning and machine learning. Expert Systems with Applications 165: 113973. [Google Scholar] [CrossRef]
- Malkiel, B. G., and E. F. Fama. 1970. Efficient capital markets: A review of theory and empirical work. Journal of Finance 25: 383–417. [Google Scholar] [CrossRef]
- Markowitz, Harry. 1952. Portfolio selection. The Journal of Finance 7: 77–91. [Google Scholar]
- Merton, Robert C. 1973. An intertemporal capital asset pricing model. Econometrica 41: 867–87. [Google Scholar] [CrossRef]
- Messmer, Marcial. 2017. Deep Learning and the Cross-Section of Expected Returns. Available online: https://ssrn.com/abstract=3081555 (accessed on 2 December 2017).
- Minh, Dang Lien, Abolghasem Sadeghi-Niaraki, Huynh Duc Huy, Kyungbok Min, and Hyeonjoon Moon. 2018. Deep learning approach for short-term stock trends prediction based on two-stream gated recurrent unit network. IEEE Access 6: 55392–404. [Google Scholar] [CrossRef]
- Moritz, Benjamin, and Tom Zimmermann. 2016. Tree-Based Conditional Portfolio Sorts: The Relation between Past and Future Stock Returns. Working Paper. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=2740751 (accessed on 9 January 2022).
- Nabipour, Mojtaba, Pooyan Nayyeri, Hamed Jabani, Shahab S., and Amir Mosavi. 2020. Predicting stock market trends using machine learning and deep learning algorithms via continuous and binary data; a comparative analysis. IEEE Access 8: 150199–212. [Google Scholar] [CrossRef]
- Nikou, Mahla, Gholamreza Mansourfar, and Jamshid Bagherzadeh. 2019. Stock price prediction using deep learning algorithm and its comparison with machine learning algorithms. Intelligent Systems in Accounting, Finance and Management 26: 164–74. [Google Scholar] [CrossRef]
- Niño, Jaime, Germán Hernández, Andrés Arévalo, and Diego León. 2018. Cnn with limit order book data for stock price prediction. Paper presented at Future Technologies Conference (FTC) 2018, Vancouver, BC, Canada, November 15–16. [Google Scholar]
- Novy-Marx, Robert, and Mihail Velikov. 2016. A taxonomy of anomalies and their trading costs. Review of Financial Studies 29: 104–47. [Google Scholar] [CrossRef]
- Nti, Isaac Kofi, Adebayo Felix Adekoya, and B. Weyori. 2019. A systematic review of fundamental and technical analysis of stock market predictions. Artificial Intelligence Review 53: 3007–57. [Google Scholar] [CrossRef]
- Ozbayoglu, Ahmet Murat, Mehmet Ugur Gudelek, and Omer Berat Sezer. 2020. Deep learning for financial applications: A survey. Applied Soft Computing Journal 93: 106384. [Google Scholar] [CrossRef]
- Paiva, Felipe Dias, Rodrigo Tomás Nogueira Cardoso, Gustavo Peixoto Hanaoka, and Wendel Moreira Duarte. 2019. Decision-making for financial trading: A fusion approach of machine learning and portfolio selection. Expert Systems with Applications 115: 635–55. [Google Scholar] [CrossRef]
- Park, Hyungjun, Min Kyu Sim, and Dong Gu Choi. 2020. An intelligent financial portfolio trading strategy using deep q-learning. Expert Systems with Applications 158: 113573. [Google Scholar] [CrossRef]
- Pastor, L., and R. F. Stambaugh. 2003. Liquidity risk and expected stock returns. Journal of Political Economy 111: 642–85. [Google Scholar] [CrossRef] [Green Version]
- Pearson, Karl. 1901. Liii. on lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2: 559–72. [Google Scholar] [CrossRef] [Green Version]
- Pitera, Marcin, and Thorsten Schmidt. 2018. Unbiased estimation of risk. Journal of Banking and Finance 91: 133–45. [Google Scholar] [CrossRef] [Green Version]
- Raffinot, Thomas. 2017. Hierarchical clustering-based asset allocation. Journal of Portfolio Management 44: 89–99. [Google Scholar] [CrossRef]
- Rasekhschaffe, Keywan Christian, and Robert C. Jones. 2019. Machine learning for stock selection. Financial Analysts Journal 75: 70–88. [Google Scholar] [CrossRef]
- Rosenblatt, Frank. 1958. The perceptron: A probabilistic model for information storage and organization in the brain. Psychological Review 65: 386. [Google Scholar] [CrossRef] [Green Version]
- Ross, Stephen A. 1976. The arbitrage theory of capital asset pricing. Journal of Economic Theory 13: 341–60. [Google Scholar] [CrossRef]
- Ross, Stephen A. 1978. A simple approach to the valuation of risky streams. The Journal of Business 51: 453–75. [Google Scholar] [CrossRef]
- Sezer, Omer Berat, Mehmet Ugur Gudelek, and Ahmet Murat Ozbayoglu. 2020. Financial time series forecasting with deep learning: A systematic literature review: 2005–2019. Applied Soft Computing 90: 106181. [Google Scholar] [CrossRef] [Green Version]
- Sezer, Omer Berat, and Ahmet Murat Ozbayoglu. 2018. Algorithmic financial trading with deep convolutional neural networks: Time series to image conversion approach. Applied Soft Computing 70: 525–38. [Google Scholar] [CrossRef]
- Sezer, Omer Berat, Murat Ozbayoglu, and Erdogan Dogdu. 2017. A deep neural-network based stock trading system based on evolutionary optimized technical analysis parameters. Procedia Computer Science 114: 473–80. [Google Scholar] [CrossRef]
- Sharpe, William F. 1964. American finance association capital asset prices: A theory of market equilibrium under conditions of risk. The Journal of Finance 19: 425–42. [Google Scholar]
- Shen, Jingyi, and M. Omair Shafiq. 2020. Short-term stock market price trend prediction using a comprehensive deep learning system. Journal of Big Data 7: 66. [Google Scholar] [CrossRef]
- Simonian, Joseph, Chenwei Wu, Daniel Itano, and Vyshaal Narayanam. 2019. A machine learning approach to risk factors: A case study using the fama–french–carhart model. The Journal of Financial Data Science 1: 32–44. [Google Scholar] [CrossRef]
- Sirignano, Justin A., and R. Cont. 2019. Universal features of price formation in financial markets: Perspectives from deep learning. Quantitative Finance 19: 1449–59. [Google Scholar] [CrossRef]
- Snow, Derek. 2020. Machine learning in asset management-part 1: Portfolio construction-trading strategies. The Journal of Financial Data Science 2: 10–23. [Google Scholar] [CrossRef]
- Song, Qiang, Anqi Liu, and Steve Y. Yang. 2017. Stock portfolio selection using learning-to-rank algorithms with news sentiment. Neurocomputing 264: 20–28. [Google Scholar] [CrossRef] [Green Version]
- Sugitomo, Seisuke, and Shotaro Minami. 2018. Fundamental factor models using machine learning. Journal of Mathematical Finance 8: 111–18. [Google Scholar] [CrossRef] [Green Version]
- Sun, Chuanping. 2020. Dissecting the Factor Zoo: A Correlation-Robust Machine Learning Approach. Available online: https://ssrn.com/abstract=3263420 (accessed on 4 November 2020).
- Sutton, Richard S., and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction. Cambridge: MIT Press. [Google Scholar]
- Ta, Van Dai, Chuan Ming Liu, and Direselign Addis Tadesse. 2020. Portfolio optimization-based stock prediction using long-short term memory network in quantitative trading. Applied Sciences 10: 437. [Google Scholar] [CrossRef] [Green Version]
- Tibshirani, Robert. 1996. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological) 58: 267–88. [Google Scholar] [CrossRef]
- Timmermann, Allan, and Clive W. J. Granger. 2004. Efficient market hypothesis and forecasting. International Journal of forecasting 20: 15–27. [Google Scholar] [CrossRef]
- Tkáč, Michal, and Robert Verner. 2016. Artificial neural networks in business: Two decades of research. Applied Soft Computing Journal 38: 788–804. [Google Scholar]
- Tobek, Ondrej, and Martin Hronec. 2020. Does it pay to follow anomalies research? machine learning approach with international evidence. Journal of Financial Markets 2: 100588. [Google Scholar] [CrossRef]
- Tristan, Lim, and Ong Chin Sin. 2021. Portfolio management: A financial application of unsupervised shape-based clustering-driven machine learning method. International Journal of Computing and Digital Systems 10: 235–43. [Google Scholar]
- Troiano, Luigi, Elena Mejuto Villa, and Vincenzo Loia. 2018. Replicating a trading strategy by means of lstm for financial industry applications. IEEE Transactions on Industrial Informatics 14: 3226–34. [Google Scholar] [CrossRef]
- Tsantekidis, Avraam, Nikolaos Passalis, Anastasios Tefas, Juho Kanniainen, Moncef Gabbouj, and Alexandros Iosifidis. 2017. Forecasting stock prices from thelimit order book using convolutional neural networks. Paper presented at 2017 IEEE 19th Conference on Business Informatics (CBI), Thessaloniki, Greece, July 24–27. [Google Scholar]
- Tsantekidis, A., N. Passalis, A. Tefas, J. Kanniainen, M. Gabbouj, and A. Iosifidis. 2017. Using deep learning to detect price change indications in financial markets. Paper presented at 2017 25th European Signal Processing Conference (EUSIPCO), Kos, Greece, August 28. [Google Scholar]
- Van Dijk, M. A. 2011. Is size dead? A review of the size effect in equity returns. Journal of Banking and Finance 35: 3263–74. [Google Scholar] [CrossRef]
- Vo, Nhi N. Y., Xuezhong He, Shaowu Liu, and Guandong Xu. 2019. Deep learning for decision making and the optimization of socially responsible investments and portfolio. Decision Support Systems 124: 113097. [Google Scholar] [CrossRef]
- Wang, Wuyu, Weizi Li, Ning Zhang, and Kecheng Liu. 2020. Portfolio formation with preselection using deep learning from long-term financial data. Expert Systems with Applications 143: 113042. [Google Scholar] [CrossRef]
- Watkins, Christopher J. C. H., and Peter Dayan. 1992. Q-learning. Machine Learning 8: 279–92. [Google Scholar] [CrossRef]
- Weigand, Alois. 2019. Machine learning in empirical asset pricing. Financial Markets and Portfolio Management 33: 93–104. [Google Scholar] [CrossRef]
- Xing, Frank Z., Erik Cambria, and Roy E. Welsch. 2018. Natural language based financial forecasting: A survey. Artificial Intelligence Review 50: 49–73. [Google Scholar] [CrossRef] [Green Version]
- Xue, Jingming, Qiang Liu, Miaomiao Li, Xinwang Liu, Yongkai Ye, Siqi Wang, and Jianping Yin. 2018. Incremental multiple kernel extreme learning machine and its application in robo-advisors. Soft Computing 22: 3507–17. [Google Scholar] [CrossRef]
- Yun, Hyungbin, Minhyeok Lee, Yeong Seon Kang, and Junhee Seok. 2020. Portfolio management via two-stage deep learning with a joint cost. Expert Systems with Applications 143: 113041. [Google Scholar] [CrossRef]
- Zhong, Xiao, and David Enke. 2019. Predicting the daily return direction of the stock market using hybrid machine learning algorithms. Financial Innovation 5: 1–20. [Google Scholar] [CrossRef]
Author | Journal/Venue | Period | Citations | References | Area | Market |
---|---|---|---|---|---|---|
Ozbayoglu et al. (2020) | 1 | 2014–2018 | 44 | 196 | GS | SM, CC, DV |
Huang et al. (2020) | 4 * | 2014–2018 | 13 | 51 | GS | SM, FX, CP |
Tkáč and Verner (2016) | 1 | 1994–2015 | 184 | 425 | GS | SM, FX, IR, DV |
Cavalcante et al. (2016) | 2 | 2009–2015 | 282 | 144 | GS | SM, FX, DV |
Sezer et al. (2020) | 1 | 2005–2019 | 162 | 216 | PF | SM, FX, CP |
Henrique et al. (2019) | 2 | 1991–2017 | 101 | 98 | PF | SM |
Bustos and P.-Quimbaya (2020) | 2 | 2014–2018 | 24 | 87 | PF | SM |
Kamley et al. (2016) | 5 | 2000–2015 | 13 | 68 | PF | SM |
Jiang (2021) | 2 | 2017–2019 | 39 | 234 | PF | SM |
Nti et al. (2019) | 3 | 2017–2019 | 36 | 207 | PF | SM |
Xing et al. (2018) | 3 | 1998–2016 | 157 | 153 | PF | SM |
Durairaj and Mohan (2019) | 6 | 1999–2019 | 15 | 46 | PF | SM, FX, IR, DV |
Weigand (2019) | 7 * | 1994–2018 | 8 | 49 | AP | SM, IR, DV, RE |
Emerson et al. (2019) | 9 * | 2015–2018 | 1 | 81 | PM | SM |
Li and Hoi (2014) | 8 | 1991–2013 | 137 | 246 | PM | SM |
Themes | References |
---|---|
Value Investing | 18 |
Portfolio Management | 31 |
Price Forecasting | 25 |
Algorithmic Trading | 17 |
TOTAL | 91 |
Author | Target Market | Method | Performance Criteria |
---|---|---|---|
Tobek and Hronec (2020) | NYSE, Amex and NASDAQ common stocks | WLS, PWLS, RF, GBRT, NN | Average return, Sharpe ratio, MDD |
Giglio and Xiu (2019) | US stocks, T-bonds, C-Bonds and currencies | PCA | R Squared, p-value |
Kelly et al. (2018) | World stocks | IPCA | R Squared, p-value |
Moritz and Zimmermann (2016) | US stocks | DT | Excess returns, R Squared, MSE |
Kozak et al. (2019) | US stocks | Bayesian and Lasso Regressions | OOS R2, Sharpe ratio |
Messmer (2017) | US stocks | DFNN | Sharpe ratio |
Feng et al. (2018a) | NYSE, Amex and NASDAQ common stocks | DFNN | Sharpe ratio |
Chen et al. (2020) | US stocks | DFNN, LSTM, GAN | Sharpe ratio |
Feng et al. (2018b) | NYSE, Amex and NASDAQ common stocks | TensorFlow, SGD, AD | MSE, R Squared |
Simonian et al. (2019) | US stocks | RF, ARL | R Squared, Annual return, Sharpe ratio |
Sun (2020) | NYSE common stocks | Ordered-Weighted LASSO | SR, Mean returns |
Freyberger et al. (2020) | NYSE, Amex and NASDAQ common stocks | LASSO | Sharpe ratio |
Lu et al. (2019) | Chinese stocks | NN, MLP | Average return, Sharpe ratio |
Feng and He (2019) | US stocks | Bayesian Hierarchical | OOS R2 |
Feng et al. (2020) | US stocks | DS LASSO | SR, Mean returns, t-stat |
Sugitomo and Minami (2018) | TOPIX 500 stocks | SVM, GBRT and NN | Average return, Sharpe ratio, RMSE |
Avramov et al. (2021) | US stocks | NN3, FFN, LSTM, GAN | Average return, Sharpe ratio |
Aw et al. (2019) | US stocks | NNs | Average return, Sharpe ratio |
Gogas et al. (2018) | NYSE, Amex and NASDAQ common stocks | SVR | R Squared, MAPE |
Author | Target Market | Method | Performance Criteria |
---|---|---|---|
Ban et al. (2018) | NYSE, Amex and NASDAQ common stocks | PBR | Sharpe ratio, Turnover |
Rasekhschaffe and Jones (2019) | Stocks from 22 countries | GBRT, SVM, AB, DNN | Excess return, Alpha |
Huck (2019) | US Stocks | DFN, RF, EN | VaR, Sharpe ratio, Max. Drawdown |
Huotari et al. (2020) | S&P 500 stocks | ANN, EIIE | Sharpe ratio, p-value |
Krauss et al. (2017) | S&P 500 stocks | DNN, GBRT, RF | Return distribution, VaR, Calmar ratio |
Park et al. (2020) | US and Korean ETFs | LSTM, DNN, Q-Learning | Cum. return, Sharpe ratio, Turnover |
Heaton et al. (2017) | IBB Index | Autoencoders | Validation error |
López de Prado (2016) | Monte Carlo simulations | HRP | OOS variance |
Yun et al. (2020) | World ETFs | PCA, LSTM | IR, MDD, VaR, CVaR |
Raffinot (2017) | S&P 500 stocks | HRP | IR, Sharpe ratio, MDD |
Jain and Jain (2019) | NIFTY 50 index | HRP | CVaR, Sharpe Ratio |
Tristan and Chin Sin (2021) | Singapore Index | AHC-DTW clustering | Cum. Return, Sharpe ratio |
Konstantinov et al. (2020) | World Assets | NN, LASSO regressions | Sharpe ratio, MDD, CEQ |
Xue et al. (2018) | Shanghai ETFs | FFN, IMK-ELN | MAP, MDD, Sharpe ratio |
Wang et al. (2020) | UK Stock Exchange 100 Index | LSTM+MVO | MSE, RMSE, MAPE, MAE, R2 |
Ta et al. (2020) | S&P 500 stocks | LSTM+MVO | Sharpe ratio |
Lee et al. (2019) | World equity indices | SVM | Directional accuracy |
Song et al. (2017) | Selected US Stocks | ListNet and RankNet (NN) | Sharpe ratio |
Vo et al. (2019) | S&P 500 stocks | LSTM+MVO | MAE, RMSE |
Ma et al. (2020) | China Securities 100 Index | DMLP, LSTM, CNN | MAE, MSE, MDD |
Ma et al. (2021) | China Securities 100 Index | DMLP, LSTM, CNN, SVR, RF | MAE, MSE, MDD |
Almahdi and Yang (2017) | US and World ETFs | RRL | Sharpe ratio, Calmar ratio, Sterling ratio |
Aboussalah and Lee (2020) | Selected US Stocks | SDDRRL | Total return |
Paiva et al. (2019) | Ibovespa stocks | SVM | Average return, st.deviation |
Author | Target Market | Method | Performance Criteria |
---|---|---|---|
Kumar et al. (2018) | Selected Indian stocks | SVM, RF, KNN, NB, Softmax | Accuracy, F-measure |
Lee and Kang (2020) | S&P 500 stocks | MLP, CNN | Total return, MDD |
Cervelló-Royo and Guijarro (2020) | Nasdaq 100 stocks | GBM, RF, CNN | Average accuracy ratio |
Nabipour et al. (2020) | Selected Indian stocks | DT, RF, KNN, LR, ANN, RNN, LSTM | Accuracy, F-measure |
Zhong and Enke (2019) | S&P 500 ETFs | DNN, FFNN | MSE |
Shen and Shafiq (2020) | Selected Chinese stocks | CNN, LSTM | Overall accuracy |
Nikou et al. (2019) | iShares MSCI UK ETFs | ANN, SVM, RF, RNN, LSTM | MAE, MSE, RMSE |
Minh et al. (2018) | S&P 500 index | RNN, TGRU, LSTM | Overall accuracy |
Ding et al. (2015) | S&P 500 index | WB+CNN | Total return |
Khan et al. (2020) | Selected US stocks | GNB, SVM, LR, MLP, KNN, GBM, RF | Accuracy, precision, recall, F-measure |
Author | Target Market | Method | Performance Criteria |
---|---|---|---|
Sezer et al. (2017) | Dow 30 stocks | MLP-ANN | Overall accuracy |
Troiano et al. (2018) | Dow 30 stocks | LSTM | Overall accuracy |
Sirignano and Cont (2019) | Selected US stocks | LSTM | Overall accuracy |
Tsantekidis et al. (2017) | Selected Finnish stocks | CNNs | Recall, precision, F1 |
Sezer and Ozbayoglu (2018) | World ETFss | CNNs | Annualized returns |
Niño et al. (2018) | Selected US stocks | CNNs | Directional accuracy |
Tsantekidis et al. (2017) | Selected Finnish stocks | LSTM | Recall, precision, F1 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Mirete-Ferrer, P.M.; Garcia-Garcia, A.; Baixauli-Soler, J.S.; Prats, M.A. A Review on Machine Learning for Asset Management. Risks 2022, 10, 84. https://doi.org/10.3390/risks10040084
Mirete-Ferrer PM, Garcia-Garcia A, Baixauli-Soler JS, Prats MA. A Review on Machine Learning for Asset Management. Risks. 2022; 10(4):84. https://doi.org/10.3390/risks10040084
Chicago/Turabian StyleMirete-Ferrer, Pedro M., Alberto Garcia-Garcia, Juan Samuel Baixauli-Soler, and Maria A. Prats. 2022. "A Review on Machine Learning for Asset Management" Risks 10, no. 4: 84. https://doi.org/10.3390/risks10040084