Algorithmic trading is one of the most concerned directions in financial applications. Compared with traditional trading strategies, algorithmic trading applications perform forecasting and arbitrage with higher efficiency and more stable performance. Numerous studies on algorithmic trading models using deep learning have been conducted to perform trading forecasting and analysis. In this article, we firstly summarize several deep learning methods that have shown good performance in algorithmic trading applications, and briefly introduce some applications of deep learning in algorithmic trading. We then try to provide the latest snapshot application for algorithmic trading based on deep learning technology, and show the different implementations of the developed algorithmic trading model. Finally, some possible research issues are suggested in the future. The prime objectives of this paper are to provide a comprehensive research progress of deep learning applications in algorithmic trading, and benefit for subsequent research of computer program trading systems.
Citation: Yongfeng Wang, Guofeng Yan. Survey on the application of deep learning in algorithmic trading[J]. Data Science in Finance and Economics, 2021, 1(4): 345-361. doi: 10.3934/DSFE.2021019
Related Papers:
[1]
Youtian Hao, Guohua Yan, Renjun Ma, M. Tariqul Hasan .
Linking dynamic patterns of COVID-19 spreads in Italy with regional characteristics: a two level longitudinal modelling approach. Mathematical Biosciences and Engineering, 2021, 18(3): 2579-2598.
doi: 10.3934/mbe.2021131
[2]
Weike Zhou, Aili Wang, Fan Xia, Yanni Xiao, Sanyi Tang .
Effects of media reporting on mitigating spread of COVID-19 in the early phase of the outbreak. Mathematical Biosciences and Engineering, 2020, 17(3): 2693-2707.
doi: 10.3934/mbe.2020147
[3]
Marco Roccetti .
Excess mortality and COVID-19 deaths in Italy: A peak comparison study. Mathematical Biosciences and Engineering, 2023, 20(4): 7042-7055.
doi: 10.3934/mbe.2023304
[4]
Marco Roccetti .
Drawing a parallel between the trend of confirmed COVID-19 deaths in the winters of 2022/2023 and 2023/2024 in Italy, with a prediction. Mathematical Biosciences and Engineering, 2024, 21(3): 3742-3754.
doi: 10.3934/mbe.2024165
[5]
Tianfang Hou, Guijie Lan, Sanling Yuan, Tonghua Zhang .
Threshold dynamics of a stochastic SIHR epidemic model of COVID-19 with general population-size dependent contact rate. Mathematical Biosciences and Engineering, 2022, 19(4): 4217-4236.
doi: 10.3934/mbe.2022195
[6]
S. H. Sathish Indika, Norou Diawara, Hueiwang Anna Jeng, Bridget D. Giles, Dilini S. K. Gamage .
Modeling the spread of COVID-19 in spatio-temporal context. Mathematical Biosciences and Engineering, 2023, 20(6): 10552-10569.
doi: 10.3934/mbe.2023466
[7]
Aili Wang, Xueying Zhang, Rong Yan, Duo Bai, Jingmin He .
Evaluating the impact of multiple factors on the control of COVID-19 epidemic: A modelling analysis using India as a case study. Mathematical Biosciences and Engineering, 2023, 20(4): 6237-6272.
doi: 10.3934/mbe.2023269
[8]
Sarah R. Al-Dawsari, Khalaf S. Sultan .
Modeling of daily confirmed Saudi COVID-19 cases using inverted exponential regression. Mathematical Biosciences and Engineering, 2021, 18(3): 2303-2330.
doi: 10.3934/mbe.2021117
[9]
Peng An, Xiumei Li, Ping Qin, YingJian Ye, Junyan Zhang, Hongyan Guo, Peng Duan, Zhibing He, Ping Song, Mingqun Li, Jinsong Wang, Yan Hu, Guoyan Feng, Yong Lin .
Predicting model of mild and severe types of COVID-19 patients using Thymus CT radiomics model: A preliminary study. Mathematical Biosciences and Engineering, 2023, 20(4): 6612-6629.
doi: 10.3934/mbe.2023284
[10]
Yuto Omae, Yohei Kakimoto, Makoto Sasaki, Jun Toyotani, Kazuyuki Hara, Yasuhiro Gon, Hirotaka Takahashi .
SIRVVD model-based verification of the effect of first and second doses of COVID-19/SARS-CoV-2 vaccination in Japan. Mathematical Biosciences and Engineering, 2022, 19(1): 1026-1040.
doi: 10.3934/mbe.2022047
Abstract
Algorithmic trading is one of the most concerned directions in financial applications. Compared with traditional trading strategies, algorithmic trading applications perform forecasting and arbitrage with higher efficiency and more stable performance. Numerous studies on algorithmic trading models using deep learning have been conducted to perform trading forecasting and analysis. In this article, we firstly summarize several deep learning methods that have shown good performance in algorithmic trading applications, and briefly introduce some applications of deep learning in algorithmic trading. We then try to provide the latest snapshot application for algorithmic trading based on deep learning technology, and show the different implementations of the developed algorithmic trading model. Finally, some possible research issues are suggested in the future. The prime objectives of this paper are to provide a comprehensive research progress of deep learning applications in algorithmic trading, and benefit for subsequent research of computer program trading systems.
1.
Introduction
Since the emergence of the COVID-19 pandemic around December 2019, the outbreak has snowballed globally [1,2], and there is no clear sign that the new confirmed cases and deaths are coming to an end. Though vaccines are rolling out to deter the spread of this pandemic, the mutations of the viruses are already under way [3,4,5,6]. Despite the fact and research that the origin of the pandemic is still in debate [7], many researchers are conducting their study from different aspects and perspectives. They could be categorised mainly into three levels: SARS-CoV-2 genetic level [8], COVID-19 individual country level [9,10,11] and continental levels [12,13]. In this study, we focus on the latter two levels. Regarding these two levels, there are many methods and techniques on these issues. For example, linear and non-linear growth models, together with 2-week-kernel-window regression, are exploited in modelling the exponential growth rate of COVID-19 confirmed cases [14] - which are also generalised to non-linear modelling of COVID-19 pandemic [15,16]. Some research works focus on the prediction of COVID-19 spread by estimating the lead-lag effects between different countries via time warping technique [17], while some utilise clustering analyses to group countries via epidemiological data of active cases, active cases per population, etc.[18]. In addition, there are other researches focusing on tackling the relationship between economic variables and COVID-19 related variables [19,20] - though both the results show there are no relation between economic freedom and COVID-19 deaths and no relation between the performance of equality markets and the COVID-19 cases and deaths.
In this study, we aim to extract the features of daily biweekly growth rates of cases and deaths on national and continental levels. We devise the orthonormal bases based on Fourier analysis [21,22], in particular Fourier coefficients for the potential features. For the national levels, we import the global time series data and sample 117 countries for 109 days [23,24]. Then we calculate the Euclidean distance matrices for the inner products between countries and between days. Based on the distance matrices, we then calculate their variabilities to delve into the distribution of the data. For the continental level, we also import the biweekly changes of cases and deaths for 5 continents as well as the world data with time series data for 447 days. Then we calculate their inner products with respect to the temporal frequencies and find the similarities of extracted features between continents.
For the national levels, the biweekly data bear higher temporal features than spatial features, i..e., as time goes by, the pandemic evolves more in the time dimension than the space (or country-wise) dimension. Moreover, there exists a strong concurrency between features for biweekly changes of cases and deaths, though there is no clear or stable trend for the extracted features. However, in the continental level, one observes that there is a stable trend of features regarding biweekly change. In addition, the extracted features between continents are similar to one another, except Asia whose features bear no clear similarities with other continents.
Our approach is based on orthonormal bases, which serve as the potential features for the biweekly change of cases and deaths. This method is straightforward and easy to comprehend. The limitations of this approach are the extracted features are based on the hidden frequencies of the dynamical structure, which is hard to assign a interpretable meaning for the frequencies, and the data fetched are not complete, due to the missing data in the database. However the results provided in this study could help one map out the evolutionary features of COVID-19.
2.
Method and implementation
Let δ:N→{0,1} be a function such that δ(n)=0 (or δn=0), if n∈2N and δ(n)=1, if n∈2N+1. Given a set of point data D={→v}⊆RN, we would like to decompose each →v into some frequency-based vectors by Fourier analysis. The features of COVID-19 case and death growth rates are specified by the orthogonal frequency vectors BN={→fij:1≤j≤N}Ni=1, which is based on Fourier analysis, in particular Fourier series [22], where
● →f1j=√1N for all 1≤j≤N;
● For any 2≤i≤N−1+δN,
→fij=√2N⋅cos[π2⋅δi−(i−δi)⋅πN⋅j;
(2.1)
● If N∈2N, then →fNj=√1N⋅cos(j⋅π) for all 1≤j≤N.
Now we have constructed an orthonormal basis FN={→f1,→f2,⋯,→fN} as features for RN. Now each →v=N∑i=1<→v,→fi>⋅→fi, where <,> is the inner product. The basis BN could also be represented by a matrix
and the representation of a data column vector →v={(-3,14,5,8,-12)} with respect to B5 is calculated by F5→v=[<→v,→fi>]5i=1 or a column vector or 5-by-1 matrix (5.367,-16.334,-3.271,-6.434,-9.503).
2.1. Data description and handling
There are two main parts of data collection and handling - one for individual countries (or national level) and the other for individual continents (or continental level). In both levels, we fetch the daily biweekly growth rates of confirmed COVID-19 cases and deaths from Our World in Data [23,24]. Then we use R programming 4.1.0 to handle the data and implement the procedures.
Sampled targets: national. After filtering out non-essential data and missing data, the effective sampled data are 117 countries with effective sampled 109 days as shown in Results. The days range from December 2020 to June 2021. Though the sampled days are not subsequent ones (due to the missing data), the biweekly information could still cover such loss. In the latter temporal and spatial analyses, we will conduct our study based on these data.
Sampled targets: continental. As for the continental data, we collect data regarding the world, Africa, Asia, Europe, North and South America. The sampled days range from March 22nd, 2020 to June 11th, 2021. In total, there are 449 days (this is different from the national level). In the latter temporal analysis (there is no spatial analysis in the continental level, due to the limited sampling size), we will conduct our study based on these data.
Notations: national. For further processing, let us utilise some notations to facilitate the introduction. Let the sampled countries be indexed by i=1,...,117. Let the sampled days be indexed by t=1,...,109. Days range from December 3rd 2020 to May 31st 2021. Let ci(t) and di(t) be the daily biweekly growth rates of confirmed cases and deaths in country i on day t, respectively, i.e.,
ci(t):=casei,t+13−casei,tcasei,t;
(2.3)
di(t):=deathi,t+13−deathi,tdeathi,t,
(2.4)
where casei,t and deathi,t denote the total confirmed cases and deaths for country i at day t, respectively. We form temporal and spatial vectors by
the vector ci and di give every count in time for a given country, and the vector v(t) and w(t) give every countries' count for a given time.
Notations: continental. For further processing, let us utilise some notations to facilitate the introduction. Let the sampled continents be indexed by j=1,...,6. Let the 447 sampled days range from March 22nd 2020 to June 11th 2021. We form temporal vectors for confirmed cases and deaths by
For any m-by-n matrix A, we use min(A) to denote the value min{aij:1≤i≤m;1≤j≤n}. Similarly, we define max(A) by the same manner. If →v is a vector, we define min(→v) and max(→v) in the same manner. The implementation goes as follows:
(1) Extract and trim and source data.
Extraction: national. Extract the daily biweekly growth rates of COVID-19 cases and deaths from the database and trim the data. The trimmed data consist of 109 time series data for 117 countries as shown in Table 1, which consists of two 117-by-109 matrices:
Row i in the matrices are regarded as temporal vectors ci and di respectively, and Column t in the matrices are regarded as spatial vectors v(t) and w(t) respectively.
Extraction: continental. As for the continental data, they are collected by two 6-by-447 matrices:
Biweekly_cont_cases=[xj(τ)]τ=1:447j=1:6;
Biweekly_cont_deaths=[yj(t)]τ=1:447j=1:6.
(2) Specify the frequencies (features) for the imported data.
Basis: national. In order to decompose ci and di into some fundamental features, we specify F109 as the corresponding features, whereas to decompose v(t) and w(t), we specify F117 as the corresponding features. The results are presented in Table 2.
Table 2.
Orthonormal temporal frequencies for 109 days (upper block or F109) and orthonormal spatial frequencies for 117 countries (lower block or F117).
Average variability: continental. For each continent j, the temporal variabilities for confirmed cases and deaths are computed by
var_cont_case_time[j]=6∑k=1dE(F447xj,F447xk)447;
var_cont_death_time[j]=6∑k=1dE(F447yj,F447yk)447.
(6) Unify the national temporal and spatial variabilities of cases and deaths. For each country i, the unified temporal and spatial variabilities for cases and deaths are defined by
where σij and βij denotes the value in the (i,j) cells of IP_cont_cases_time and IP_cont_deaths_time, respectively. The results are visualised by figures in Results.
3.
Results
There are two main parts of results shown in this section: national results and continental results.
National results. Based on the method mentioned in section 2, we identify the temporal orthonormal frequencies and spatial ones as shown in Table 2.
The computed inner products at country levels, served as the values for extracted features, for daily biweekly growth rates of cases and deaths with respect to temporal frequencies are shown in Figure 1. Similarly, the computed inner products at a country level for daily biweekly growth rates of cases and deaths with respect to spatial frequencies are shown in Figure 2. Meanwhile, their scaled variabilities are plotted in Figure 3.
Figure 1.
Inner products between growth rates of cases (in solid line) over 109 temporal frequencies; and inner products between growth rates of deaths (in dotted line) over 109 temporal frequencies for some demonstrative countries: Afghanistan, Albania, Algeria, Uruguay, Zambia, and Zimbabwe.
Figure 2.
Inner products between growth rates of cases (in solid line) over 117 spatial frequencies; and inner products between growth rates of deaths (in dotted line) over 117 spatial frequencies for some demonstrative dates: 2020/12/3, 2020/12/4, 2020/12/5, 2021/5/29, 2021/5/30, and 2021/5/31.
Continental results. According to the obtained data, we study and compare continental features of daily biweekly growth rates of confirmed cases and deaths of Africa, Asia, Europe, North America, South America and World. Unlike the missing data in analysing individual countries, the continental data are complete. We take the samples from March 22nd, 2020 to June 11th, 2021. In total, there are 447 days for the analysis. The cosine values which compute the similarities between representations for continents are shown in Table 3. The results of the unified inner products with respect to confirmed cases and deaths are plotted in Figures 4 and 5, respectively.
Table 3.
Cosine values (similarities) between World, Africa, Asia, Europe, North America (No. Am.), and South America (So. Am.).
Figure 5.
Unified inner product, or UIP, for world, Africa, Asia, Europe, North and South America with respect to daily biweekly growth rates of deaths.
Other auxiliary results that support the plotting of the graphs are also appended in Appendix. The names of the sampled 117 countries are provided in Tables A1 and A2. The dates of the sampled days are provided in Figure A1. The tabulated results for inner product of temporal and spatial frequencies on a national level are provided in Table A3. The tabulated results for inner product of temporal frequencies on a continental level are provided in Table A4. The Euclidean distance matrices for temporal and spatial representations with respect to confirmed cases and deaths are tabulated in Table A5 and their average variabilities are tabulated in Table A6.
Summaries of results. Based on the previous tables and figures, we have the following results.
(1) From Figures 1 and 2, one observes that the temporal features are much more distinct that the spatial features, i.e., if one fixes one day and extracts the features from the spatial frequencies, he obtains less distinct features when comparing with fixing one country and extracting the features from the temporal frequencies. This indicates that SARS-CoV-2 evolves and mutates mainly according to time than space.
(2) For individual countries, the features for the biweekly changes of cases are almost concurrent with those of deaths. This indicates biweekly changes of cases and deaths share the similar features. In some sense, the change of deaths is still in tune with the change of confirmed cases, i.e., there is no substantial change between their relationship.
(3) For individual countries, the extracted features go up and down intermittently and there is no obvious trend. This indicates the virus is still very versatile and hard to capture its fixed features in a country-level.
(4) From Figure 3, one observes that there is a clear similarities, in terms of variabilities, for both daily biweekly growth rates of cases and deaths under temporal frequencies. Moreover, the distribution of overall data is not condensed, where middle, labelled countries are scattering around the whole data. This indicates the diversity of daily biweekly growth rates of cases and deaths across countries is still very high.
(5) From Figure 3, the daily biweekly growth rates of deaths with respect to the spatial frequencies are fairly concentrated. This indicates the extracted features regarding deaths are stable, i.e., there are clearer and stabler spatial features for daily biweekly growth rates of deaths.
(6) Comparing the individual graphs in Figures 4 and 5, they bear pretty much the same shape, but in different scale - with death being higher feature oriented (this is also witnessed in a country-level as claimed in the first result above). This indicates there is a very clear trend of features regarding daily biweekly growth rates in a continental level (this is a stark contrast to the third claimed result above).
(7) From Figures 4 and 5, the higher values of inner products lie in both endpoints for biweekly change of cases and deaths, i.e., low temporal frequencies and high temporal frequencies for all the continents, except the biweekly change of deaths in Asia. This indicates the evolutionary patterns in Asia are very distinct from other continents.
(8) From Table 3, the extracted features are all very similar to each continents, except Asia. This echoes the above result.
4.
Conclusions and future work
In this study, we identify the features of daily biweekly growth rates of COVID-19 confirmed cases and deaths via orthonormal bases (features) which derive from Fourier analysis. Then we analyse the inner products which represent the levels of chosen features. The variabilities for each country show the levels of deaths under spatial frequencies are much more concentrated than others. The generated results are summarised in Results 3. There are some limitations in this study and future improvements to be done:
● The associated meanings of the orthonormal features from Fourier analysis are not yet fully explored;
● We use the Euclidean metric to measure the distances between features, which is then used to calculate the variabilities. Indeed Euclidean metric is noted for its geographical properties, but may not be the most suitable in the context of frequencies. One could further introduce other metrics and apply machine learning techniques to find out the optimal ones.
● In this study, we choose the daily biweekly growth rates of confirmed cases and deaths as our research sources. This is a one-sided story. To obtain a fuller picture of the dynamical features, one could add other variables for comparison.
Acknowledgements
This work is supported by the Humanities and Social Science Research Planning Fund Project under the Ministry of Education of China (No. 20XJAGAT001).
Conflict of interest
No potential conflict of interest was reported by the authors.
Table A3.
Inner products w.r.t. temporal (case: upper top and death: upper bottom blocks) and spatial (case: lower top and death: lower bottom blocks) frequencies at a national level.
Table A4.
Temporal inner product for continents (World, Africa, Asia, Europe, North and South America) w.r.t. daily biweekly growth rates of cases (upper block) and deaths (lower block) from March 22nd, 2020 to June 11th, 2021 (447 days).
Table A5.
Distance matrices for daily biweekly growth rates of cases (uppermost block) and deaths (2nd block) w.r.t. temporal frequencies and the ones of cases (3rd block) and deaths (bottommost block) w.r.t. spatial frequencies.
Table A6.
Variability for 117 countries with respect to daily biweekly growth rates of cases and death under temporal frequencies and variability for 109 days with respect to daily biweekly growth rates of cases and death under spatial frequencies.
Baek Y, Kim H (2018) ModAugNet: A new forecasting framework for stock market index value with an overfitting prevention LSTM module and a prediction LSTM module. Expert Syst Appl 113: 457–480. doi: 10.1016/j.eswa.2018.07.019
[2]
Boehmer E, Fong K, Wu J (2012) International evidence on algorithmic trading. In AFA 2013 San Diego Meetings Paper.
[3]
Chen C, Zhang P, Liu Y, et al. (2020) Financial quantitative investment using convolutional neural network and deep learning technology. Neurocomputing 390: 384–390. doi: 10.1016/j.neucom.2019.09.092
[4]
Chen J, Chen W, Huang C, et al. (2016) Financial Time-Series Data Analysis Using Deep Convolutional Neural Networks. In 2016 7th International Conference on Cloud Computing and Big Data (CCBD), 87–92.
[5]
Chen K, Zhou Y, Dai F (2015) A LSTM-based method for stock returns prediction: A case study of China stock market. In 2015 IEEE International Conference on Big Data (Big Data), 2823–2824.
[6]
Chen S, He H (2018) Stock Prediction Using Convolutional Neural Network. In IOP Conference Series: Materials Science and Engineering, 435: 012026.
[7]
Chen Y, Chen W, Huang S (2018) Developing Arbitrage Strategy in High-frequency Pairs Trading with Filterbank CNN Algorithm. In 2018 IEEE International Conference on Agents (ICA), 113–116.
[8]
Cybenko G (1989) Approximation by superpositions of a sigmoidal function. Math Control Signals Syst 2: 303–314. doi: 10.1007/BF02551274
[9]
Day M, Lee C (2016) Deep learning for financial sentiment analysis on finance news providers. In 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 1127–1134.
[10]
Deng Y, Bao F, Kong Y, et al. (2017) Deep Direct Reinforcement Learning for Financial Signal Representation and Trading. IEEE Trans Neural Networks Learn Syst 28: 653–664. doi: 10.1109/TNNLS.2016.2522401
[11]
Dixon M, Klabjan D, Bang J (2017) Classification-based financial markets prediction using deep neural networks. Algorithmic Financ 6: 67–77. doi: 10.3233/AF-170176
[12]
Doering J, Fairbank M, Markose S (2017) Convolutional neural networks applied to high-frequency market microstructure forecasting. In 2017 9th Computer Science and Electronic Engineering (CEEC), 31–36.
[13]
Fang Y, Chen J, Xue Z (2019) Research on quantitative investment strategies based on deep learning. Algorithms 12: 35. doi: 10.3390/a12020035
[14]
Gudelek M, Boluk S, Ozbayoglu A (2017) A deep learning based stock trading model with 2-D CNN trend detection. In 2017 IEEE Symposium Series on Computational Intelligence (SSCI), 1–8.
[15]
Gunduz H, Yaslan Y, Cataltepe Z (2017) Intraday prediction of borsa Istanbul using convolutional neural networks and feature correlations. Knowl Based Syst 137: 138–148. doi: 10.1016/j.knosys.2017.09.023
[16]
Hendershott T, Jones C, Menkveld A (2011) Does algorithmic trading improve liquidity? J Financ 66: 1–33. doi: 10.1111/j.1540-6261.2010.01624.x
[17]
Hendershott T, Riordan R (2009) Algorithmic trading and information. University of California, Berkeley.
[18]
Hinton G, Salakhutdinov R (2006) Reducing the Dimensionality of Data with Neural Networks. Science 313: 504–507. doi: 10.1126/science.1127647
Lee H, Grosse R, Ranganath R, et al. (2009) Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In Proceedings of the 26th annual international conference on machine learning, 609–616.
[21]
Hoseinzade E, Haratizadeh S (2019) CNNpred: CNN-based stock market prediction using a diverse set of variables. Expert Syst Appl 129: 273–285. doi: 10.1016/j.eswa.2019.03.029
[22]
Hossain M, Karim R, Thulasiram R, et al. (2018) Hybrid Deep Learning Model for Stock Price Prediction. In 2018 IEEE Symposium Series on Computational Intelligence (SSCI), 1837–1844.
[23]
Jeong G, Kim H (2019) Improving financial trading decisions using deep Q-learning: Predicting the number of shares, action strategies, and transfer learning. Expert Syst Appl 117: 125–138. doi: 10.1016/j.eswa.2018.09.036
[24]
Ji S, Kim J, Im H (2019) A Comparative Study of Bitcoin Price Prediction Using Deep Learning. Mathematics 7: 898. doi: 10.3390/math7100898
[25]
Kalman B, Kwasny S (1992) Why tanh: choosing a sigmoidal function. In [Proceedings 1992] IJCNN International Joint Conference on Neural Networks, 4: 578–581.
[26]
Kim S, Kang M (2019) Financial series prediction using Attention LSTM. arXiv preprint arXiv: 1902.10877.
[27]
Krauss C, Do X, Huck N (2017) Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S & P 500. Eur J Oper Res 259: 689–702. doi: 10.1016/j.ejor.2016.10.031
[28]
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521: 436-44. doi: 10.1038/nature14539
[29]
Li Y, Zheng W, Zheng Z (2019) Deep Robust Reinforcement Learning for Practical Algorithmic Trading. IEEE Access 7: 108014–108022. doi: 10.1109/ACCESS.2019.2932789
[30]
Lin B, Chu W, Wang C (2018) Application of Stock Analysis Using Deep Learning. In 2018 7th International Congress on Advanced Applied Informatics (ⅡAI-AAI), 612–617.
[31]
Liu S, Zhang C, Ma J (2017) CNN-LSTM neural network model for quantitative strategy analysis in stock markets. In international conference on neural information processing, Springer, 198–206.
[32]
Lu W, Li J, Li Y, et al. (2020) A CNN-LSTM-Based Model to Forecast Stock Prices. Complexity 2020: 6622927.
[33]
Luo S, Lin X, Zheng Z (2019) A novel CNN-DDPG based AI-trader: Performance and roles in business operations. Transp Res Part E Logist Transp Rev 131: 68–79. doi: 10.1016/j.tre.2019.09.013
[34]
Lv D, Yuan S, Li M, et al. (2019) An Empirical Study of Machine Learning Algorithms for Stock Daily Trading Strategy. Math Probl Eng 2019: 7816154.
[35]
Mikolov T, Karafiát M, Burget L, et al. (2010) Recurrent neural network based language model. In Interspeech, Makuhari, 1045–1048.
[36]
Mudassir M, Bennbaia S, Unal D, et al. (2020) Time-series forecasting of Bitcoin prices using high-dimensional features: a machine learning approach. Neural Comput Appl 2020: 1–15.
[37]
Nair V, Hinton G (2010) Rectified linear units improve restricted boltzmann machines. In Icml.
[38]
Nelson D, Pereira A, de Oliveira R (2017) Stock market's price movement prediction with LSTM neural networks. In 2017 International Joint Conference on Neural Networks (IJCNN), 1419–1426.
[39]
Nuti G, Mirghaemi M, Treleaven P, et al. (2011) Algorithmic Trading. Computer 44: 61–69. doi: 10.1109/MC.2011.31
[40]
Ozbayoglu A, Gudelek M, Sezer O (2020) Deep learning for financial applications: A survey. Appl Soft Comput 93: 106384. doi: 10.1016/j.asoc.2020.106384
[41]
Selvin S, Vinayakumar R, Gopalakrishnan E, et al. (2017) Stock price prediction using LSTM, RNN and CNN-sliding window model. In 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI), 1643–1647.
[42]
Serrano W (2018) Fintech Model: The Random Neural Network with Genetic Algorithm. Proced Comput Sci 126: 537–546. doi: 10.1016/j.procs.2018.07.288
[43]
Sezer O, Ozbayoglu M, Dogdu E (2017) A Deep Neural-Network Based Stock Trading System Based on Evolutionary Optimized Technical Analysis Parameters. Proced Comput Sci 114: 473–480. doi: 10.1016/j.procs.2017.09.031
[44]
Sezer O, Ozbayoglu A (2018) Algorithmic financial trading with deep convolutional neural networks: Time series to image conversion approach. Appl Soft Comput 70: 525–538. doi: 10.1016/j.asoc.2018.04.024
[45]
Sezer O, Ozbayoglu A (2019) Financial trading model with stock bar chart image time series with deep convolutional neural networks. arXiv preprint arXiv: 1903.04610.
[46]
Shah D, Campbell W, Zulkernine F (2018) A Comparative Study of LSTM and DNN for Stock Market Forecasting. In 2018 IEEE International Conference on Big Data (Big Data), 4148–4155.
[47]
Singh R, Srivastava S (2017) Stock prediction using deep learning. Multimed Tools Appl 76: 18569–18584. doi: 10.1007/s11042-016-4159-7
[48]
Sirignano J, Cont R (2019) Universal features of price formation in financial markets: perspectives from deep learning. Quant Financ 19: 1449–1459. doi: 10.1080/14697688.2019.1622295
[49]
Sohangir S, Wang D, Pomeranets A, et al. (2018) Big Data: Deep Learning for financial sentiment analysis. J Big Data 5: 1–25. doi: 10.1186/s40537-017-0111-6
[50]
Sutskever I, Hinton G, Taylor G (2009) The recurrent temporal restricted boltzmann machine. In Advances in neural information processing systems, 1601–1608.
[51]
Théate T, Ernst D (2021) An application of deep reinforcement learning to algorithmic trading. Expert Syst Appl 173: 114632. doi: 10.1016/j.eswa.2021.114632
[52]
Treleaven P, Galas M, Lalchand V (2013) Algorithmic trading review. Commun ACM 56: 76–85. doi: 10.1145/2500117
[53]
Troiano L, Villa E, Loia V (2018) Replicating a Trading Strategy by Means of LSTM for Financial Industry Applications. IEEE Trans Ind Inf 14: 3226–3234. doi: 10.1109/TII.2018.2811377
[54]
Wang Z, Lu W, Zhang K, et al. (2021) MCTG: Multi-frequency continuous-share trading algorithm with GARCH based on deep reinforcement learning. arXiv preprint arXiv: 2105.03625.
[55]
Xie M, Li H, Zhao Y (2020) Blockchain financial investment based on deep learning network algorithm. J Comput Appl Math 372: 112723. doi: 10.1016/j.cam.2020.112723
[56]
Zhao Z, Rao R, Tu S, et al. (2017) Time-Weighted LSTM Model with Redefined Labeling for Stock Trend Prediction. In 2017 IEEE 29th International Conference on Tools with Artificial Intelligence (ICTAI), 1210–1217.
[57]
Zou Z, Qu Z (2020) Using LSTM in Stock prediction and Quantitative Trading. CS230: Deep Learning, Winter.
Yongfeng Wang, Guofeng Yan. Survey on the application of deep learning in algorithmic trading[J]. Data Science in Finance and Economics, 2021, 1(4): 345-361. doi: 10.3934/DSFE.2021019
Yongfeng Wang, Guofeng Yan. Survey on the application of deep learning in algorithmic trading[J]. Data Science in Finance and Economics, 2021, 1(4): 345-361. doi: 10.3934/DSFE.2021019
Table 2.
Orthonormal temporal frequencies for 109 days (upper block or F109) and orthonormal spatial frequencies for 117 countries (lower block or F117).
Table A3.
Inner products w.r.t. temporal (case: upper top and death: upper bottom blocks) and spatial (case: lower top and death: lower bottom blocks) frequencies at a national level.
Table A4.
Temporal inner product for continents (World, Africa, Asia, Europe, North and South America) w.r.t. daily biweekly growth rates of cases (upper block) and deaths (lower block) from March 22nd, 2020 to June 11th, 2021 (447 days).
Table A5.
Distance matrices for daily biweekly growth rates of cases (uppermost block) and deaths (2nd block) w.r.t. temporal frequencies and the ones of cases (3rd block) and deaths (bottommost block) w.r.t. spatial frequencies.
Table A6.
Variability for 117 countries with respect to daily biweekly growth rates of cases and death under temporal frequencies and variability for 109 days with respect to daily biweekly growth rates of cases and death under spatial frequencies.