Research article

A comparative study of symbolic aggregate approximation and topological data analysis

  • Received: 06 August 2024 Revised: 08 October 2024 Accepted: 24 October 2024 Published: 30 October 2024
  • JEL Codes: C63; C65; C53; C18

  • The movement of stocks is often perceived as random due to the complex interactions between different stocks and the inherently chaotic nature of the market. This study investigated the similarity in stock movements across multiple industry sectors in Europe. Specifically, we applied topological data analysis (TDA) to analyze stock time series data and compared the results with those obtained using an expanded form of a more classical time series analysis method, symbolic aggregate approximation (SAX). Our findings indicated that while TDA offered detailed insights into "local" stock movements, SAX was more effective in capturing broader trends in financial markets, where less detail was required, making it suitable for portfolio optimization. We also presented an extension of SAX that incorporated volatility measures, improving its performance in highly volatile markets.

    Citation: Fredrik Hobbelhagen, Ioannis Diamantis. A comparative study of symbolic aggregate approximation and topological data analysis[J]. Quantitative Finance and Economics, 2024, 8(4): 705-732. doi: 10.3934/QFE.2024027

    Related Papers:

  • The movement of stocks is often perceived as random due to the complex interactions between different stocks and the inherently chaotic nature of the market. This study investigated the similarity in stock movements across multiple industry sectors in Europe. Specifically, we applied topological data analysis (TDA) to analyze stock time series data and compared the results with those obtained using an expanded form of a more classical time series analysis method, symbolic aggregate approximation (SAX). Our findings indicated that while TDA offered detailed insights into "local" stock movements, SAX was more effective in capturing broader trends in financial markets, where less detail was required, making it suitable for portfolio optimization. We also presented an extension of SAX that incorporated volatility measures, improving its performance in highly volatile markets.



    加载中


    [1] Bendich P, Edelsbrunner H, Kerber M (2010) Computing robustness and persistence for images. IEEE Transactions on Visualization and Computer Graphics 16: 1251–1260. https://doi.org/10.1109/TVCG.2010.139 doi: 10.1109/TVCG.2010.139
    [2] Canelas A, Neves R, Horta N (2013) A SAX-GA approach to evolve investment strategies on financial markets based on pattern discovery techniques. Expert Syst Appl 40: 1579–1590. https://doi.org/10.1016/j.eswa.2012.09.002 doi: 10.1016/j.eswa.2012.09.002
    [3] Gao T, Li X, Chai Y, et al. (2016) Deep learning with stock indicators and two-dimensional principal component analysis for closing price prediction system. In: 2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS), IEEE, 166–169. https://doi.org/10.1109/ICSESS.2016.7883040
    [4] Gidea M, Katz Y (2018) Topological data analysis of financial time series: Landscapes of crashes. Physica A 491: 820–834. https://doi.org/10.1016/j.physa.2017.09.028 doi: 10.1016/j.physa.2017.09.028
    [5] Hankinson GA, Rhys JMW (1983) Electricity consumption, electricity intensity and industrial structure. Energy Econ 5: 146–152. https://doi.org/10.1016/0140-9883(83)90054-3 doi: 10.1016/0140-9883(83)90054-3
    [6] Karan A, Kaygun A (2021) Time series classification via topological data analysis. Expert Syst Appl 183: 115326. https://doi.org/10.1016/j.eswa.2021.115326 doi: 10.1016/j.eswa.2021.115326
    [7] Kennel MB, Brown R, Abarbanel HDI (1992) Determining embedding dimension for phase-space reconstruction using a geometrical construction. Phys Rev A 45: 3403. https://doi.org/10.1103/PhysRevA.45.3403 doi: 10.1103/PhysRevA.45.3403
    [8] Keogh E, Chakrabarti K, Pazzani M, et al. (2001) Dimensionality reduction for fast similarity search in large time series databases. Knowl Inf Syst 3: 263–286. https://doi-org.mu.idm.oclc.org/10.1007/PL00011669 doi: 10.1007/PL00011669
    [9] Khasawneh FA, Munch E (2017) Utilizing topological data analysis for studying signals of time-delay systems, In: Insperger, T., Ersal, T., Orosz, G. (eds), Time Delay Systems: Theory, Numerics, Applications, and Experiments, Springer, 93–106. https://doi-org.mu.idm.oclc.org/10.1007/978-3-319-53426-8_7
    [10] Leitão J, Neves RF, Horta N (2016) Combining rules between PIPs and SAX to identify patterns in financial markets. Expert Syst Appl 65: 242–254. https://doi.org/10.1016/j.eswa.2016.08.032 doi: 10.1016/j.eswa.2016.08.032
    [11] Liebert W, Pawelzik K, Schuster HG (1991) Optimal embeddings of chaotic attractors from topological considerations. Europhys Lett 14: 521. https://doi.org/10.1209/0295-5075/14/6/004 doi: 10.1209/0295-5075/14/6/004
    [12] Liu H, Yang Y (2020) Regression-adjusted average treatment effect estimates in stratified randomized experiments. Biometrika 107: 935–948. https://doi-org.mu.idm.oclc.org/10.1093/biomet/asaa038 doi: 10.1093/biomet/asaa038
    [13] Li X, Ding P (2020) Rerandomization and regression adjustment. J R Stat Soc Ser B-Stat Methodol 82: 241–268. https://doi-org.mu.idm.oclc.org/10.1111/rssb.12353 doi: 10.1111/rssb.12353
    [14] Lin J, Keogh E, Wei L, et al. (2007) Experiencing SAX: a novel symbolic representation of time series. Data Min Knowl Discov 15: 107–144. https://doi-org.mu.idm.oclc.org/10.1007/s10618-007-0064-z doi: 10.1007/s10618-007-0064-z
    [15] Liu W, Shao L (2009) Research of SAX in distance measuring for financial time series data. In: 2009 First International Conference on Information Science and Engineering, IEEE, 935–937. https://doi.org/10.1109/ICISE.2009.924
    [16] Lkhagva B, Suzuki Y, Kawagoe K (2006) Extended SAX: Extension of symbolic aggregate approximation for financial time series data representation.
    [17] Majumdar S, Laha AK (2020) Clustering and classification of time series using topological data analysis with applications to finance. Expert Syst Appl 162: 113868. https://doi.org/10.1016/j.eswa.2020.113868 doi: 10.1016/j.eswa.2020.113868
    [18] United Nations (2008) International standard industrial classification of all economic activities. Rev. 4. United Nations. Available from: https://unstats.un.org/unsd/publication/seriesm/seriesm_4rev4e.pdf.
    [19] Perea JA, Harer J (2015) Sliding windows and persistence: An application of topological methods to signal analysis. Found Comput Math 15: 799–838. https://doi-org.mu.idm.oclc.org/10.1007/s10208-014-9206-z doi: 10.1007/s10208-014-9206-z
    [20] Skaf Y, Laubenbacher R (2022) Topological data analysis in biomedicine: A review. J Biomed Inform 130: 104082. https://doi.org/10.1016/j.jbi.2022.104082 doi: 10.1016/j.jbi.2022.104082
    [21] Tralie C, Saul N, Bar-On R (2018) Ripser.py: A lean persistent homology library for Python. J Open Source Softw 3: 925. https://doi.org/10.21105/joss.00925
    [22] Truong P (2017) An exploration of topological properties of high-frequency one-dimensional financial time series data using TDA. Preprint.
    [23] Van Rossum G, Drake FL (2009) Python 3 Reference Manual. CreateSpace, Scotts Valley, CA.
    [24] Wasserman L (2018) Topological data analysis. Annu Rev Stat Application 5: 501–532. http://dx.doi.org/10.1146/annurev-statistics-031017-100045 doi: 10.1146/annurev-statistics-031017-100045
    [25] Van Rossum G, Drake FL (2009) Python 3 Reference Manual. CreateSpace, Scotts Valley, CA. ISBN: 1441412697.
  • QFE-08-04-027.pdf
  • Reader Comments
  • © 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(285) PDF downloads(34) Cited by(0)

Article outline

Figures and Tables

Figures(14)  /  Tables(1)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog