Research article Special Issues

Saddlepoint approximation of the p-values for the multivariate one-sample sign and signed-rank tests

  • Received: 21 June 2024 Revised: 06 August 2024 Accepted: 13 August 2024 Published: 02 September 2024
  • MSC : 62E17, 62G10, 92B15

  • A multivariate data analysis (MVDA) is a powerful statistical approach to simultaneously analyze datasets with multiple variables. Unlike univariate or bivariate analyses, which simultaneously focus on one or two variables, respectively, MVDA considers the interactions and relationships among multiple variables within a dataset. Several nonparametric tests can be used in the context of one-sample multivariate location problems. The exact distributions of such tests cannot be analytically computed and are usually approximated using an asymptotic approximation. This article proposes the saddlepoint approximation method to approximate the tail probability for multivariate sign and signed-rank tests. It is suggested as a more accurate alternative to the traditional asymptotic approximation method and an alternative to the simulation method. It requires a lot of time as it depends on all possible permutations. Real data examples were provided to illustrate the calculation of p-values, and a simulation study was conducted to compare the accuracy of the saddlepoint approximation method with the simulation method (permutation-based, so time-consuming) and an asymptotic normal approximation method. The study results show that the saddlepoint approximation provides highly accurate approximations to the p-values of the considered statistics, and it often outperforms the normal approximation. Additionally, the results show that the proposed method's computation time is much less than that of the time-consuming simulation method.

    Citation: Abd El-Raheem M. Abd El-Raheem, Ibrahim A. A. Shanan, Mona Hosny. Saddlepoint approximation of the p-values for the multivariate one-sample sign and signed-rank tests[J]. AIMS Mathematics, 2024, 9(9): 25482-25493. doi: 10.3934/math.20241244

    Related Papers:

  • A multivariate data analysis (MVDA) is a powerful statistical approach to simultaneously analyze datasets with multiple variables. Unlike univariate or bivariate analyses, which simultaneously focus on one or two variables, respectively, MVDA considers the interactions and relationships among multiple variables within a dataset. Several nonparametric tests can be used in the context of one-sample multivariate location problems. The exact distributions of such tests cannot be analytically computed and are usually approximated using an asymptotic approximation. This article proposes the saddlepoint approximation method to approximate the tail probability for multivariate sign and signed-rank tests. It is suggested as a more accurate alternative to the traditional asymptotic approximation method and an alternative to the simulation method. It requires a lot of time as it depends on all possible permutations. Real data examples were provided to illustrate the calculation of p-values, and a simulation study was conducted to compare the accuracy of the saddlepoint approximation method with the simulation method (permutation-based, so time-consuming) and an asymptotic normal approximation method. The study results show that the saddlepoint approximation provides highly accurate approximations to the p-values of the considered statistics, and it often outperforms the normal approximation. Additionally, the results show that the proposed method's computation time is much less than that of the time-consuming simulation method.



    加载中


    [1] J. L. Hodges, A bivariate sign test, Ann. Math. Stat., 26 (1955), 523–527.
    [2] I. Blumen, A new bivariate sign test, J. Am. Stat. Assoc., 53 (1958), 448–456. https://doi.org/10.1080/01621459.1958.10501451 doi: 10.1080/01621459.1958.10501451
    [3] R. H. Randles, A distribution-free multivariate sign test based on interdirections, J. Am. Stat. Assoc., 84 (1989), 1045–1050. https://doi.org/10.1080/01621459.1989.10478870 doi: 10.1080/01621459.1989.10478870
    [4] T. P. Hettmansperger, J. Nyblom, H. Oja, Affine invariant multivariate one-sample sign tests, J. R. Stat. Soc. B, 56 (1994), 221–234. https://doi.org/10.1111/j.2517-6161.1994.tb01973.x doi: 10.1111/j.2517-6161.1994.tb01973.x
    [5] J. Möttönen, H. Oja, Multivariate spatial sign and rank methods, J. Nonparametr. Stat., 5 (1995), 201–213. https://doi.org/10.1080/10485259508832643 doi: 10.1080/10485259508832643
    [6] T. P. Hettmansperger, J. Möttönen, H. Oja, Affine invariant multivariate one-sample signed-rank tests, J. Am. Stat. Assoc., 92 (1997), 1591–1600.
    [7] J. Möttönen, H. Oja, On the efficiency of multivariate spatial sign and rank tests, Ann. Stat., 25 (1997), 542–552.
    [8] D. Larocque, M. Labarre, A conditionally distribution-free multivariate sign test for one-sided alternatives, J. Am. Stat. Assoc., 99 (2004), 499–509.
    [9] Z. R. Mahfoud, R. H. Randles, On multivariate signed rank tests, J. Nonparametr. Stat., 17 (2005), 201–216.
    [10] G. Bernard, T. Verdebout, On some multivariate sign tests for scatter matrix eigenvalues, Econom. Stat., 29 (2024), 252–260. https://doi.org/10.1016/j.ecosta.2021.04.001. doi: 10.1016/j.ecosta.2021.04.001
    [11] H. Oja, Affine invariant multivariate sign and rank tests and corresponding estimates: A review, Scand. J. Stat., 26 (2002), 319–343. https://doi.org/10.1111/1467-9469.00152 doi: 10.1111/1467-9469.00152
    [12] H. E. Daniels, Saddlepoint approximation in statistics, Ann. Math. Stat., 25 (1954), 631–650.
    [13] R. Lugannani, S. O. Rice, Saddlepoint approximations for the distribution of the sum of independent random variables, Adv. Appl. Probab., 12 (1980), 475–490.
    [14] I. M. Skovgaard, Saddlepoint expansions for conditional distributions, J. Appl. Probab., 24 (1987), 875–887.
    [15] O. E. Barndorff-Nielsen, D. R. Cox, Asymptotic Techniques for Use in Statistics, London: Chapman & Hall, 1989.
    [16] R. W. Butler, Saddlepoint Approximations with Applications, Cambridge: Cambridge University Press, 2007.
    [17] D. Meng, S. Yang, T. Lin, J. Wang, H. Yang, Z. Lv, RBMDO using Gaussian mixture model-based second-order mean-value saddlepoint approximation, Comput. Model. Eng. Sci., 132 (2022), 553–568.
    [18] A. M. Abd El-Raheem, E. F. Abd-Elfattah, Weighted log-rank tests for clustered censored data: Saddlepoint p-values and confidence intervals, Stat. Meth. Med. Res., 29 (2020), 2629–2636.
    [19] A. M. Abd El-Raheem, E. F. Abd-Elfattah, Weighted log-rank tests for censored data under Wei's urn design: Saddlepoint approximation and confidence intervals, J. Biopharm. Stat., 2023, 1–10. https://doi.org/10.1080/10543406.2023.2183508
    [20] Q. Zhao, J. Duan, T. Wu, J. Hong, Time-dependent reliability analysis under random and interval uncertainties based on Kriging modeling and saddlepoint approximation, Comput. Ind. Eng., 182 (2023), 109391.
    [21] I. A. Shanan, E. F. Abd-Elfattah, A. M. Abd El-Raheem, A new approach for approximating the p-value of a class of bivariate sign tests, Sci. Rep., 13 (2023), 19133.
    [22] D. Meng, Y. Guo, Y. Xu, S. Yang, Y. Guo, L. Pan, et al., Saddlepoint approximation method in reliability analysis: A review, Comput. Model. Eng. Sci., 139 (2024), 2329–2359.
    [23] A. M. Abd El-Raheem, M. Hosny, Saddlepoint p-values for a class of nonparametric tests for the current status and panel count data under generalized permuted block design, AIMS Mathematics, 8 (2023), 18866–18880. https://doi.org/10.3934/math.2023960 doi: 10.3934/math.2023960
    [24] A. M. Abd El-Raheem, M. Hosny, E. F. Abd-Elfattah, Statistical inference of the class of nonparametric tests for the panel count and current status data from the perspective of the saddlepoint approximation, J. Math., 2023, 9111653. https://doi.org/10.1155/2023/9111653 doi: 10.1155/2023/9111653
    [25] A. M. Abd El-Raheem, Kh. S. Kamal, E. F. Abd-Elfattah, P-values and confidence intervals of linear rank tests for left-truncated data under truncated binomial design, J. Biopharm. Stat., 34 (2024), 127–135.
    [26] D. A. Pierce, D. Peters, Practical use of higher order asymptotics for multiparameter exponential families (with Discussion), J. R. Stat. Soc. B, 54 (1992), 701–737.
    [27] A. C. Davison, S. Wang, Saddlepoint approximations as smoothers, Biometrika, 89 (2002), 933–938.
    [28] P. Henrici, Applied and Computational Complex Analysis, Volume 2: Special Functions, Integral Transforms, Asymptotics, Continued Fractions, London: Wiley, 1977.
    [29] R. Rao, Tests of significance in multivariate analysis, Biometrika, 35 (1948), 58–79.
    [30] J. A. Merchant, G. M. Halprin, A. R. Hudson, K. H. Kilburn, W. N. McKenzie, D. J. Hurst, et al., Responses to cotton dust, Arch. Environ. Health: Int. J., 30 (1975), 222–229. https://doi.org/10.1080/00039896.1975.10666685 doi: 10.1080/00039896.1975.10666685
    [31] Met Éireann, Historical data. (n. d.), 2022. Available from: https://www.met.ie/climate/available-data/historical-data.
  • Reader Comments
  • © 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(383) PDF downloads(35) Cited by(0)

Article outline

Figures and Tables

Tables(7)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog