Linear mixed-effects models (LMEMs) are widely used in medical, engineering, and social applications. The accurate specification of the covariance matrix structure within the error term is known to impact the estimation and inference procedures. Thus, it is crucial to detect the source of errors in LMEMs specifications. In this study, we propose combining a user-friendly computational test with an analytical method to visualize the source of errors. Through statistical simulations under different scenarios, we evaluate the performance of the proposed test in terms of the Power and Type I error rate. Our findings indicate that as the sample size $ n $ increases, the proposed test effectively detects misspecification in the systematic component, the number of random effects, the within-subject covariance structure, and the covariance structure of the error term in the LMEM with high Power while maintaining the nominal Type I error rate. Finally, we show the practical usefulness of our proposed test with a real-world application.
Citation: Jairo A. Angel, Francisco M.M. Rocha, Jorge I. Vélez, Julio M. Singer. A new test for detecting specification errors in Gaussian linear mixed-effects models[J]. AIMS Mathematics, 2024, 9(11): 30710-30727. doi: 10.3934/math.20241483
Linear mixed-effects models (LMEMs) are widely used in medical, engineering, and social applications. The accurate specification of the covariance matrix structure within the error term is known to impact the estimation and inference procedures. Thus, it is crucial to detect the source of errors in LMEMs specifications. In this study, we propose combining a user-friendly computational test with an analytical method to visualize the source of errors. Through statistical simulations under different scenarios, we evaluate the performance of the proposed test in terms of the Power and Type I error rate. Our findings indicate that as the sample size $ n $ increases, the proposed test effectively detects misspecification in the systematic component, the number of random effects, the within-subject covariance structure, and the covariance structure of the error term in the LMEM with high Power while maintaining the nominal Type I error rate. Finally, we show the practical usefulness of our proposed test with a real-world application.
[1] | J. Afiune, Avaliação Ecocardiografica Evolutiva de Recém$-$nascidos Pre-termo, do Nascimento Até o Termo, Phd thesis, Universidade de São Paulo, 2000. |
[2] | E. Demidenko, Mixed Models: Theory and Applications with R, 2 Eds., Hoboken: John Wiley & Sons, Inc., 2013. |
[3] | R. Drikvandi, G. Verbeke, G. Molenberghs, Diagnosing Misspecification of the Random-Effects Distribution in Mixed Models, Biometrics, 73 (2017), 63–71. |
[4] | D. A. Freedman, On the so-called 'Huber Sandwich Estimator' and 'Robust Standard Errors', Amer. Stat., 60 (2006), 299–302. http://doi.org/10.1198/000313006X152207 doi: 10.1198/000313006X152207 |
[5] | S. K. Hanneman, Design, Analysis, and Interpretation of Method-Comparison Studies, AACN Adv. Crit. Care, 19 (2008), 223–234. http://doi.org/10.4037/15597768-2008-2015 doi: 10.4037/15597768-2008-2015 |
[6] | D. A. Harville, Maximum Likelihood Approaches to Variance Component Estimation and to Related Problems, J. Amer. Stat. Assoc., 72 (1977), 320–338. |
[7] | X. Huang, Detecting Random-Effects Model Misspecification via Coarsened Data, Comput. Stat. Data Anal., 55 (2011), 703–714. https://doi.org/10.1016/j.csda.2010.06.012 doi: 10.1016/j.csda.2010.06.012 |
[8] | J. Jiang, Goodness-of-Fit Tests for Mixed Model Diagnostics, Ann. Stat., 29 (2001), 1137–1164. |
[9] | N. Lange, L. Ryan, Assessing Normality in Random Effects Models, Ann. Stat., 17 (1989), 624–642. |
[10] | S. Litière, A. Alonso, G. Molenberghs, Type I and Type II Error under Random-Effects Misspecification in Generalized Linear Mixed Models, Biometrics, 63 (2007), 1038–1044. |
[11] | C. Nagle, An Introduction to Fitting and Evaluating Mixed-effects Models in R, Proceedings of the 10th Pronunciation in Second Language Learning and Teaching Conference, 2018, 82–105. |
[12] | H. D. Patterson, R. Thompson, Recovery of Inter-block Information when Block Sizes are Unequal, Biometrika, 58 (1971), 545–554, |
[13] | R Core Team, R: A Language and Environment for Statistical Computing, Available from: https://www.r-project.org/. |
[14] | G. K. Robinson, That BLUP is a Good Thing: The Estimation of Random Effects, Stat. Sci., 6 (1991), 15–32, |
[15] | F. M. Rocha, J. M. Singer, Selection of Terms in Random Coefficient Regression Models, J. Appl. Stat., 45 (2018), 225–242. |
[16] | H. Schielzeth, N. J. Dingemanse, S. Nakagawa, D. F. Westneat, H. Allegue, C. Teplitsky, et al., Robustness of Linear Mixed-Effects Models to Violations of Distributional Assumptions, Methods Ecol. Evol., 11 (2020), 1141–1152. https://doi.org/10.1111/2041-210X.13434 doi: 10.1111/2041-210X.13434 |
[17] | P. Sen, J. Singer, Large Sample Methods in Statistics: An Introduction with Applications, Boca Raton: CRC Press, 1993. |
[18] | J. M. Singer, F. M. Rocha, J. S. Nobre, Graphical Tools for Detecting Departures from Linear Mixed Model Assumptions and Some Remedial Measures, Int. Stat. Rev., 85 (2017), 290–324. https://doi.org/10.1111/insr.12178 doi: 10.1111/insr.12178 |
[19] | H. White, Maximum Likelihood Estimation of Misspecified Models, Econometrica, 50 (1982), 1–25. https://doi.org/10.2307/1912526 doi: 10.2307/1912526 |
[20] | D. Yu, X. Zhang, K. K. Yau, Asymptotic Properties and Information Criteria for Misspecified Generalized Linear Mixed Models, J. R. Stat. Soc. Ser. B (Stat. Methodol.), 80 (2018), 817–836. |