Design strategy | Reference |
RCBD | [17,18] |
LSED | [17,18] |
Greco-Latin square experimental design (GLSED) | [19] |
BIBD | [20,21] |
Youden design | [10] |
Split plot design | [13] |
This study introduced a novel exact-scheme analysis of variance to tackle the challenge of incomplete data within the Greco-Latin square experimental design (GLSED), specifically for scenarios with a single missing observation across any treatment and block level, thus eliminating the need for conventional data imputation methods. This approach innovatively addresses and mitigates the bias in the treatment sum of squares, a significant drawback of traditional missing plot techniques, by providing a precise, exact-scheme-based formula for calculating the treatment sum of squares in fixed-effect GLSED contexts with unrecorded values. Moreover, it offers a method for correcting biased treatment sum of squares values, presenting an adjustment mechanism for instances where the least squares method was previously employed to estimate missing values. This comprehensive strategy not only enhances the methodological accuracy and integrity of GLSED studies but also contributes significantly to the field by offering a solution to navigate the complexities of incomplete datasets without resorting to data imputation, thus improving the rigor and validity of experimental designs in the face of missing data challenges.
Citation: Kittiwat Sirikasemsuk, Sirilak Wongsriya, Kanogkan Leerojanaprapa. Solving the incomplete data problem in Greco-Latin square experimental design by exact-scheme analysis of variance without data imputation[J]. AIMS Mathematics, 2024, 9(12): 33551-33571. doi: 10.3934/math.20241601
[1] | Raúl M. Falcón, Laura Johnson, Stephanie Perkins . A census of critical sets based on non-trivial autotopisms of Latin squares of order up to five. AIMS Mathematics, 2021, 6(1): 261-295. doi: 10.3934/math.2021017 |
[2] | Shulan Kong, Chengbin Wang, Yawen Sun . A recursive filter for a class of two-dimensional nonlinear stochastic systems. AIMS Mathematics, 2025, 10(1): 1741-1756. doi: 10.3934/math.2025079 |
[3] | Hassan Ranjbar, Leila Torkzadeh, Dumitru Baleanu, Kazem Nouri . Simulating systems of Itô SDEs with split-step $ (\alpha, \beta) $-Milstein scheme. AIMS Mathematics, 2023, 8(2): 2576-2590. doi: 10.3934/math.2023133 |
[4] | Chang-Xu Zhang, Fu-Tao Hu, Shu-Cheng Yang . On the (total) Roman domination in Latin square graphs. AIMS Mathematics, 2024, 9(1): 594-606. doi: 10.3934/math.2024031 |
[5] | Muhammad Aslam, Osama H. Arif . Simulating chi-square data through algorithms in the presence of uncertainty. AIMS Mathematics, 2024, 9(5): 12043-12056. doi: 10.3934/math.2024588 |
[6] | M. J. Huntul, Kh. Khompysh, M. K. Shazyndayeva, M. K. Iqbal . An inverse source problem for a pseudoparabolic equation with memory. AIMS Mathematics, 2024, 9(6): 14186-14212. doi: 10.3934/math.2024689 |
[7] | Fan Sha, Jianbing Zhang . Randomized symmetric Gauss-Seidel method for solving linear least squares problems. AIMS Mathematics, 2024, 9(7): 17453-17463. doi: 10.3934/math.2024848 |
[8] | Weiping Zhou, Wan He, Wei Wang, Shigui Huang . Construction of marginally coupled designs with mixed-level qualitative factors. AIMS Mathematics, 2024, 9(12): 33731-33755. doi: 10.3934/math.20241610 |
[9] | Hongyi Wu, Jinwen Jin, Zhiwei Li . NGBoost algorithm-based prediction of mechanical properties of a hot-rolled strip and its interpretability research with ANOVA values. AIMS Mathematics, 2024, 9(11): 33000-33022. doi: 10.3934/math.20241578 |
[10] | Rohul Amin, Kamal Shah, Hijaz Ahmad, Abdul Hamid Ganie, Abdel-Haleem Abdel-Aty, Thongchai Botmart . Haar wavelet method for solution of variable order linear fractional integro-differential equations. AIMS Mathematics, 2022, 7(4): 5431-5443. doi: 10.3934/math.2022301 |
This study introduced a novel exact-scheme analysis of variance to tackle the challenge of incomplete data within the Greco-Latin square experimental design (GLSED), specifically for scenarios with a single missing observation across any treatment and block level, thus eliminating the need for conventional data imputation methods. This approach innovatively addresses and mitigates the bias in the treatment sum of squares, a significant drawback of traditional missing plot techniques, by providing a precise, exact-scheme-based formula for calculating the treatment sum of squares in fixed-effect GLSED contexts with unrecorded values. Moreover, it offers a method for correcting biased treatment sum of squares values, presenting an adjustment mechanism for instances where the least squares method was previously employed to estimate missing values. This comprehensive strategy not only enhances the methodological accuracy and integrity of GLSED studies but also contributes significantly to the field by offering a solution to navigate the complexities of incomplete datasets without resorting to data imputation, thus improving the rigor and validity of experimental designs in the face of missing data challenges.
In business, the design of experiments (DOE) emerges as a focal scientific methodology that significantly enhances profitability through strategic manipulation of potential factors by management. This technique is instrumental in refining and optimizing both the design and the advancement of manufacturing processes. It achieves this by minimizing variation in the response variable, thereby facilitating the development of products and processes that are both reliable and efficient. Central to Fisher's approach to DOE is the employment of an analysis of variance (ANOVA), which relies on the F-test to assess the statistical significance among group means. The foundations of Fisher's DOE are bolstered by three critical principles: blocking, randomization, and replication, each serving to ensure the integrity and validity of experimental results [1]. In the context of DOE, experiments that allow for the investigation of a single potential factor under conditions where every observation is equally likely to participate in the trial are categorized as completely randomized designs. In contrast, designs that do not fully randomize the assignment of observations but still focus on a single potential factor—such as the randomized complete block design (RCBD), Latin square experimental design (LSED), and Greco-Latin square experimental design (GLSED)—are recognized for their structured approach to managing variability within experimental settings.
The Greco-Latin square experimental design (GLSED) is a structured methodology that assigns treatments within a two-dimensional matrix, with the stipulation that each treatment is represented exactly once in every row and column, thereby facilitating the control of three nuisance factors efficiently within the confines of limited resources and time constraints [2,3]. This design framework is especially advantageous in the fields of agriculture, medicine, and industry, where it is imperative to systematically test multiple variables to elevate the precision and reliability of research outcomes [4]. An exemplary case further elucidates the efficacy of GLSED, demonstrating its capacity to address complex experimental demands by ensuring comprehensive coverage and balanced representation of treatments across the experimental matrix, thus exemplifying its critical role in optimizing experimental strategies across diverse scientific disciplines.
According to Diawara et al. [5], the employment of a Greco-Latin square significantly streamlines the experimental process by reducing both the labor and costs associated with conducting experimental runs, thereby enabling a thorough and systematic investigation of all relevant factors. An illustrative analysis utilizing a square of order 3 accentuated the profound impact of variables such as flow rate, insulin type, pump type, and vibration on the quantity of insulin delivered, a finding of paramount importance for achieving precise dosing and averting long-term complications in diabetes patients. Further, Mahamud and Gomes [6] investigated the enzymatic saccharification of sugar cane bagasse using the Greco-Latin square design. They found optimal conditions at 2.5% substrates, 5.5 ml enzyme, pH 4.5, and 45℃, differing from previous conditions of 2.0% substrates, 5 ml enzyme, pH 5, and 50℃. This enhanced saccharification yields more fermentable sugar, improving the competitiveness of fuel ethanol production. In another application, Woodside and Pearce [7] examined the efficacy of shotblasting—a novel furnace tube cleaning technology—analyzing factors such as price, cleaning time, energy efficiency, and tube damage. Using a Greco-Latin square, the study adeptly condensed the experimental design from an unwieldy 81 scenarios in a full factorial setup to a more manageable nine-product design, revealing that all factors, except for cleaning time, exert a significant influence on market share. This insight offers strategic value to marketers seeking to enhance the industrial market penetration of shotblasting technology. Meanwhile, Tovar-Aguilar et al. [8] applied the Greco-Latin square design to evaluate the effectiveness of safety eyewear among citrus harvest workers in Florida, addressing concerns over lens fogging and dew. The study distinguished one type of safety glasses as exhibiting superior fog resistance, demonstrating the design's utility in minimizing biases and enhancing the reliability of results across various research contexts.
In the domain of design of experiments (DOE), both completely and incompletely randomized frameworks can encounter unrecorded observations, leading to an incomplete-data design. This phenomenon may arise from two primary causes: (1) the intentional exclusion of observation data at the outset, necessitated by the scarcity of experimental units [9], or (2) the emergence of unexpected circumstances or the presence of outliers. In the former case, the arrangement of observation data often adheres to a balanced DOE approach, benefiting from established formulas for computing treatment and error sums of squares, with exemplars of such designs including the Youden square design [10], the balanced incomplete block design (BIBD) [11], and the balanced incomplete Latin square design (BILSD) [12]. Conversely, in scenarios marked by the latter cause, outliers are pinpointed and excised, resulting in an unbalanced DOE. Distinguished from its balanced counterpart by the absence of pre-existing ANOVA formulas to directly address the unbalanced nature of the data, unbalanced DOEs employ the missing plot technique to estimate the unrecorded observations. Subsequently, the treatment and error sums of squares are computed utilizing tailor-made ANOVA formulas specifically devised for these more complex and irregular datasets [13,14], reflecting the comprehensive and adaptive methodologies required to maintain the integrity and validity of experimental research under such conditions.
Table 1 presents a systematic compilation of scholarly literature focused on the estimation of missing observations within the context of unbalanced designs, utilizing the traditional missing plot technique. This methodological approach, crucial for addressing data gaps in experimental designs, undergoes a thorough examination across different instances of unbalanced designs of experiments (DOEs) in the comprehensive reviews detailed in references [15] and [16]. These sources collectively offer an in-depth exploration of the conventional missing plot technique, shedding light on its application, efficacy, and limitations within the framework of unbalanced DOEs, by this means contributing to the broader academic discourse on experimental design and data estimation strategies.
Design strategy | Reference |
RCBD | [17,18] |
LSED | [17,18] |
Greco-Latin square experimental design (GLSED) | [19] |
BIBD | [20,21] |
Youden design | [10] |
Split plot design | [13] |
Moreover, the analysis of covariance (ANCOVA) [22] has been employed as a statistical technique for estimating missing observations within unbalanced designs, with notable contributions from scholars such as Coons [23], Cochran [24], and Wilkinson [25]. This approach enhances the precision of data analysis by incorporating covariates that may influence the dependent variable, thus providing a more nuanced understanding of experimental results. In a specific advancement within this domain, Ogbonnaya and Uzochukwu [26] developed specialized formulas aimed at estimating missing observations in scenarios involving one-factor ANCOVA with a single covariate. Furthermore, in the context of the Greco-Latin square experimental design (GLSED), where the challenge of a single missing data value arises, Kupolusi and Ojo [19] introduced a formula that utilizes minimizing the error sum of squares to estimate this missing value. Subsequently, the analysis of variance is conducted as though the dataset were complete, maintaining the integrity of the experimental design and allowing for the comprehensive interpretation of results. This methodological innovation signifies a critical step forward in addressing the complexities associated with missing data in experimental research, ensuring robust and reliable analysis despite the challenges posed by unbalanced designs.
The reliance on the conventional missing plot technique and ANCOVA for the estimation of missing observations, as opposed to utilizing actual experimental data, introduces a notable bias in the treatment sum of squares, as highlighted in studies [13,14]. This inherent bias necessitates a thorough estimation process to accurately identify and subsequently subtract it from the treatment sum of squares, which are initially derived from pre-existing, ready-made formulas. This corrective measure is crucial for ensuring the integrity and accuracy of statistical analyses within experimental research, addressing the challenges posed by incomplete datasets and preserving the validity of conclusions drawn from such studies.
The exact-scheme analysis of variance emerges as a formidable resolution for handling designs plagued by incomplete data, obviating the necessity for the estimation of missing values, as substantiated by the literature [1,27,28]. Termed alternatively as the model comparison-based exact scheme, this methodology guarantees robustness in statistical analysis that traditional approaches might not offer. Specifically, Montgomery [1] delineates a precise methodology designed to tackle the issue of a single missing value within the domains of randomized complete block design (RCBD) and Latin square experimental design (LSED) through the application of exact-scheme analysis of variance. This approach strategically circumvented the conventional reliance on data imputation, thus presenting a more direct and potentially unbiased avenue for analyzing experimental data under conditions of incompleteness. This innovation not only enhances the accuracy of statistical analyses but also aligns with the broader goal of maintaining methodological integrity in the face of missing experimental data.
Sirikasemsuk et al. [29] expanded Montgomery's [1] research to analyze RCBD with missing data, where the missing values could occur in any cell of the observation table. They developed mathematical formulas for fitted parameters and the regression sum of squares using the exact-scheme analysis of variance. Similarly, Sirikasemsuk and Leerojanaprapa [30] addressed incomplete 4 × 4 Latin square designs with various patterns of two missing observations, simplifying parameter estimation and ANOVA calculations. Furthermore, Sirikasemsuk [31] and Sirikasemsuk and Leerojanaprapa [32] have shed light on the unbiased calculation of the treatment sum of squares within the context of LSED when confronted with a single missing value, utilizing the aforementioned method.
To illustrate these concepts, let us revisit a real example of the Greco-Latin square experimental design (GLSED). Subramani [33] referenced a 5 × 5 GLSED with incomplete data from Montgomery [34] to study the effects of raw material batches, acid concentrations, standing times, and catalyst concentrations on the yield of a chemical process. In this study, Subramani [33] estimated several missing values using non-iterative least squares estimation. However, such approaches can lead to biased analyses of the sum of squares and mean squares, which may result in incorrect conclusions about the factors affecting the response variable. This bias has been demonstrated in several studies, including those by Rangaswamy [14], Little and Rubin [16], and Ott and Longnecker [35]. Recent research by AlAita et al. [36] further emphasizes the significance of missing data as a critical challenge in analyzing Greco-Latin square designs, highlighting the risks of bias and reduced reliability in conclusions.
Given these challenges, an improved method is essential to eliminate bias and enhance the reliability of experimental results in the presence of missing data. Their exploratory work in gap analysis accentuates a noticeable void in the existing literature, specifically the absence of definitive evidence concerning the unbiased treatment sum of squares in GLSED. This identified lacuna not only delineates the scope but also establishes the central focus of the current research, indicating a targeted investigation aimed at addressing this gap and contributing novel insights into the methodology for ensuring unbiased statistical analysis in GLSED settings.
Nevertheless, in the conventional framework of the exact scheme, the estimation of fitted parameters—comprising the overall mean, treatment, and block effects—alongside the calculation of the regression sums of squares (RSS) for both full and reduced effect models, emerges as a notably time-intensive process. Addressing this efficiency concern, the present research introduces a novel, exact scheme-based formula for the instantaneous computation of RSS within the full model of a P × P-dimension fixed-effect GLSED with a single missing observation. This innovation circumvents the traditional necessity for estimating fitted parameters, where P represents the levels of treatment and blocks. Table 2 provides a visualization of the GLSED table, underscoring the practical application of this formula. This advancement not only streamlines the analytical process but also enhances the methodological rigor by enabling more rapid derivation of results without compromising on the accuracy and integrity of the statistical analysis.
Colum Row | ψ1 | ψ2 | ψ3 | ψ4 | yi⋅⋅⋅ | |
1 | 2 | 3 | 4 | |||
θ1 | 1 | A∝=y1111 | Bβ=y1222 | Cγ=y1333 | Dδ=y1444 | y1⋅⋅⋅ |
θ2 | 2 | Bδ=y2241 | Aγ=y2132 | Dβ=y2423 | C∝=y2314 | y2⋅⋅⋅ |
θ3 | 3 | Cβ=y3321 | D∝=y3412 | Aδ=y3143 | Bγ=y3234 | y3⋅⋅⋅ |
θ4 | 4 | Dγ=y4421 | Cδ=y4342 | B∝=y4213 | Aβ=y4124 | y4⋅⋅⋅ |
y⋅⋅⋅l | y⋅⋅⋅1 | y⋅⋅⋅2 | y⋅⋅⋅3 | y⋅⋅⋅4 | y⋅⋅⋅⋅ |
The structure of this research paper is organized as follows: Section 1 introduces the foundational concepts and the overarching aim of the study. Section 2 delineates the model comparison-based exact scheme, offering a detailed examination of its theoretical underpinnings. In Section 3, the focus shifts to linear algebra equations and the methodologies employed for estimating the fitted parameters, crucial for the analytical framework of the study. Section 4 explores the intricacies of the regression sum of squares, a key component in the evaluation of model efficacy. Section 5 proposes an innovative exact scheme-based instant formula for calculating the Latin letter treatment sum of squares, a significant contribution to the field of experimental design. Finally, Section 6 presents concluding remarks, encapsulating the findings and implications of the research. This logical progression through the sections ensures a coherent and comprehensive exploration of the subject matter, facilitating a deeper understanding of the proposed methodologies and their potential impact on experimental design analysis.
The notations and definitions used in this research are as follows:
yijkl the observation of row i, Latin letter treatment j, Greek letter k, and column l;
P the levels or size of the Greco-Latin square experimental design;
i row index (i = 1, 2, 3, …, P);
j Latin letter treatment index (j = 1, 2, 3, …, P);
k Greek letter index (k = 1, 2, 3, …, P);
l column index (l = 1, 2, 3, …, P);
μ the overall mean;
θi the row effect of level i;
τj the Latin letter treatment effect of level j;
ωk the Greek letter effect of level k;
ψl the column effect of level l;
εijkl the statistical error due to other sources of variability;
y⋅⋅⋅⋅ the grand total;
yi⋅⋅⋅ the ith row total;
y⋅j⋅⋅ the jth Latin letter treatment total;
y⋅⋅k⋅ the kth Greek letter total;
y⋅⋅⋅l the lth column total;
r the row index of the missing observation;
n the Latin letter treatment index of the missing observation;
m the Greek letter index of the missing observation;
c the column index of the missing observation;
SSR the row sum of squares;
SSTr the Latin letter treatment sum of squares;
SSG the Greek letter sum of squares;
SSC the column sum of squares;
SST the total sum of squares;
SSE the error sum of squares;
R(μ,θ,τ,ω,ψ) the overall regression sum of squares (ORSS) for the full model;
R(μ,τ,ω,ψ) the regression sum of squares for the reduced model, omitting the row effect;
R(μ,θ,ω,ψ) the regression sum of squares for the reduced model, omitting the Latin letter treatment effect;
R(μ,θ,τ,ψ) the regression sum of squares for the reduced model, omitting the Greek letter effect;
R(μ,θ,τ,ω) the regression sum of squares for the reduced model, omitting the column effect.
The Latin letter treatment sum of squares (SSTr) for the complete-data GLSED can be calculated by
SSTr=∑Pj=1y2⋅j⋅⋅P−y2⋅⋅⋅⋅P2. | (2.1) |
However, Eq (2.1) is not applicable to the incomplete-data (unbalanced) design. Therefore, the model comparison-based exact scheme is used to solve the incomplete-data design. SSTr of the exact scheme is expressed as
SSTr=R(μ,θ,τ,ω,ψ)−R(μ,θ,ω,ψ). | (2.2) |
It is noted that the Latin letter treatment sum of squares (SSTr) is calculated as the difference between the full-model regression sum of squares and the reduced-model regression sum of squares, omitting the treatment effect [1,9,14]. Likewise, the row, column, and Greek letter sums of squares are calculated as the discrepancies between the regression sums of squares of the full and reduced models, excluding the corresponding effects. Hence, the sums of squares for the GLSED are determined as
SSR=R(μ,θ,τ,ω,ψ)−R(μ,τ,ω,ψ), | (2.3) |
SSG=R(μ,θ,τ,ω,ψ)−R(μ,θ,τ,ψ), | (2.4) |
SSC=R(μ,θ,τ,ω,ψ)−R(μ,θ,τ,ω). | (2.5) |
The GLSED full-model linear equation for yijkl is
yijkl=μ+θi+τj+ωk+ψl+εijkl{i=1,2,…,Pj=1,2,…,Pk=1,2,…,Pl=1,2,…,P. | (3.1) |
In the fixed-effect GLSED, representing constraints, the sums of the fitted parameters can be expressed as
∑PAlliˆθi=0, | (3.2) |
∑PAlljˆτj=0, | (3.3) |
∑PAllkˆωk=0, | (3.4) |
and
∑PAlllˆψl=0. | (3.5) |
The regression sum of squares (RSS) for the full model of yijkl is expressed as [1],
R(μ,θ,τ,ω,ψ)=ˆμy⋅⋅⋅⋅+∑Pi=1ˆθiyi⋅⋅⋅+∑Pj=1ˆτjy⋅j⋅⋅+∑Pk=1ˆωky⋅⋅k⋅+∑Pl=1ˆψly⋅⋅⋅l. | (3.6) |
In estimating the parametric values of the full model (ˆμ, ˆθi, ˆτj, ˆωk, ˆψl), a series of linear algebra equations for the P × P GLSED with one missing observation can be written as
μ:(P2−1)ˆμ+P∑Pi=1,i≠rˆθi+(P−1)ˆθr+P∑Pj=1,j≠nˆτj+(P−1)ˆτn+P∑Pk=1,k≠mˆωk+(P−1)ˆωm+P∑Pl=1,l≠cˆψl+(P−1)ˆψc=y⋅⋅⋅⋅, | (3.7) |
θr:(P−1)ˆμ+(P−1)ˆθr+∑Pj=1,j≠nˆτj+∑Pk=1,k≠mˆωk+∑Pl=1,l≠cˆψl=yr⋅⋅⋅, | (3.8) |
τn:(P−1)ˆμ+∑Pi=1,i≠rˆθi+(P−1)ˆτn+∑Pk=1,k≠mˆωk+∑Pl=1,l≠cˆψl=y⋅n⋅⋅, | (3.9) |
ωm:(P−1)ˆμ+∑Pi=1,i≠rˆθi+∑Pj=1,j≠nˆτj+(P−1)ˆωm+∑Pl=1,l≠cˆψl=y⋅⋅m⋅, | (3.10) |
ψc:(P−1)ˆμ+∑Pi=1,i≠rˆθi+∑Pj=1,j≠nˆτj+∑Pk=1,k≠mˆωk+(P−1)ˆψc=y⋅⋅⋅c, | (3.11) |
θi: Pˆμ+Pˆθi+∑Pj=1ˆτj+∑Pk=1ˆωk+∑Pl=1ˆψc=yi⋅⋅⋅ when i≠r, | (3.12) |
τj: Pˆμ+∑Pi=1ˆθi+Pˆτj+∑Pk=1ˆωk+∑Pl=1ˆψc=y⋅j⋅⋅ when j≠n, | (3.13) |
ωk: Pˆμ+∑Pi=1ˆθi+∑Pj=1ˆτj+Pˆωk+∑Pl=1ˆψc=y⋅⋅k⋅ when k≠m, | (3.14) |
ψl: Pˆμ+∑Pi=1ˆθi+∑Pj=1ˆτj+∑Pk=1ˆωk+Pˆψl=y⋅⋅⋅l when l≠c. | (3.15) |
Given Eqs (3.2)–(3.5) and the matrix form of Eqs (3.7)–(3.11), the fitted parameters for the P × P GLSED with one missing observation can be determined by
ˆμ=(P−4)y⋅⋅⋅⋅+yr⋅⋅⋅+y⋅n⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c(P−3)(P−1)P, | (3.16) |
ˆθr=yr⋅⋅⋅−y⋅⋅⋅⋅P+(P−1)ˆμ, | (3.17) |
ˆτn=y⋅n⋅⋅−y⋅⋅⋅⋅P+(P−1)ˆμ, | (3.18) |
ˆωm=y⋅⋅m⋅−y⋅⋅⋅⋅P+(P−1)ˆμ, | (3.19) |
ˆψc=y⋅⋅⋅c−y⋅⋅⋅⋅P+(P−1)ˆμ. | (3.20) |
Likewise, the fitted parameters for i≠r, j≠n, k≠m, and l≠c can be rewritten as
ˆθi=yi⋅⋅⋅P−ˆμ,for i≠r, | (3.21) |
ˆτj=y⋅j⋅⋅P−ˆμ,for j≠n, | (3.22) |
ˆωk=y⋅⋅k⋅P−ˆμ,for k≠m, | (3.23) |
and
ˆψl=y⋅⋅⋅lP−ˆμ,for l≠c. | (3.24) |
The full-model and reduced-model regression sums of squares (RSS) can be determined following Propositions 4.1 and 4.2, respectively.
Proposition 4.1. Given the P × P GLSED with one missing observation, the full-model RSS of yijkl can be calculated by
R(μ,θ,τ,ω,ψ)=(∑Alliy2i⋅⋅⋅+∑Alljy2⋅j⋅⋅+∑Allky2⋅⋅k⋅+∑Allly2⋅⋅⋅l)P+(1−P)(3y2⋅⋅⋅⋅−y2SUM)+(ySUM−3y⋅⋅⋅⋅)2P(P−3)(P−1), | (4.1) |
when ySUM=yr⋅⋅⋅+y⋅n⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c.
Proof. Substituting Eqs (3.17)–(3.24) in Eq (3.6), we have
R(μ,θ,τ,ω,ψ)=ˆμy⋅⋅⋅⋅+(∑Pi=1,i≠ry2i⋅⋅⋅+∑Pj=1,j≠ny2⋅j⋅⋅+∑Pk=1,k≠my2⋅⋅k⋅+∑Pl=1,l≠cy2⋅⋅⋅l)P |
−(ˆμ∑Pi=1,i≠ryi⋅⋅⋅+ˆμ∑Pj=1,j≠ny⋅j⋅⋅+ˆμ∑Pk=1,k≠my⋅⋅k⋅+ˆμ∑Pl=1,l≠cy⋅⋅⋅l)+ˆμ(P−1)(yr⋅⋅⋅+y⋅n⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c) |
+(y2r...+y2.n..+y2..m.+y2...c)P−y⋅⋅⋅⋅P(yr⋅⋅⋅+y⋅n⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c). |
Modifying all the summation terms to account for all indexes (i, j, k, l),
R(μ,θ,τ,ω,ψ)=(∑Pi=1y2i⋅⋅⋅+∑Pj=1y2⋅j⋅⋅+∑Pk=1y2⋅⋅k⋅+∑Pl=1y2⋅⋅⋅l)P+Pˆμ(yr⋅⋅⋅+y⋅n⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c) |
−y....P(yr⋅⋅⋅+y⋅n⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c)−3ˆμy⋅⋅⋅⋅, |
where ∑Pi=1yi⋅⋅⋅=∑Pj=1y⋅j⋅⋅=∑Pk=1y⋅⋅k⋅=∑Pl=1y⋅⋅⋅l=y⋅⋅⋅⋅.
For the sake of convenience,
ySUM=yr⋅⋅⋅+y⋅n⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c. |
The full-model RSS of yijkl can be rewritten as
R(μ,θ,τ,ω,ψ)=(∑Pi=1y2i⋅⋅⋅+∑Pj=1y2⋅j⋅⋅+∑Pk=1y2⋅⋅k⋅+∑Pl=1y2⋅⋅⋅l)P+PˆμySUM−y⋅⋅⋅⋅PySUM−3ˆμy⋅⋅⋅⋅. |
Substituting ˆμ (Eq (3.16)) into the above equation and rearranging, the full-model RSS can be derived as Eq (4.1).
This completes the proof.
Proposition 4.2. In the presence of a single missing observation in the P × P GLSED, the reduced-model regression sums of squares can be determined by
R(μ,τ,ω,ψ)=∑Pj=1y2⋅j⋅⋅+∑Pk=1y2⋅⋅k⋅+∑Pl=1y2⋅⋅⋅lP+(1−P)(2y2⋅⋅⋅⋅−y2SUM_R)+(ySUM_R−2y⋅⋅⋅⋅)2(P−2)(P−1)P, | (4.2) |
R(μ,θ,ω,ψ)=∑Pi=1y2i⋅⋅⋅+∑Pk=1y2⋅⋅k⋅+∑Pl=1y2⋅⋅⋅lP+(1−P)(2y2⋅⋅⋅⋅−y2SUM_N)+(ySUM_N−2y⋅⋅⋅⋅)2(P−2)(P−1)P, | (4.3) |
R(μ,θ,τ,ψ)=∑Pi=1y2i⋅⋅⋅+∑Pj=1y2⋅j⋅⋅+∑Pl=1y2⋅⋅⋅lP+(1−P)(2y2⋅⋅⋅⋅−y2SUM_M)+(ySUM_M−2y⋅⋅⋅⋅)2(P−2)(P−1)P, | (4.4) |
R(μ,θ,τ,ω)=∑Pi=1y2i⋅⋅⋅+∑Pj=1y2⋅j⋅⋅+∑Pk=1y2⋅⋅k⋅P+(1−P)(2y2⋅⋅⋅⋅−y2SUM_C)+(ySUM_C−2y⋅⋅⋅⋅)2(P−2)(P−1)P, | (4.5) |
when ySUM_R=y⋅n⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c, ySUM_N=yr⋅⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c, ySUM_M=yr⋅⋅⋅+y⋅n⋅⋅+y⋅⋅⋅c, and ySUM_C=yr⋅⋅⋅+y⋅n⋅⋅+y⋅⋅m⋅.
Proof. The determination of the reduced-model regression sums of squares can be conducted in an identical fashion to the case of R(μ,θ,τ,ω,ψ) as in Proposition 4.1. For example, let us consider a case of R(μ,θ,ω,ψ) (Eq (2.2)) omitting the Latin letter treatment effect, where the reduced-model linear equation for yi(j)kl is
yi(j)kl=μNT+θNTi+ωNTk+ψNTl+εijkl{i=1,2,…,Pj=1,2,…,Pk=1,2,…,Pl=1,2,…,P. |
It is presumed that Latin letter treatment effects (τj) are not accounted for across all values of j. Consequently, the estimated variables μ, θi, ωk, and ψl in Eq (3.1) will be replaced with ˆμNT, ˆθNTi, ˆωNTk, and ˆψNTl, respectively, as opposed to ˆμ, ˆθi, ˆωk, and ˆψl. The expression for R(μ,θ,ω,ψ) can be formulated as follows:
R(μ,θ,ω,ψ)=ˆμNTy⋅⋅⋅⋅+∑Pi=1ˆθNTiyi⋅⋅⋅+∑Pk=1ˆωNTky⋅⋅k⋅+∑Pl=1ˆψNTly⋅⋅⋅l. |
The subsequent estimated model parameters, namely ˆμNT, ˆθNTi, ˆωNTk, and ˆψNTl, were obtained through a sequence of linear algebraic equations, as outlined below:
ˆμNT=(P−3)y⋅⋅⋅⋅+yr⋅⋅⋅+y⋅⋅m⋅+y⋅⋅⋅c(P−2)(P−1)P, |
ˆθNTr=yr⋅⋅⋅−y⋅⋅⋅⋅P+(P−1)ˆμNT, |
ˆωNTm=y⋅⋅m⋅−y⋅⋅⋅⋅P+(P−1)ˆμNT, |
ˆψNTc=y⋅⋅⋅c−y⋅⋅⋅⋅P+(P−1)ˆμNT, |
ˆθNTi=yi⋅⋅⋅P−ˆμNTfori≠r, |
ˆωNTk=y⋅⋅k⋅P−ˆμNTfork≠m, |
ˆψNTl=y⋅⋅⋅lP−ˆμNTforl≠c. |
Replacing these values in the expression for R(μ,θ,ω,ψ) and after some algebraic simplification, we have Eq (4.3). This completes the proof.
Proposition 5.1. In the P × P GLSED with one missing experimental data, the unbiased Latin letter treatment sum of squares can be determined as
SSTr=∑Pj=1y2⋅j⋅⋅P+1(P−1)[(ySUM−y⋅⋅⋅⋅)2(P−3)−(ySUM_N−y⋅⋅⋅⋅)2(P−2)+y⋅⋅⋅⋅(2y⋅n⋅⋅−y⋅⋅⋅⋅)P]. | (5.1) |
Additionally, the determination of unbiased sums of squares for the rows, Greek letters, and columns can be established as follows:
SSR=∑Pi=1y2i⋅⋅⋅P+1(P−1)[(ySUM−y⋅⋅⋅⋅)2(P−3)−(ySUM_R−y⋅⋅⋅⋅)2(P−2)+y⋅⋅⋅⋅(2yr⋅⋅⋅−y⋅⋅⋅⋅)P], | (5.2) |
SSG=∑Pk=1y2⋅⋅k⋅P+1(P−1)[(ySUM−y⋅⋅⋅⋅)2(P−3)−(ySUM_M−y⋅⋅⋅⋅)2(P−2)+y⋅⋅⋅⋅(2y⋅⋅m⋅−y⋅⋅⋅⋅)P], | (5.3) |
SSC=∑Pl=1y2⋅⋅⋅lP+1(P−1)[(ySUM−y⋅⋅⋅⋅)2(P−3)−(ySUM_C−y⋅⋅⋅⋅)2(P−2)+y⋅⋅⋅⋅(2y⋅⋅⋅c−y⋅⋅⋅⋅)P]. | (5.4) |
Proof. Equation (4.1) in Proposition 4.1 can be rearranged as:
R(μ,θ,τ,ω,ψ)=(∑PAlliy2i⋅⋅⋅+∑PAlljy2⋅j⋅⋅+∑PAllky2⋅⋅k⋅+∑PAllly2⋅⋅⋅l)P |
+(3y2⋅⋅⋅⋅−y2SUM−3Py2⋅⋅⋅⋅+Py2SUM)+(y2SUM−6ySUMy⋅⋅⋅⋅+9y2⋅⋅⋅⋅)P(P−3)(P−1) |
=(∑PAlliy2i⋅⋅⋅+∑PAlljy2⋅j⋅⋅+∑PAllky2⋅⋅k⋅+∑PAllly2⋅⋅⋅l)P+(Py2SUM−2PySUMy⋅⋅⋅⋅+Py2⋅⋅⋅⋅)P(P−3)(P−1) |
+(2PySUMy⋅⋅⋅⋅−4Py2⋅⋅⋅⋅−6ySUMy⋅⋅⋅⋅+12y2⋅⋅⋅⋅)P(P−3)(P−1), |
R(μ,θ,τ,ω,ψ)=(∑PAlliy2i⋅⋅⋅+∑PAlljy2⋅j⋅⋅+∑PAllky2⋅⋅k⋅+∑PAllly2⋅⋅⋅l)P+[(ySUM−y⋅⋅⋅⋅)2(P−3)(P−1)]+[(2PySUMy⋅⋅⋅⋅−4y2⋅⋅⋅⋅)P(P−1)]. | (5.5) |
Equation (4.3) in Proposition 4.2 can be rearranged as:
R(μ,θ,ω,ψ)=(∑PAlliy2i⋅⋅⋅+∑PAllky2⋅⋅k⋅+∑PAllly2⋅⋅⋅l)P+(2y2⋅⋅⋅⋅−y2SUM_N−2Py2⋅⋅⋅⋅+Py2SUM_N)+(y2SUM_N−4ySUMy⋅⋅⋅⋅+4y2⋅⋅⋅⋅)P(P−2)(P−1) |
=(∑PAlliy2i⋅⋅⋅+∑PAllky2⋅⋅k⋅+∑PAllly2⋅⋅⋅l)P+(Py2SUMN−2PySUMNy….+Py2….)P(P−2)(P−1) |
+(2PySUM_Ny⋅⋅⋅⋅−3Py2⋅⋅⋅⋅−4ySUM_Ny⋅⋅⋅⋅+6y2⋅⋅⋅⋅)P(P−2)(P−1), |
R(μ,θ,ω,ψ)=(∑PAlliy2i⋅⋅⋅+∑PAllky2⋅⋅k⋅+∑PAllly2⋅⋅⋅l)P+(ySUM_N−y⋅⋅⋅⋅)2(P−2)(P−1)+(2PySUM_Ny⋅⋅⋅⋅−3y2⋅⋅⋅⋅)P(P−1). | (5.6) |
According to Eq (2.2), the Latin letter treatment sum of squares in Eq (5.1) can be deduced by subtracting Eq (5.5) from Eq (5.6). The calculations for the row, Greek letter, and column sums of squares are similarly performed for the Latin letter treatment sum of squares above. This completes the proof.
The technique employed for estimating missing data through the least squares method constitutes one of various approaches to facilitate the derivation of the analysis of variance based on the original formula. Before proceeding with additional result analysis, it is essential to rectify the bias through subtraction, as detailed in Proposition 5.2.
Proposition 5.2. In the P × P GLSED with a singular missing data point, the formula for bias adjustment concerning the sum of squares for the Latin letter treatment, applied subsequent to the estimation of missing data through the least squares method, is established as follows:
Bias=(y⋅⋅⋅⋅−ySUM−(p−3)y⋅n⋅⋅)2(P−3)2(P−2)(P−1). | (5.7) |
Proof. We have
Bias=SSTr(bias)−SSTr(unbias)_exact, | (5.8) |
where SSTr(unbias)_exact represents the treatment sum of squares without bias, calculated from Eq (5.1) in Proposition 5.1 and SSTr(bias) denotes the treatment sum of squares with bias after estimating missing data via the least squares method, as outlined by Kupolusi and Ojo [19] in the following Eq (5.9) for the missing value (Z) estimate:
Z=PySUM−3y⋅⋅⋅⋅(P−3)(P−1). | (5.9) |
Hence, Eq (2.1) for the complete data can be rewritten as
SSTr=∑Pj=1,j≠ny2⋅j⋅⋅+(y⋅n⋅⋅+Z)2P−(y⋅⋅⋅⋅+Z)2P2, | (5.10) |
where the symbol y⋅⋅⋅⋅ represents the grand total, exclusive of the value represented by Z.
It is noted that the inaugural expression in Eq (5.1) is subtracted from the initial term in Eq (5.10), resulting in the value of 2Zy⋅n⋅⋅+Z2P.
Substituting (5.1), (5.9), and (5.10) into (5.8), we obtain the following:
Bias=2y⋅n⋅⋅(PySUM−3y⋅⋅⋅⋅(P−3)(P−1))P+(PySUM−3y⋅⋅⋅⋅(P−3)(P−1))2P−(y⋅⋅⋅⋅+PySUM−3y⋅⋅⋅⋅(P−3)(P−1))2P2 |
−(ySUM−y⋅⋅⋅⋅)2(P−3)(P−1)+(ySUMN−y⋅⋅⋅⋅)2(P−2)(P−1)−y⋅⋅⋅⋅(2y⋅n⋅⋅−y⋅⋅⋅⋅)(P−1)P. |
The initial three terms may be reformulated as follows:
2(P−3)(pySUM−3y⋅⋅⋅⋅)y⋅n⋅⋅+py2SUM−2pySUMy⋅⋅⋅⋅−((p−7)p+9)y2⋅⋅⋅⋅(P−3)2(P−1)P=2(P−3)y⋅n⋅⋅ySUM+y2SUM−2ySUMy⋅⋅⋅⋅−(p−7)y2⋅⋅⋅⋅(P−3)2(P−1)+−6(P−3)y⋅n⋅⋅y⋅⋅⋅⋅−9y2⋅⋅⋅⋅(P−3)2(P−1)P=(P−2)∗[2(P−3)y⋅n⋅⋅ySUM+y2SUM−2ySUMy⋅⋅⋅⋅−(p−7)y2⋅⋅⋅⋅(P−3)2(P−2)(P−1)+−6(P−3)y⋅n⋅⋅y⋅⋅⋅⋅−9y2⋅⋅⋅⋅(P−3)2(P−2)(P−1)P]. |
The concluding trio of terms can be rephrased as follows:
(P−3)Py2⋅n⋅⋅−2(P−3)(PySUM−2y⋅⋅⋅⋅)y⋅n⋅⋅+P2y2⋅⋅⋅⋅+P(−y2SUM+2ySUMy⋅⋅⋅⋅−6y2⋅⋅⋅⋅)+6y2⋅⋅⋅⋅(P−3)(P−2)(P−1)P=(P−3)y2⋅n⋅⋅−2(P−3)y⋅n⋅⋅ySUM+Py2⋅⋅⋅⋅+(−y2SUM+2ySUMy⋅⋅⋅⋅−6y2⋅⋅⋅⋅)(P−3)(P−2)(P−1)+4(P−3)y⋅n⋅⋅y⋅⋅⋅⋅+6y2⋅⋅⋅⋅(P−3)(P−2)(P−1)P=(P−3)∗[(P−3)y2⋅n⋅⋅−2(P−3)y⋅n⋅⋅ySUM+Py2⋅⋅⋅⋅+(−y2SUM+2ySUMy⋅⋅⋅⋅−6y2⋅⋅⋅⋅)(P−3)2(P−2)(P−1)+4(P−3)y⋅n⋅⋅y⋅⋅⋅⋅+6y2⋅⋅⋅⋅(P−3)2(P−2)(P−1)P]. |
Finally, we have
Bias=P2y2⋅n⋅⋅−6Py2⋅n⋅⋅+9y2⋅n⋅⋅+2Py⋅n⋅⋅ySUM−4Py⋅n⋅⋅y⋅⋅⋅⋅−6y⋅n⋅⋅ySUM+12y⋅n⋅⋅y⋅⋅⋅⋅+y2SUM−2ySUMy⋅⋅⋅⋅+4y2⋅⋅⋅⋅(P−3)2(P−2)(P−1)+−3y2⋅⋅⋅⋅+2(p−3)y⋅n⋅⋅y⋅⋅⋅⋅(P−3)2(P−2)(P−1)=(y2⋅⋅⋅⋅−2ySUMy⋅⋅⋅⋅+y2SUM)−2(p−3)y⋅n⋅⋅y⋅⋅⋅⋅+2(p−3)y⋅n⋅⋅ySUM+(P−3)2y2⋅n⋅⋅(P−3)2(P−2)(P−1)=(y⋅⋅⋅⋅−ySUM)2−2(y⋅⋅⋅⋅−ySUM)((p−3)y⋅n⋅⋅)+((p−3)y⋅n⋅⋅)2(P−3)2(P−2)(P−1),Bias=(y⋅⋅⋅⋅−ySUM−(p−3)y⋅n⋅⋅)2(P−3)2(P−2)(P−1). |
This completes the proof.
The following content presents a comparison of two methods for solving the incomplete data problem: Exact-scheme analysis of variance without data imputation and estimating missing data through the least squares method. This comparison is based on the three case study examples.
Case Study 1: A 4 × 4 GLSED adapted from Subramani [33] examined the influence of four different television assembly methods on assembly time, as shown in Table 3. In this experiment, assembly methods were represented by Latin letter treatments, workstations by Greek letters, assembly order by rows, and workers by columns.
Column Row | ψ1 | ψ2 | ψ3 | ψ4 | |
1 | 2 | 3 | 4 | ||
θ1 | 1 | Cβ= 11 | Bγ= 10 | Dδ= 14 | A∝= 8 |
θ2 | 2 | B∝= 8 | Cδ= 12 | Aγ= 10 | Dβ= 12 |
θ3 | 3 | Aδ= 9 | D∝= 11 | Bβ= missing value | Cγ= 15 |
θ4 | 4 | Dγ= 9 | Aβ= 8 | C∝= 18 | Bδ= 6 |
Case Study 2: A 5 × 5 GLSED adapted from Montgomery [34] examined the influence of five different time intervals on the yield of various chemical processes, as shown in Table 4. In this study, the time intervals were represented as Latin letter treatments, catalyst concentrations as Greek letters, raw materials as rows, and acid concentrations as columns.
Column Row | ψ1 | ψ2 | ψ3 | ψ4 | ψ5 | |
1 | 2 | 3 | 4 | 5 | ||
θ1 | 1 | A∝= missing value | Bβ= 16 | Cγ= 19 | Dδ= 16 | Eε= 13 |
θ2 | 2 | Bγ= 18 | Cδ= 21 | Dε= 18 | E∝= 11 | Aβ= 21 |
θ3 | 3 | Cε= 20 | D∝= 12 | Eβ= 16 | Aγ= 25 | Bδ= 13 |
θ4 | 4 | Dβ= 15 | Eγ= 15 | Aδ= 22 | Bε= 14 | C∝= 17 |
θ5 | 5 | Eδ= 10 | Aε= 24 | B∝= 17 | Cβ= 17 | Dγ= 14 |
Case Study 3: A 7 × 7 GLSED adapted from Hinkelmann and Kempthorne [37] examined the influence of seven different levels of lysine percentages in the diet on milk production in cows, as shown in Table 5. In this study, the percentage of lysine in the diet was represented by Latin letter treatments, the percentage of protein in the diet by Greek letters, cows by rows, and duration by columns.
Column Row |
ψ1 | ψ2 | ψ3 | ψ4 | ψ5 | ψ6 | ψ7 | |
1 | 2 | 3 | 4 | 5 | 6 | 7 | ||
θ1 | 1 | A∝= 304 | Bε= 436 | Cβ= 350 | Dφ= 504 | Eχ= 417 | Fγ= 519 | Gδ= 432 |
θ2 | 2 | Bβ= 381 | Cφ= 505 | Dχ= 425 | Eγ= 564 | Fδ= 494 | G∝= 350 | Aε= 413 |
θ3 | 3 | Cχ= 432 | Dγ= 566 | Eδ= 479 | F∝= 357 | Gε= 461 | Aβ= 340 | Bφ= 502 |
θ4 | 4 | Dδ= 442 | E∝= 372 | Fε= 536 | Gβ= 366 | Aφ= 495 | Bχ= 425 | Cγ= 507 |
θ5 | 5 | Eε= 496 | Fβ= 449 | Gφ= 493 | Aχ= 345 | Bγ= 509 | Cδ= 481 | D∝= 380 |
θ6 | 6 | Fφ= 534 | Gχ= 421 | Aγ= 452 | Bδ= 427 | C∝= 346 | Dε= 478 | Eβ= 397 |
θ7 | 7 | Gγ= 543 | Aδ= 386 | B∝= 435 | Cε= 485 | Dβ= 406 | Eφ= 554 | Fχ= missing value |
Using Eq (5.9), the least squares method for estimating missing data yields values of 15, 21, and 474.38 for Case Studies 1, 2, and 3, respectively. The comparison of the Latin letter treatment sum of squares between the exact method and the least squares method is shown in Table 6.
Case Study | Exact-scheme analysis | Estimating missing data via least squares method | Bias |
1 | SSTr(unbias)_exact = 59.333 | SSTr(bias) = 63.50 | 4.167 |
2 | SSTr(unbias)_exact = 217.467 | SSTr(bias) = 282.80 | 65.333 |
3 | SSTr(unbias)_exact = 32, 704 | SSTr(bias) = 34, 620 | 1916 |
Table 6 shows that the Latin letter treatment sum of squares calculated using the least squares method is biased. However, this bias can be corrected using a bias adjustment formula (Eq (5.7)), or the exact method can be used for an unbiased estimate.
From the three examples, the analysis of variance results can be demonstrated without bias, where the sums of squares are calculated from Proposition 5.1, as shown in Tables 7–9.
Source of variation | Sum of squares | Degrees of freedom | Mean square | F0 |
Latin letter treatment | 59.333 | 3 | 19.778 | 2.55 |
Greek letter | 2.833 | 3 | 0.944 | |
Rows | 6.500 | 3 | 2.167 | |
Columns | 30.833 | 3 | 10.278 | |
Error | 15.500 | 2 | 7.750 | |
Total | 136.933 | 14 |
Source of variation | Sum of squares | Degrees of freedom | Mean square | F0 |
Latin letter treatment | 217.467 | 4 | 54.367 | 9.81 |
Greek letter | 17.917 | 4 | 4.479 | |
Rows | 6.000 | 4 | 1.500 | |
Columns | 22.317 | 4 | 5.579 | |
Error | 38.800 | 7 | 5.543 | |
Total | 355.333 | 23 |
Source of variation | Sum of squares | Degrees of freedom | Mean square | F0 |
Latin letter treatment | 32, 704 | 6 | 5, 450.7 | 9.28 |
Greek letter | 155, 215 | 6 | 25, 869.1 | |
Rows | 7, 412 | 6 | 1, 235.4 | |
Columns | 1, 270 | 6 | 211.7 | |
Error | 13, 515 | 23 | 587.6 | |
Total | 213, 217 | 47 |
The exact method provides an unbiased estimate of both the Latin letter treatment sum of squares and the error sum of squares (Montgomery [1]). This method employs the general regression significance test, which is equivalent to the exact-scheme analysis of variance without data imputation. A more comprehensive validation of this approach was demonstrated through the following simulation studies.
Missing values within plausible ranges, based on each case study, are assumed and generated to estimate the sums of squares for Latin letter treatment and error as shown in Figures 1 and 2 for Case Study 1, Figures 3 and 4 for Case Study 2, and Figures 5 and 6 for Case Study 3. In Case Study 1 (Figure 1), the estimated missing value of 16.98, determined using simulation studies, results in the sum of squares for Latin letter treatment aligning with the exact-scheme method. However, the sum of squares for error is biased and does not match the exact-scheme results. For Figure 2, an estimated value of 15 (or derived using the least squares method) yields a sum of squares for error consistent with the exact-scheme analysis. Nonetheless, the sum of squares for Latin letter treatment remains biased and does not align with the exact-scheme results.
In Case Studies 2 (Figures 3 and 4) and 3 (Figures 5 and 6), similar patterns are observed. There is a high probability that replacing a single missing data introduces bias either in the Latin letter treatment sum of squares or in the error sum of squares.
Therefore, it remains challenging for various missing data estimation methods to consistently produce unbiased sums of squares for both Latin letter treatment and error that correspond to those calculated by the exact-scheme analysis of variance.
Handling multiple missing values requires developing linear algebra equations for fitted parameters in both reduced and full models, as well as recalculating regression sums of squares. Each scenario presents unique equations based on the number and pattern of missing data. We recommend this as an area for future research, as expanding the exact-scheme methodology to address multiple missing data points in GLSED necessitates comprehensive study.
In conclusion, the Greco-Latin square experimental design (GLSED) showcases substantial benefits across a variety of fields, including agriculture, medicine, and industry, affirming its effectiveness in conducting systematic tests on multiple variables within the confines of limited resources and time. Within the framework of Fisher's design of experiments (DOE), the prevalent use of the missing plot technique for estimating missing observations is acknowledged, although it is noted to introduce a positive bias in the treatment sum of squares. Addressing this limitation, the present study introduces a novel model comparison-based exact scheme specifically designed for GLSED configurations encountering a single missing observation, a methodology that is applicable regardless of the levels of treatment and blocks. This advancement not only enhances the accuracy of experimental analyses but also broadens the utility of GLSED by mitigating the inherent bias associated with traditional estimation techniques, thereby contributing a significant methodological innovation to the field of experimental design.
The introduction of the proposed exact scheme represents a significant advancement in the field of experimental design, specifically addressing the limitations inherent in GLSED with a single missing observation. This innovative approach effectively obviates the conventional necessity for estimating missing observations, thus significantly reducing the potential for bias in the treatment sum of squares. A critical gap in existing methodologies— the absence of a readily accessible exact-scheme formula tailored to GLSED scenarios with an unrecorded value—is bridged by this study through the development of an instant formula. This formula, grounded in the exact scheme, is adeptly designed for calculating the sums of squares in P × P fixed-effect GLSED configurations without necessitating the estimation of fitted parameters. Additionally, this research enriches the methodological arsenal available to researchers by providing formulas for both full-model and reduced-model regression sums of squares, facilitating the comprehensive development of sums of squares and ANOVA tables tailored to GLSED contexts with missing data. This methodological innovation not only enhances the precision of statistical analyses but also broadens the scope of experimental designs that can be accurately analyzed under conditions of incomplete data.
This paper articulates and implements a formulated bias adjustment, specifically targeting the sum of squares, after the estimation of missing data via the least squares method. Through a thorough mathematical exposition, it lays out a systematic methodology for rectifying the bias induced by missing data within the ambit of the GLSED model. It emphasizes the critical necessity for researchers and practitioners within the domains of design of experiments, analysis of variance, and experimental design to be fully aware of the limitations inherent in prevailing methods. Moreover, it highlights the unique advantages offered by the model comparison-based exact scheme. This approach not only addresses the bias in a rigorous and structured manner but also signifies a paradigm shift in how missing data are treated, moving toward a more accurate and reliable analysis that fundamentally enhances the integrity of experimental outcomes.
In random-effects models, the expected values of model parameters are zero; however, the sums of squares for random-effects models can be calculated in the same way as for fixed-effects models, yielding the same final formula as shown in Proposition 5.1. Thus, the proposed method in this study is applicable not only to random-effects models but also to mixed-effects models, as the sums of squares remain valid under these conditions. Furthermore, the Greco-Latin square experimental design (GLSED) does not account for interactions between blocks or between blocks and treatment factors, which means that the mean square formulas for random-effects models are identical to those for fixed-effects models (Freund et al. [38]). This simplifies the analysis, allowing the methodology to be applied consistently in both models without requiring further modifications.
While encountering more than one missing value is possible in well-designed experiments, it is less frequent than single missing values, as researchers typically aim to complete their experiments successfully for accurate analysis. This study focuses on the case of a single missing value due to the complexity of deriving unbiased sums of squares in the exact-scheme analysis. Expanding this methodology to accommodate multiple missing data points is a promising direction for future research, requiring the development of mathematical frameworks to handle various missing data patterns. Previous work by Sirikasemsuk and Leerojanaprapa [29], which provided formulas for unbiased treatment sums of squares in Latin square designs with two missing values, lays a strong foundation for these extensions.
Kittiwat Sirikasemsuk: Conceptualization, methodology, formal analysis, writing – original draft, supervision, writing – review & editing; Sirilak Wongsriya: Formal analysis, investigation, methodology, provided the example of comparison; Kanogkan Leerojanaprapa: Conceptualization, methodology, data curation, validation, writing – review & editing, provided the example of comparison. All authors have read and approved the final version of the manuscript for publication.
The authors sincerely thank Ms. Sirilak Wongsriya for her significant contributions to this research. Her 2018 master's thesis, "A Comparison of Treatment Sum of Squares for Greco-Latin Square Design with One Missing Observation, " completed at King Mongkut's Institute of Technology Ladkrabang, Thailand, provided partial data and valuable insights that enhanced our study. Dr. Kittiwat Sirikasemsuk, her advisor, also offered critical guidance in developing her research. Ms. Wongsriya's dedication and analysis greatly enriched this work.
The authors declare that there are no conflicts of interest regarding the publication of this paper.
[1] | D. C. Montgomery, Design and analysis of experiments, 10th Eds., John Wiley & Sons, 2019. |
[2] | R. E. Kirk, Experimental design: Procedures for the behavioral sciences, 4th Eds., SAGE Publications Inc., 2013. https://doi.org/10.4135/9781483384733 |
[3] | R. A. Johnson, G. K. Bhattacharyya, Statistics: Principles and methods, 7th Eds., New Jersey: John Wiley & Sons, 2014. |
[4] | G. Canavos, J. Koutrouvelis, Introduction to the design & analysis of experiments, 1st Ed., Pearson, 2008. |
[5] |
N. Diawara, A. Demuren, E. Gyuricsko, Impairment of continuous insulin delivery therapy and analysis from graeco-latin square design model, J. Biosci. Med., 4 (2016), 40–51. https://doi.org/10.4236/jbm.2016.48006 doi: 10.4236/jbm.2016.48006
![]() |
[6] |
M. R. Mahamud, D. J. Gomes, Enzymatic saccharification of sugar cane bagasse by the crude enzyme from indigenous fungi, J. Sci. Res., 4 (2012), 227. https://doi.org/10.3329/jsr.v4i1.7745 doi: 10.3329/jsr.v4i1.7745
![]() |
[7] |
A. G. Woodside, W. G. Pearce, Testing market segment acceptance of new designs of industrial services, J. Prod. Innovat. Manag., 6 (1989), 185–201. https://doi.org/10.1111/1540-5885.630185 doi: 10.1111/1540-5885.630185
![]() |
[8] |
J. A. Tovar-Aguilar, P. F. Monaghan, C. A. Bryant, A. Esposito, M. Wade, O. Ruíz-Barzola, et al., Improving eye safety in citrus harvest crews through the acceptance of personal protective equipment, community-based participatory research, social marketing, and community health workers, J. Agromedicine, 19 (2014), 107–116. https://doi.org/10.1080/1059924x.2014.884397 doi: 10.1080/1059924x.2014.884397
![]() |
[9] | R. Mead, S. G. Gilmour, A. Mead, Statistical principles for the design of experiments: Applications to real experiments, Cambridge University Press, 2012. https://doi.org/10.1017/CBO9781139020879 |
[10] | W. J. Youden, Use of incomplete block replications in estimating tobacco-mosaic virus, Contrib. Boyce Thomps., 9 (1937), 41–48. |
[11] |
F. Yates, Incomplete randomized blocks, Ann. Eugen., 7 (1936), 121–140. https://doi.org/10.1111/j.1469-1809.1936.tb02134.x doi: 10.1111/j.1469-1809.1936.tb02134.x
![]() |
[12] |
M. Ai, K. Li, S. Liu, D. K. J. Lin, Balanced incomplete Latin square designs, J. Statist. Plann. Inference, 143 (2013), 1575–1582. https://doi.org/10.1016/j.jspi.2013.05.001 doi: 10.1016/j.jspi.2013.05.001
![]() |
[13] |
R. L. Anderson, Missing-plot techniques, Biometrics Bull., 2 (1946), 41–47. https://doi.org/10.2307/3001999 doi: 10.2307/3001999
![]() |
[14] | R. Rangaswamy, A textbook of agricultural statistics, 2nd Eds., New Age International, 2010. |
[15] |
K. Sirikasemsuk, A review on incomplete Latin square design of any order, AIP Conf. Proc., 1775 (2016), 030022. https://doi.org/10.1063/1.4965142 doi: 10.1063/1.4965142
![]() |
[16] | R. J. A. Little, D. B. Rubin, Statistical analysis with missing data, 3rd Eds., John Wiley & Sons, 2019. |
[17] |
F. E. Allan, J. Wishart, A method of estimating the yield of a missing plot in field experimental work, J. Agri. Sci., 20 (1930), 399–406. https://doi.org/10.1017/S0021859600006912 doi: 10.1017/S0021859600006912
![]() |
[18] | F. Yates, The analysis of replicated experiments when the field results are incomplete, Emprie J. Exp. Agri., 1 (1933), 129–142. |
[19] | J. A. Kupolusi, O. O. Ojo, One missing observation in graeco Latin square design: An approximate analysis of variance, Amer. Based Res. J., 10 (2021), 1–8. |
[20] |
E. A. Cornish, The estimation of missing values in incomplete randomized block experiments, Ann. Eugen., 10 (1940), 112–118. https://doi.org/10.1111/j.1469-1809.1940.tb02240.x doi: 10.1111/j.1469-1809.1940.tb02240.x
![]() |
[21] |
H. R. Baird, C. Y. Kramer, Analysis of variance of a balanced incomplete block design with missing observations, J. Roy. Statist. Soc. Ser. C, 9 (1960), 189–198. https://doi.org/10.2307/2985719 doi: 10.2307/2985719
![]() |
[22] |
M. S. Bartlett, Some examples of statistical methods of research in agriculture and applied biology, J. R. Stat. Soc., 4 (1937), 137–183. https://doi.org/10.2307/2983644 doi: 10.2307/2983644
![]() |
[23] |
I. Coons, The analysis of covariance as a missing plot technique, Biometrics, 13 (1957), 387–405. https://doi.org/10.2307/2527922 doi: 10.2307/2527922
![]() |
[24] |
W. G. Cochran, Analysis of covariance: Its nature and uses, Biometrics, 13 (1957), 261–281. https://doi.org/10.2307/2527916 doi: 10.2307/2527916
![]() |
[25] |
G. N. Wilkinson, Estimation of missing values for the analysis of incomplete data, Biometrics, 14 (1958), 257–286. https://doi.org/10.2307/2527789 doi: 10.2307/2527789
![]() |
[26] |
C. E. Ogbonnaya, E. C. Uzochukwu, Estimation of missing data in analysis of covariance: A least-squares approach, Commun. Stat. Theory Methods, 45 (2016), 1902–1909. https://doi.org/10.1080/03610926.2013.868000 doi: 10.1080/03610926.2013.868000
![]() |
[27] | M. H. Kutner, C. J. Nachtsheim, J. Neter, W. Li, Applied linear statistical models, 5th Eds., New York: McGraw-Hill Irwin, 2005. |
[28] | G. P. Quinn, M. J. Keough, Experimental design and data analysis for biologists, 1st Ed., Cambridge University Press, 2002. https://doi.org/10.1017/CBO9780511806384 |
[29] |
K. Sirikasemsuk, K. Leerojanaprapa, S. Sirikasemsuk, Regression sum of squares of randomized complete block design with one unrecorded observation, AIP Conf. Proc., 2016 (2018), 020136. https://doi.org/10.1063/1.5055538 doi: 10.1063/1.5055538
![]() |
[30] | K. Sirikasemsuk, K. Leerojanaprapa, Analysis of two-missing-observation 4×4 Latin squares using the exact approach, In: Recent advances in information and communication technology 2017, Cham: Springer, 566 (2018), 69–81. https://doi.org/10.1007/978-3-319-60663-7_7 |
[31] | K. Sirikasemsuk, One missing value problem in Latin square design of any order: Regression sum of squares, In: 2016 Joint 8th international conference on soft computing and intelligent systems (SCIS) and 17th international symposium on advanced intelligent systems (ISIS), Japan: IEEE, 2016,142–147. https://doi.org/10.1109/SCIS-ISIS.2016.0041 |
[32] |
K. Sirikasemsuk, K. Leerojanaprapa, One missing value problem in Latin square design of any order: Exact analysis of variance, Cogent Eng., 4 (2017), 1411222. https://doi.org/10.1080/23311916.2017.1411222 doi: 10.1080/23311916.2017.1411222
![]() |
[33] |
J. Subramani, Non-iterative least squares estimation of missing values in graeco-Latin square designs, Biometrical J., 33 (1991), 763–769. https://doi.org/10.1002/bimj.4710330619 doi: 10.1002/bimj.4710330619
![]() |
[34] | D. C. Montgomery, Design and analysis of experiments, John Wiley & Sons, 1984. |
[35] | R. Ott, M. Longnecker, An introduction to statistical methods and data analysis, 7th Eds., Cengage Learning, 2021. |
[36] |
A. AlAita, M. Aslam, K. Al Sultan, M. Saleem, Analysis of graeco-latin square designs in the presence of uncertain data, J. Big Data, 11 (2024), 109. https://doi.org/10.1186/s40537-024-00970-1 doi: 10.1186/s40537-024-00970-1
![]() |
[37] | K. Hinkelmann, O. Kempthorne, Design and analysis of experiments: Introduction to experimental design, John Wiley & Sons, 2007. |
[38] | R. J. Freund, W. J. Wilson, D. L. Mohr, Statistical methods, student solutions manual (e-only), Academic Press, 2010. Available from: http://www.sars-expertcom.gov.hk/english/reports/reports.html |
Colum Row | ψ1 | ψ2 | ψ3 | ψ4 | yi⋅⋅⋅ | |
1 | 2 | 3 | 4 | |||
θ1 | 1 | A∝=y1111 | Bβ=y1222 | Cγ=y1333 | Dδ=y1444 | y1⋅⋅⋅ |
θ2 | 2 | Bδ=y2241 | Aγ=y2132 | Dβ=y2423 | C∝=y2314 | y2⋅⋅⋅ |
θ3 | 3 | Cβ=y3321 | D∝=y3412 | Aδ=y3143 | Bγ=y3234 | y3⋅⋅⋅ |
θ4 | 4 | Dγ=y4421 | Cδ=y4342 | B∝=y4213 | Aβ=y4124 | y4⋅⋅⋅ |
y⋅⋅⋅l | y⋅⋅⋅1 | y⋅⋅⋅2 | y⋅⋅⋅3 | y⋅⋅⋅4 | y⋅⋅⋅⋅ |
Column Row | ψ1 | ψ2 | ψ3 | ψ4 | |
1 | 2 | 3 | 4 | ||
θ1 | 1 | Cβ= 11 | Bγ= 10 | Dδ= 14 | A∝= 8 |
θ2 | 2 | B∝= 8 | Cδ= 12 | Aγ= 10 | Dβ= 12 |
θ3 | 3 | Aδ= 9 | D∝= 11 | Bβ= missing value | Cγ= 15 |
θ4 | 4 | Dγ= 9 | Aβ= 8 | C∝= 18 | Bδ= 6 |
Column Row | ψ1 | ψ2 | ψ3 | ψ4 | ψ5 | |
1 | 2 | 3 | 4 | 5 | ||
θ1 | 1 | A∝= missing value | Bβ= 16 | Cγ= 19 | Dδ= 16 | Eε= 13 |
θ2 | 2 | Bγ= 18 | Cδ= 21 | Dε= 18 | E∝= 11 | Aβ= 21 |
θ3 | 3 | Cε= 20 | D∝= 12 | Eβ= 16 | Aγ= 25 | Bδ= 13 |
θ4 | 4 | Dβ= 15 | Eγ= 15 | Aδ= 22 | Bε= 14 | C∝= 17 |
θ5 | 5 | Eδ= 10 | Aε= 24 | B∝= 17 | Cβ= 17 | Dγ= 14 |
Column Row |
ψ1 | ψ2 | ψ3 | ψ4 | ψ5 | ψ6 | ψ7 | |
1 | 2 | 3 | 4 | 5 | 6 | 7 | ||
θ1 | 1 | A∝= 304 | Bε= 436 | Cβ= 350 | Dφ= 504 | Eχ= 417 | Fγ= 519 | Gδ= 432 |
θ2 | 2 | Bβ= 381 | Cφ= 505 | Dχ= 425 | Eγ= 564 | Fδ= 494 | G∝= 350 | Aε= 413 |
θ3 | 3 | Cχ= 432 | Dγ= 566 | Eδ= 479 | F∝= 357 | Gε= 461 | Aβ= 340 | Bφ= 502 |
θ4 | 4 | Dδ= 442 | E∝= 372 | Fε= 536 | Gβ= 366 | Aφ= 495 | Bχ= 425 | Cγ= 507 |
θ5 | 5 | Eε= 496 | Fβ= 449 | Gφ= 493 | Aχ= 345 | Bγ= 509 | Cδ= 481 | D∝= 380 |
θ6 | 6 | Fφ= 534 | Gχ= 421 | Aγ= 452 | Bδ= 427 | C∝= 346 | Dε= 478 | Eβ= 397 |
θ7 | 7 | Gγ= 543 | Aδ= 386 | B∝= 435 | Cε= 485 | Dβ= 406 | Eφ= 554 | Fχ= missing value |
Case Study | Exact-scheme analysis | Estimating missing data via least squares method | Bias |
1 | SSTr(unbias)_exact = 59.333 | SSTr(bias) = 63.50 | 4.167 |
2 | SSTr(unbias)_exact = 217.467 | SSTr(bias) = 282.80 | 65.333 |
3 | SSTr(unbias)_exact = 32, 704 | SSTr(bias) = 34, 620 | 1916 |
Source of variation | Sum of squares | Degrees of freedom | Mean square | F0 |
Latin letter treatment | 59.333 | 3 | 19.778 | 2.55 |
Greek letter | 2.833 | 3 | 0.944 | |
Rows | 6.500 | 3 | 2.167 | |
Columns | 30.833 | 3 | 10.278 | |
Error | 15.500 | 2 | 7.750 | |
Total | 136.933 | 14 |
Source of variation | Sum of squares | Degrees of freedom | Mean square | F0 |
Latin letter treatment | 217.467 | 4 | 54.367 | 9.81 |
Greek letter | 17.917 | 4 | 4.479 | |
Rows | 6.000 | 4 | 1.500 | |
Columns | 22.317 | 4 | 5.579 | |
Error | 38.800 | 7 | 5.543 | |
Total | 355.333 | 23 |
Source of variation | Sum of squares | Degrees of freedom | Mean square | F0 |
Latin letter treatment | 32, 704 | 6 | 5, 450.7 | 9.28 |
Greek letter | 155, 215 | 6 | 25, 869.1 | |
Rows | 7, 412 | 6 | 1, 235.4 | |
Columns | 1, 270 | 6 | 211.7 | |
Error | 13, 515 | 23 | 587.6 | |
Total | 213, 217 | 47 |
Design strategy | Reference |
RCBD | [17,18] |
LSED | [17,18] |
Greco-Latin square experimental design (GLSED) | [19] |
BIBD | [20,21] |
Youden design | [10] |
Split plot design | [13] |
Colum Row | ψ1 | ψ2 | ψ3 | ψ4 | yi⋅⋅⋅ | |
1 | 2 | 3 | 4 | |||
θ1 | 1 | A∝=y1111 | Bβ=y1222 | Cγ=y1333 | Dδ=y1444 | y1⋅⋅⋅ |
θ2 | 2 | Bδ=y2241 | Aγ=y2132 | Dβ=y2423 | C∝=y2314 | y2⋅⋅⋅ |
θ3 | 3 | Cβ=y3321 | D∝=y3412 | Aδ=y3143 | Bγ=y3234 | y3⋅⋅⋅ |
θ4 | 4 | Dγ=y4421 | Cδ=y4342 | B∝=y4213 | Aβ=y4124 | y4⋅⋅⋅ |
y⋅⋅⋅l | y⋅⋅⋅1 | y⋅⋅⋅2 | y⋅⋅⋅3 | y⋅⋅⋅4 | y⋅⋅⋅⋅ |
Column Row | ψ1 | ψ2 | ψ3 | ψ4 | |
1 | 2 | 3 | 4 | ||
θ1 | 1 | Cβ= 11 | Bγ= 10 | Dδ= 14 | A∝= 8 |
θ2 | 2 | B∝= 8 | Cδ= 12 | Aγ= 10 | Dβ= 12 |
θ3 | 3 | Aδ= 9 | D∝= 11 | Bβ= missing value | Cγ= 15 |
θ4 | 4 | Dγ= 9 | Aβ= 8 | C∝= 18 | Bδ= 6 |
Column Row | ψ1 | ψ2 | ψ3 | ψ4 | ψ5 | |
1 | 2 | 3 | 4 | 5 | ||
θ1 | 1 | A∝= missing value | Bβ= 16 | Cγ= 19 | Dδ= 16 | Eε= 13 |
θ2 | 2 | Bγ= 18 | Cδ= 21 | Dε= 18 | E∝= 11 | Aβ= 21 |
θ3 | 3 | Cε= 20 | D∝= 12 | Eβ= 16 | Aγ= 25 | Bδ= 13 |
θ4 | 4 | Dβ= 15 | Eγ= 15 | Aδ= 22 | Bε= 14 | C∝= 17 |
θ5 | 5 | Eδ= 10 | Aε= 24 | B∝= 17 | Cβ= 17 | Dγ= 14 |
Column Row |
ψ1 | ψ2 | ψ3 | ψ4 | ψ5 | ψ6 | ψ7 | |
1 | 2 | 3 | 4 | 5 | 6 | 7 | ||
θ1 | 1 | A∝= 304 | Bε= 436 | Cβ= 350 | Dφ= 504 | Eχ= 417 | Fγ= 519 | Gδ= 432 |
θ2 | 2 | Bβ= 381 | Cφ= 505 | Dχ= 425 | Eγ= 564 | Fδ= 494 | G∝= 350 | Aε= 413 |
θ3 | 3 | Cχ= 432 | Dγ= 566 | Eδ= 479 | F∝= 357 | Gε= 461 | Aβ= 340 | Bφ= 502 |
θ4 | 4 | Dδ= 442 | E∝= 372 | Fε= 536 | Gβ= 366 | Aφ= 495 | Bχ= 425 | Cγ= 507 |
θ5 | 5 | Eε= 496 | Fβ= 449 | Gφ= 493 | Aχ= 345 | Bγ= 509 | Cδ= 481 | D∝= 380 |
θ6 | 6 | Fφ= 534 | Gχ= 421 | Aγ= 452 | Bδ= 427 | C∝= 346 | Dε= 478 | Eβ= 397 |
θ7 | 7 | Gγ= 543 | Aδ= 386 | B∝= 435 | Cε= 485 | Dβ= 406 | Eφ= 554 | Fχ= missing value |
Case Study | Exact-scheme analysis | Estimating missing data via least squares method | Bias |
1 | SSTr(unbias)_exact = 59.333 | SSTr(bias) = 63.50 | 4.167 |
2 | SSTr(unbias)_exact = 217.467 | SSTr(bias) = 282.80 | 65.333 |
3 | SSTr(unbias)_exact = 32, 704 | SSTr(bias) = 34, 620 | 1916 |
Source of variation | Sum of squares | Degrees of freedom | Mean square | F0 |
Latin letter treatment | 59.333 | 3 | 19.778 | 2.55 |
Greek letter | 2.833 | 3 | 0.944 | |
Rows | 6.500 | 3 | 2.167 | |
Columns | 30.833 | 3 | 10.278 | |
Error | 15.500 | 2 | 7.750 | |
Total | 136.933 | 14 |
Source of variation | Sum of squares | Degrees of freedom | Mean square | F0 |
Latin letter treatment | 217.467 | 4 | 54.367 | 9.81 |
Greek letter | 17.917 | 4 | 4.479 | |
Rows | 6.000 | 4 | 1.500 | |
Columns | 22.317 | 4 | 5.579 | |
Error | 38.800 | 7 | 5.543 | |
Total | 355.333 | 23 |
Source of variation | Sum of squares | Degrees of freedom | Mean square | F0 |
Latin letter treatment | 32, 704 | 6 | 5, 450.7 | 9.28 |
Greek letter | 155, 215 | 6 | 25, 869.1 | |
Rows | 7, 412 | 6 | 1, 235.4 | |
Columns | 1, 270 | 6 | 211.7 | |
Error | 13, 515 | 23 | 587.6 | |
Total | 213, 217 | 47 |