1.
Introduction
Computer experiments, as a widely used method in scientific research, simulate complex real-world problems through complex computer codes [1,2,3]. It is very important to plan computer experiments efficiently. Latin hypercube designs (LHDs) introduced by McKay et al. [4] are very suitable to plan computer experiments involving only quanlitative factors. Numerous methods have been proposed to construct LHDs with good properties, such as low-dimensional projection property, orthogonality, and other uniform criteria (such as uniform discrepancies, maximin distance, etc.). Computer experiments with both qualitative and quantitative factors have also received a lot of attention (see, for example, [5,6,7,8,9]). Sliced space-filling designs and sliced LHDs (SLHDs) are efficient choices when both quantitative and qualitative factors are included in computer experiments [10,11]. However, such two types of designs are inefficient due to the increase in the number of runs as the number of level combinations of the qualitative factors increases. To solve this problem, Deng et al. [12] first proposed marginally coupled design (MCD), which is more cost-effective in terms of the number of runs, and possesses excellent space-filling properties, i.e., in which the design for the quantitative factors is an LHD, and such quantitative factor design is also an SLHD with respect to each qualitative factor. Some researchers have worked on improving the low-dimensional stratification of the design for the quantitative factors in MCDs; see, among others, [13,14] and [15]. Other researchers have constructed orthogonal MCDs in which the designs for the quantitative factors are orthogonal [16]. In order to improve the stratification between qualitative and quantitative factors, Yang et al. [17] proposed doubly coupled design (DCD) which has the following attractive space-filling properties: (1) the whole design is an MCD, and (2) the design points for the quantitative factors form an SLHD with respect to the level combinations of any two qualitative factors. In the above improved MCDs and DCDs, the designs for the qualitative factors are all equal-level orthogonal arrays (OAs). However, there exist qualitative factors being mixed-level in real-world problems, and in MCDs the designs of the qualitative factors are often mixed-level OAs. In this paper, we aim to construct MCDs in which the designs for qualitative factors are mixed-level OAs.
For an MCD (D1,D2) where D1 and D2 are the designs for qualitative and quantitative factors, respectively, Deng et al. [12] investigated the existence and construction of an MCD for mixed-level qualitative factors. They gave the existence of an MCD (D1,D2) with D1 being an OA(n,sk11sk22,2), s1=βs2, in terms of the structure of D1. The existence is somewhat limited by the restriction s1=βs2. To overcome this limitation, we provide a necessary and sufficient condition on both D1 and D2 to ensure the existence of an MCD with D1 being an OA(n,sk11sk22,2), s1=βs2, or s1≠βs2. Given a small initial MCD with mixed-level qualitative factors, a large MCD with mixed-level qualitative factors can be constructed by Construction 3 of Deng et al. [12]. However, Deng et al. [12] did not address the question of how to construct the initial MCDs. Fortunately, the MCDs constructed in this paper can be used as initial MCDs for Construction 3 of [12]. Therefore, for the MCDs obtained in this paper, the run sizes are more flexible than for the MCDs constructed in Construction 3 of Deng et al. [12]. Based on the existence result of [12], for s1=βs2, we give an algorithm to construct MCDs for D2 with a large number of columns. By the necessary and sufficient condition in this paper, two algorithms are proposed to construct MCDs with D1 being an OA(n,2k1sk2,2), s=2β, or s≠2β. For the D2 constructed by Construction 3 of Deng et al. [12], the D2 only has stratification property in one-dimensional projections. To enhance the space-filling property of D2, we present two algorithms to construct MCDs with D2 possessing stratification properties in two-, three-, or four- dimensional projections.
The paper is organized as follows: Section 2 introduces the basic definitions and notation. Section 3 gives five methods for constructing MCDs with mixed-level qualitative factors. Section 4 provides the conclusions. All proofs are deferred to Appendix A. Some tables are listed in Appendix B.
2.
Definitions and notation
Let GF(s)={α0,α1,…,αs−1}, α0=0, α1=1, denote a Galois field of order s, which is simplified as GF(s)={0,1,…,s−1} if s is a prime. An n×p matrix is called a Latin hypercube design of n runs and p factors, denoted by LHD(n,p), if each of its columns is a random permutation of {0,1,…,n−1}. An n×k array A is said to be a mixed-level OA of strength 2, denoted by OA(n,s1k1s2k2,2), if any n×2 sub-array of A contains all possible level combinations with equal frequency, where the entries in the first k1 columns and the last k2 columns are taken from {0,1,…,s1−1} and {0,1,…,s2−1}, respectively. When s1=s2=s and k1+k2=k, the orthogonal array A is equal-level, denoted by OA(n,sk,2). An OA(st,sk,2) with v=(st−1)/(s−1) can be constructed using the Rao-Hamming construction, the details of which are described in Section 3.4 of [18]. For a prime power s, let η1 and η2 be two s-level independent columns of length s2, where the entries of both η1 and η2 are taken from GF(s)={α0,α1,…,αs−1}, α0=0, α1=1. We apply the Rao-Hamming construction to create an OA(s2,ss+1,2) Φ as
where the addition and multiplication operations are based on GF(s). An OA(n,sk11sk22,2) A is said to be a (β1 × β2) - resolvable OA, denoted by (β1 × β2)-ROA(n,sk11sk22,2), if for i=1,2, its rows can be divided into n/(βisi) sub-arrays A1,…,An/(βisi) of βisi rows each, where Ai is an OA(βisi,s1k1s2k2,1) for i=1,2. In particular, when s1 = s2 = s,β1 = β2 = β, and k1+k2=k, then the array reduces to β-ROA(n,sk,2). If β = 1, the array A is called a completely resolvable OA (CROA), denoted by CROA(n,sk,2).
Suppose D1 is an OA(n,sk11sk22,2) and D2 is an LHD(n,p), then the design D=(D1,D2) is called a MCD, denoted by MCD(n,sk11sk22,p), where D1 and D2 are sub-designs for qualitative factors and quantitative factors, respectively, if for every level of any factor of D1, the corresponding rows in D2 form a small LHD. When s1=s2=s and k1+k2=k, the MCD is denoted by MCD(n,sk,p).
Let 1s be an s - dimensional column vector whose entries are all ones. An u×r matrix A with entries from GF(s) is called a difference scheme of strength 2 based on GF(s), denoted by D(u,r,s), if for all i and k with 1≤i, k≤r, i≠k, the vector difference between the ith and kth columns contains each element of GF(s) exactly u/s times. Throughout, D(u,r,s) is a u-row, r-column, and s-level difference schme (of strength 2). For an n×m matrix X and an s×p matrix Y, their Kronecker sum and Kronecker product are defined as X⊕Y = (xij+Y) and X⊗Y = (xijY), respectively, where xij is the (i,j)th entry of X. For a matrix X=(xij)n×m, define an n×m matrix f(X,s) as
He et al. [13] demonstrated a necessary and sufficient condition for the design (D1,D2) being an MCD(n,sk,p), as stated in Lemma 1.
Lemma 1 ([13]). Suppose D1 is an OA(n,sk,2) and D2 is an LHD(n,p). Let di be the ith column of D2 for i=1,2,…,p. Then, D=(D1,D2) is an MCD(n,sk,p) if, and only if, for i=1,2,…,p, the (D1,f(di,s)) is an OA(n,sk(n/s),2), where f(∗,⋅) can be obtained from Equation (2.1).
Lemma 2 given by Deng et al. [12] presents a necessary and sufficient condition for the existence of an MCD(n,sk11sk22,p) with s1=βs2.
Lemma 2 ([12]). Given that D1 is an OA(n,sk11sk22,2) with s1=βs2, an MCD(n,sk11sk22,p) D=(D1,D2) exists if, and only if, D1 is a (1×β)-ROA(n,sk11sk22,p) that can be expressed as
such that (Ai1,Ai2) is an OA(s1,sk11sk22,1), where m=n/s1, and the Ai2 is a CROA(s1,sk22,2), for i=1,2,…,m.
The necessary and sufficient condition given by Lemma 2 is rather restricted by the restriction s1=βs2. Similar to Lemma 1, we provide directly a necessary and sufficient condition to break this restriction, as shown in the following lemma.
Lemma 3. Suppose D1=(Ω,Λ) is an OA(n,sk11sk22,2), where Ω and Λ are an OA(n,sk11,2) and an OA(n,sk22,2), respectively, and D2 is an LHD(n,p). Let di be the ith column of D2 for i=1,2…,p. Then, D = (D1,D2) is an MCD(n,sk11sk22,p) if, and only if, for i=1,2,…,p, (Ω,f(di,s1)) and (Λ,f(di,s2)) are an OA(n,sk11(n/s1),2) and an OA(n,sk22(n/s2),2), respectively, where f(∗,⋅) can be obtained from Equation (2.1).
3.
Construction of MCDs
3.1. Construction of MCDs for D1 being an OA(N,sk11sk22,2) with N=s21 (s1=βs2), N=2s (s≥2), N=2λs2 (s≥2)
This section presents three construction algorithms to construct MCDs. First, we construct an MCD D=(D1,D2) via an OA(s21,sk1+21,2) and a CROA(s1,sk22,2) with s1=βs2. Motivated by Lemma 2, we present the following algorithm.
Theorem 1. For D=(D1,D2) constructed by Algorithm 1, we have
(i) D1=(A,B∗) is an OA(s21,sk11sk22,2) with s1=βs2;
(ii) D2 is an LHD(s21,p);
(iii) D=(D1,D2) is an MCD(s21,sk11sk22,p).
If we take G in Step 1 of Algorithm 1 to be a regular saturated OA(s21,ss1+11,2), then D=(D1,D2) is an MCD(s21,ss1−11sk22,p); alternatively, the G can also be a non-regular s1-level OA. Hence, s1 may or may not be a prime power. Furthermore, if B is a CROA(s1,ss22,2) and s1=s22 in Step 2, then D=(D1,D2) is an MCD(s21,ss1−11ss22,p).
In Theorem 1, the MCDs with at most β!s1!s2! distinct quantitative columns can be constructed from Steps 4 and 5 of Algorithm 1. Thus, Theorem 1 provides the MCD with a large number of quantitative factors. Let η=β!s1!s2!, and there can be as many as (η!)/(p!(η−p)!) different MCDs. Thus, an optimal D2 under maximin distance criterion [19] (or the centered L2-discrepancy criterion [20,21]) can be found by ranking the (η!)/(p!(η−p)!) candidate MCDs or via the simulated annealing [22] or the threshold accepting algorithms [23] when the number of candidate MCDs is very large.
Example 1. Applying the Rao-Hamming construction to generate an OA(16,45,2) G, then the A is obtained and listed in Table 1. The CROA(4,22,2) B is obtained as B=(01010110)T, then B∗=(BT,BT,BT,BT)T. Thus, D1 is constructed as D1=(A,B∗). Consider the case p=3. In Step 4, μ1, μ2, and μ3 are obtained as μ1=(0,1,2,3)T, μ2=(1,0,2,3)T, μ3=(1,2,0,3)T, then E can be obtained and listed in Table 1. In Step 5, w1, w2, and w3 are obtained as w1=(0,1,2,3)T and w2=w3=(2,3,0,1)T, then C can be obtained and listed in Table 1. By the matrix operation of s1E+C in Step 6, D2 can be generated. It is easy to check that D=(D1,D2) is an MCD(16,4322,3), which is provided in Table 2.
For the MCD(s21,sk11sk22,p) constructed by Algorithm 1, the s1 can be a non-prime power. The following example provides an illustration of the non-prime power case in Algorithm 1.
Example 2. The OA(144,127,2) G listed in Table 17 of Appendix B and the CROA(12,22,2) B listed in Table 3 are obtained from the library of orthogonal arrays maintained by Sloane (http://neilsloane.com/oadir/index.html). Then, divide G into G=(l1,l2,A) listed in Table 17 of Appendix B. Moreover, B∗=(BT,BT,BT,BT,BT,BT)T is obtained and listed in Table 18. Thus, D1 is constructed as D1=(A,B∗). Consider the case p=3. In Step 4, μ1, μ2, and μ3 are obtained, which are listed in Table 3, and then the E can be obtained, which is listed in Table 17 of Appendix B. In Step 5, w1, w2, and w3 are obtained, which are listed in Table 3, then the C can be obtained and listed in Table 17 of Appendix B. By the matrix operation of s1E+C in Step 6, D2 can be generated. It is easy to check that D=(D1,D2) is an MCD(144,12522,3), which is provided in Table 18.
Algorithm 1 can produce some MCDs based on the above Theorem 1, as shown in Table 4.
To employ Algorithm 1, we need several CROAs. Theorem 3 of He et al. [13] gives four types of CROAs, as shown in Lemma 4.
Lemma 4 ([13]). For a prime h and three positive integers k, t (t≥2), and w, if s=hk, the following four CROAs exist: (i) CROA(st,sst−1,2); (ii) CROA(2st,s2st−1,2); (iii) CROA(4st,s4st−1,2); and (iv) CROA(hws2,shws,2).
According to Theorem 1 and Lemma 4, we can obtain a wealth of MCDs for mixed-level qualitative factors as follows.
Corollary 1. For a prime h and three positive integers k, t (t≥2), and w, let s=hk, then by Algorithm 1,
(i) if an OA(s2t,(st)k1+2,2) exists, an MCD(s2t,(st)k1(s)st−1,p) can be obtained;
(ii) if an OA(4s2t,(2st)k1+2,2) exists, an MCD(4s2t,(2st)k1(s)2st−1,p) can be obtained;
(iii) if an OA(16s2t,(4st)k1+2,2) exists, an MCD(16s2t,(4st)k1(s)4st−1,p) can be obtained;
(iv) if an OA(h2ws4,(hws2)k1+2,2) exists, an MCD(h2ws4,(hws2)k1(s)hws,p) can be obtained.
If there exists a small initial MCD for mixed-level qualitative factors, then a series of large MCDs for mixed-level qualitative factors can be constructed by Construction 3 of Deng et al. [12], as shown in Lemma 5.
Lemma 5 ([12]). Let D(0)1=(Φ,Ψ) and D(0)2 be an OA(n,sk11sk22,2) and an LHD(n,p), respectively, where Φ and Ψ are an OA(n,sk11,2) and an OA(n,sk22,2), respectively. For some u, there are two difference schemes D(u,r1,s1) and D(u,r2,s2) (of strength 2), denoted by D(i) for i=1,2, respectively. Let C=(cij) be an u×f matrix with cij=1 and H be an LHD(u,pf). Construct D1=(D(1)⊕Φ,D(2)⊕Ψ) and D2=C⊗D(0)2+nH⊗1n. If D(0)=(D(0)1,D(0)2) is an MCD, then D=(D1,D2) is also an MCD, where D1 and D2 are an OA(nu,sk1r11sk2r22,2) and an LHD(nu,pf), respectively.
The key to constructing MCDs, D=(D1,D2), using Lemma 5 is the existence of the initial MCD D(0)=(D(0)1,D(0)2). However, the construction method of D(0)=(D(0)1,D(0)2) is not mentioned in [12]. Excitingly, the MCDs obtained by Theorem 1 can be used as the initial MCDs. Based on Lemma 5, a large number of MCDs with more columns can be constructed from the initial MCDs obtained by Theorem 1 as follows.
Corollary 2. For D=(D1,D2) constructed by Algorithm 1 and Theorem 1, if there exist two difference schemes D(u,r1,s1) and D(u,r2,s2) (of strength 2) for some u, then for any integer f, an MCD(us21,(s1)k1r1(s2)k2r2,pf) can be obtained by Lemma 5.
Based on Algorithm 1, Theorem 1 and Corollary 2 can generate a series of MCDs with D1 being an OA(n,sk11sk22,2) with s1=βs2, but they can be criticized for the s1=βs2 restriction. However, when s1≠βs2, an MCD also exists, as in the following example.
Example 3. Given D1 is an OA(6,2131,2) and D2 is an LHD(6,6) as listed in Table 5, it is easy to verify that D=(D1,D2) is an MCD(6,2131,6) according to Lemma 3.
Obviously, the MCD(6,2131,6) listed in Table 5 cannot be constructed by Algorithm 1. Next, we propose a new algorithm for constructing MCDs(2s,21s1,s!).
Theorem 2. The design D=(D1,D2) constructed by Algorithm 2 is an MCD(2s,21s1,s!), where D1 is an OA(2s,21s1,2) and D2 is an LHD(2s,s!).
If p<s!, there can be as many as (s!)/(p!(s−p)!) different MCDs from Algorithm 2. Similar to Algorithm 1, an optimal D2 under the maximin distance criterion or the centered L2-discrepancy criterion can be obtained Hickernell [19,20]. Next, we provide an example to illustrate Algorithm 2 and Theorem 2.
Example 4. Let s=4, and an 8×2 matrix D1=(L1,L2) is obtained from Step 1, as shown in Table 6. For 1≤i≤24, D2=(d1,d2,⋯,d24) is constructed according to Step 2, as shown in Table 6. It is easy to verify that D=(D1,D2) is an MCD(8,2141,24) from Lemma 3, which is provided in Table 6.
Algorithm 2 can produce some MCDs based on the above Theorem 2, as shown in Table 7.
In Lemma 5, the MCDs constructed by Theorem 2 can also be used as the initial MCDs for Construction 3 of [12]. Based on Lemma 5, a large number of MCDs with D1 being an OA(2us,2r1sr2,2) can be obtained from the initial MCDs constructed by Theorem 2 as follows.
Corollary 3. For D=(D1,D2) constructed by Algorithm 2 and Theorem 2, if there exist two difference schemes D(u,r1,2) and D(u,r2,s) (of strength 2) for some u, then for any integer f, an MCD(2us,2r1sr2,pf) can be obtained by Lemma 5.
In the MCD (D1, D2) constructed by Algorithm 2 and Theorem 2, the D1 has only two columns. In order to construct D1 that can accommodate more qualitative factors, we present Algorithm 3 as follows.
Theorem 3. For D(0)1 and D(0)2 in Algorithm 3, if D(0)=(D(0)1,D(0)2) is an MCD(n,sm,p), then the design D=(D1,D2) constructed by Algorithm 3 is an MCD(2n,21sm,p), where D1 is an OA(2n,21sm,2), and D2 is an LHD(2n,p).
Remark 1. Note that Algorithm 2 and Algorithm 3 can construct MCDs with D1 being an OA(N,21sk,2), s=2β, or s≠2β, but the values of N in the two Algorithms are different. Algorithm 2 works for k=1 and N=2s, while Algorithm 3 works for k≥2 and N=2λs2, where λ is a positive integer. Thus, Algorithm 3 is able to construct MCDs with more columns in D1 than Algorithm 2, and Algorithm 2 is not a special case of Algorithm 3. For example, for s=3, Algorithm 2 constructs an MCD(6,2131,6), where D1 and D2 are an OA(6,2131,2) and an LHD(6,6), respectively, while Algorithm 3 constructs an MCD(18,2132,2), where D1 and D2 are an OA(18,2132,2) and an LHD(18,2), respectively. This shows that Algorithm 2 and Algorithm 3 cannot be replaced by each other.
Next, we provide an example to illustrate Algorithm 3 and Theorem 3.
Example 5. Table 8 gives an MCD(9,32,2) D(0)=(D(0)1,D(0)2), where D(0)1 is an OA(9,32,2) and D(0)2 is an LHD(9,2). Then, an 18×3 matrix D1=(L1,L2) is obtained by the operations L1=(0,1)T⊗1n and L2=12⊗D(0)1 in Step 2, as shown in Table 9. An 18×2 matrix D2 is obtained by the operations D2=((2D(0)2)T,((2n−1)1n−2D(0)2)T)T in Step 3, as shown in Table 9. It is easy to verify that D=(D1,D2) is an MCD(18,2132,2) from Lemma 3, which is provided in Table 9.
Algorithm 3 can produce some MCDs based on the above Theorem 3, as shown in Table 10.
Similar to Corollary 2 and Corollary 3, we can obtain the following Corollary 4 for the initial MCDs constructed by Algorithm 3 and Theorem 3.
Corollary 4. For D=(D1,D2) constructed by Algorithm 3 and Theorem 3, if there exist two difference schemes D(u,r1,2) and D(u,r2,s) (of strength 2) for some u, then for any integer f, an MCD(2un,2r1sr2m,pf) can be obtained by Lemma 5.
Table 11 presents some designs D1 for mixed-level qualitative factors in MCDs constructed via Algorithms 1, 2, and 3. In the fourth column of Table 11, the D1's are obtained by Construction 3 of Deng et al. [12] from the initial designs listed in the first three columns.
For the MCD(s21,sk11sk22,p) constructed by Algorithm 1, the relation s1=βs2 is indispensable. When s=2β, the MCD(s2,22ss−1,p), MCD(2s,21s1,s!), and MCD(2λs2,21sm,p) (λ≥1) can be constructed by Algorithms 1, 2, and 3, respectively. Clearly, the three MCDs have different numbers of run sizes. For s≠2β, the MCD(2s,21s1,s!) and MCD(2λs2,21sm,p) (λ≥1) can also be obtained using Algorithms 2 and 3, respectively. Algorithm 2 is not a special case of Algorithm 3 due to the different number of run sizes for the constructed MCDs.
3.2. Construction of MCDs for D1 being an OA(s21,sk11sk22,2) with s1=s22 and D2 with the better space-filling property
In the MCDs (D1,D2) constructed by the above three algorithms, the space-filling property of D2 is not considered. The space-filling property is very important for the quantitative factor design D2. In this section, we introduce another algorithm to construct MCDs D=(D1,D2) for D2 with the better space-filling property.
Theorem 4. For s1=s22, D1, and D2 obtained in Algorithm 4, we have
(i) D1 is an OA(s21,ss1−11ss22,2);
(ii) D2 is an LHD(s21,2k), where if s2 is odd, k=(s2+1)/2; if s2 is even, k=s2/2;
(iii) (D1,D2) is an MCD(s21,ss1−11ss22,2k);
(iv) any two distinct columns of D2 achieve s2×s2 grids stratification.
Theorem 4 (iv) tells us that D2 has two-dimensional projection property without considering D1. For each level of any factor in D1, and for each level combination of any two factors in some columns of D1, the corresponding rows in D2 can also achieve the two-dimensional space-filling property, as stated in the following corollary.
Corollary 5. For D=(D1,D2) (D1=(F0,U0,u1)) constructed by Algorithm 4 and Theorem 4, we have
(i) the rows in D2 corresponding to each level of any factor in D1 can achieve stratification on the s2×s2 grids in any two-dimensional projection;
(ii) the rows in D2 corresponding to each level combination of any two factors in (U0,u1) can achieve stratification on the s2×s2 grids in any two-dimensional projection.
Next, we provide an example to illustrate Algorithm 4 and Theorem 4.
Example 6. Consider the case s1=4 and s2=2. An OA(16,45,2) F and an OA(4,23,2) H are obtained from the Rao-Hamming construction. Divide F as F=(F0,f1,f2) listed in Table 12. For the H listed in Table 12, we obtain an 16×3 matrix U by replacing the levels 0,1,2,3 of the f1 with the 1st, 2nd, 3rd, and the 4th row of the H, respectively. Then partition U as U=(U0,u1,u2) listed in Table 12. In Step 3 and Step 4, H∗ is the first 2 columns of H and k=s2/2=1, after replacing the levels 0,1,2,3 of the f2 by the 1st, 2nd, 3rd, and the 4th row of the H∗, respectively. Then, V is obtained, and denote V as V=(v1,v2) listed in Table 12.
From Step 5, D1=(F0,U0,u1), and it is easy to check that D1 is an OA(16,4322,2). In Step 6, let W1=V, W2=(v2,v1), W3=(u2,u2), W4=(u1,u1), then by matrix operation of s32W1+s22W2+s2W3+W4, D2 can be generated. It is easy to verify that (D1,D2) is an MCD(16,4322,2), which is provided in Table 13.
Next, let D2=(d1,d2). It is easy to see that d1 and d2 achieve stratification on 2×2 grids, as shown in Figure 1.
Algorithm 4 can produce some MCDs based on the above Theorem 4, as shown in Table 14.
Next, we introduce the following algorithm to generate MCDs by modifying Algorithm 4, that is, we rearrange the columns of F in Algorithm 4 and apply the idea of Step 4 in Algorithm 4 twice.
Theorem 5. For s_{1} = s_{2}^{2} , D_{1} , and D_{2} obtained in Algorithm 5, we have
\rm(i) D_1 is an OA\left(s_{1}^{2}, s_{1}^{s_{1}-2}s_{2}^{s_2}, 2\right) ;
\rm(ii) D_2 is an LHD\left(s_{1}^{2}, 4k\right) , where if s_{2} is odd, k = (s_{2}+1)/2 ; if s_{2} is even, k = s_{2}/2 ;
\rm(iii) (D_1, D_2) is an MCD\left(s_{1}^{2}, s_{1}^{s_{1}-2}s_{2}^{s_2}, 4k\right) .
Theorem 6. For D_1 and D_2 constructed by Algorithm 5 and Theorem 5, D_2 can be partitioned into two disjoint groups of 2k columns, i.e., D_2 = (D_{21}, D_{22}) . For i = 1, 2, \ldots, 2k , let d_{1}^{i} and d_{2}^{i} be the i th columns of D_{21} and D_{22} , respectively. Then,
\rm(i) any two distinct columns of D_2 achieve s_2 \times s_2 grids stratification;
\rm(ii) any two columns from different groups, d_1^{j} and d_2^{j'} , achieve s_2^2 \times s_2^2 grids stratification, where j, j' = 1, 2, \ldots, k ;
\rm(iii) any three columns from two different groups, d_i^{j} , d_{i'}^{t} and d_{i'}^{h} , achieve s_2^2 \times s_2 \times s_2 grids stratification, where i, i' = 1, 2 , i \neq i' , j, t, h = 1, 2, \ldots, 2k , t \neq h ;
\rm(iv) any four columns from two different groups, d_i^{j} , d_i^{r} , d_{i'}^{t} d_{i'}^{h} , achieve s_2 \times s_2 \times s_2 \times s_2 grids stratification, where i, i' = 1, 2 , i\neq i^{'} , j, r, t, h = 1, 2, \ldots, 2k , , j\neq r , and t \neq h .
According to Theorem 6, there are 4k^2 two-column groups achieving stratifications on s_2^2 \times s_2^2 grids, 2k^2(2k-1) three-column groups achieving stratifications on s_2^2 \times s_2 \times s_2 grids, and k^2(2k-1)^2 four-column groups achieving stratifications on s_2 \times s_2 \times s_2 \times s_2 grids, respectively. Theorem 6 shows that a large number of columns in D_2 have good two-, three-, or four-dimensional projections. Next, we provide an example to illustrate Algorithm 5, Theorem 5, and Theorem 6.
Example 7. Consider the case s_1=9 and s_2=3. An an OA\left(9, 3^{4}, 2\right) H listed in Table 19 of Appendix B and an OA\left(81, 9^{10}, 2\right) F listed in Table 20 of Appendix B are obtained from the library of orthogonal arrays maintained by Sloane (http://neilsloane.com/oadir/index.html). Divide F as F=({F_0}^{*}, f_{0}, f_{1}, f_{2}) listed in Table 20 of Appendix B. For the H, obtain an 81\times 4 matrix U by replacing the levels 0, 1, \ldots, 8 of the f_{1} with the 1st, 2nd, 3rd, \ldots, the 9th row of the H according to Step 2 of Algorithm 4, respectively. Then, partition U as U=(U_{0}, u_{1}, u_{2}) listed in Table 20 of Appendix B. In Step 3 and Step 4 of Algorithm 4, due to s_2=3, let H^{*}=H and k=(s_{2}+1)/2 =2, after replacing the levels 0, 1, \ldots, 8 of the f_{2} by the 1st, 2nd, 3rd, \ldots, the 9th row of the H^{*}, respectively. Then, V is obtained, and denote V as V=(v_{1}, v_{2}, v_{3}, v_{4}) listed in Table 20 of Appendix B. From Step 3, D_1=({F_0}^{*}, U_{0}, u_{1}), and it is easy to check that D_{1} is an O A\left(81, 9^{7} 3^{3}, 2\right). In Step 4, obtain an 81\times 4 matrix Z by replacing the levels 0, 1, \ldots, 8 of the f_{0} with the 1st, 2nd, 3rd, \ldots, the 9th row of the H^{*}, respectively. Then, Z is obtained, and denote Z as Z=(z_{1}, z_{2}, z_{3}, z_{4}) listed in Table 20 of Appendix B. In Step 5, let W_1=V, W_2=(v_{2}, v_{1}, v_{4}, v_{3}), W_3=(u_{2}, u_{2}, u_{2}, u_{2}), W_4=(u_{1}, u_{1}, u_{1}, u_{1}) according to Step 6 of Algorithm 4 and let X_1=Z, X_2=(z_{2}, z_{1}, z_{4}, z_{3}), then by matrix operation of s_{2}^{3}W_{1}+s_{2}^{2}W_{2}+s_{2}W_{3}+W_{4} and s_{2}^{3}X_{1}+s_{2}^{2}X_{2}+s_{2}W_{3}+W_{4}, D_{21} and D_{22} can be generated, respectively. Then, D_2=(D_{21}, D_{22}). It is easy to verify that (D_{1}, D_{2}) is an MCD\left(81, 9^{7} 3^{3}, 8\right) listed in Table 21 of Appendix B. Next, let the first two columns of D_{21} be d_1 and d_2, and the first two columns of D_{22} be d_3, d_4. After collapsing the levels of d_1, d_2, d_3, d_4, it is easy to see that the d_1, d_2, d_3, d_4 satisfies the stratifications of (i) and (ii) in Theorem 6, as shown in Figure 2 and Figure 3.
Inspired by Corollary 5, Corollary 6 is given as follows.
Corollary 6. For D = \left(D_1, D_2\right) ( D_1 = ({F_0}^{*}, U_{0}, u_{1}) , D_2 = (D_{21}, D_{22}) ) constructed by Algorithm 5 and Theorem 5, we have
\rm(i) the rows in D_{2i} , i = 1, 2 , corresponding to each level of any factor in D_1 can achieve stratification on the s_2 \times s_2 grids in any two-dimensional projection;
\rm(ii) the rows in D_{2i} , i = 1, 2 , corresponding to each level combination of any two factors in (U_{0}, u_{1}) can achieve stratification on the s_2 \times s_2 grids in any two-dimensional projection.
Algorithm 5 can produce some MCDs based on the above Theorem 5, as shown in Table 15.
4.
Conclusions
Many researchers have constructed MCDs for equal-level qualitative factors. However, there has been less research on MCDs when the qualitative factors are mixed-level. Construction 3 of Deng et al. [12] generates large MCDs for mixed-level qualitative factors from small initial MCDs for mixed-level qualitative factors. Obviously, such a construction is not valid when the initial MCD does not exist. The key to Construction 3 of Deng et al. [12] is how to obtain a small initial MCD. However, they did not answer the question. Fortunately, the constructed MCDs in this paper can be considered as the initial MCDs for Construction 3 of [12].
In this paper, we propose five algorithms to construct MCDs where the designs for the qualitative factors are mixed-level. The construction of the first algorithm is characterized by the fact that it is based on an OA\left(s_1^2, s_1^{k_1+2}, 2\right) and a CROA\left(s_1, s_2^{k_2}, 2\right) with s_1 = \beta s_2 . Clearly, its constructed MCD is limited by s_{1} = \beta s_{2} . To break this limitation, Algorithms 2 and 3 employ a mirror-symmetric structure to construct D_2 . Moreover, the D_1 constructed by Algorithm 3 can accommodate more columns than the one constructed by Algorithm 2, and the two algorithms construct different numbers of run sizes. The fourth and fifth algorithms construct the MCD using the level replacement method and the rotation method, where D_2 has stratification in two- or higher-dimensional projection. Finally, Table 16 lists some types and features of MCDs that can be constructed using our five algorithms. Obviously, compared to the MCDs constructed by Construction 3 of Deng et al. [12], our constructed MCDs have more flexible run sizes, and the more flexible fixed level D_1 - D_1 is an OA\left(n, s_{1}^{k_{1}} s_{2}^{k_{2}}, 2\right) , s_{1} = \beta s_{2} , or s_{1} \neq \beta s_{2} . Moreover, in contrast to Construction 3 of Deng et al. [12], which does not consider the space-filling property of D_2 , Algorithm 4 and Algorithm 5 construct D_2 with the space-filling property.
For future work, a direction is to introduce methods that can produce MCDs with three or more mixed-level qualitative factors, which deserves further investigation.
Author contributions
Weiping Zhou: Algorithm, methodology, validation, investigation, resources, data curation, writing—original draft preparation, writing—review and editing; Wan He: Algorithm, software, validation, writing—original draft preparation; Wei Wang: Methodology, writing—review and editing, visualization, supervision, project administration; Shigui Huang: Software, writing—original draft preparation, writing—review and editing. All authors have read and agreed to the published version of the manuscript.
Acknowledgments
This work was supported by the Special Fund for Scientific and Technological Bases and Talents of Guangxi (Grant No. Guike AD21075008), the Guangxi Young Teachers Basic Ability Improvement Project (Nos. 2021KY0203), and Science and Technology Project of Guangxi (Grant No. Guike AD23023002).
Conflict of interest
The authors declare no conflict of interest in this paper.
Appendix A.
PROOFS
Proof of Lemma {\rm3}. From the definition of an MCD, it is clear that D = \left(D_1, D_2\right) is an MCD\left(n, {s_1}^{k_1} {s_2}^{k_2}, p\right) if, and only if, (\Omega, D_2) and (\Lambda, D_2) are an MCD\left(n, {s_1}^{k_1}, p\right) and an MCD\left(n, {s_2}^{k_2}, p\right) , respectively. Let d_i be the i th column of D_2 , for i = 1, \ldots, p . From Lemma 1, we have (i) (\Omega, D_2) is an MCD\left(n, {s_1}^{k_1}, p\right) if, and only if, \left(\Omega, f\left(d_i, s_1\right)\right) is an OA\left(n, {s_1}^{k_1}(n/{s_1}), 2\right) ; (ii) \left(\Lambda, D_2\right) is an MCD\left(n, {s_2}^{k_2}, p\right) if, and only if, \left(\Lambda, f\left(d_i, s_2\right)\right) is an OA\left(n, {s_2}^{k_2}(n/{s_2}), 2\right) for i = 1, \ldots, p . □
Proof of Theorem {\rm1}. (i) In the design \left(l_2, A, B^*\right) , the levels 0 , 1 , \ldots, s_1-1 of the l_2 correspond to the 1 st, 2 nd, \ldots, s_1 th rows of the B , respectively, where B^{*} = 1_{s_{1}} \otimes B . Thus, D_1 = \left(A, B^*\right) is an OA\left(s_1^2, s_1^{k_1} s_2^{k_2}, 2\right) with s_1 = \beta s_2 .
(ii) From Steps 4 and 5 of Algorithm 1, it is clear that \left(e_i, c_i\right) is an OA\left(s_1^2, s_1^{2}, 2\right) for i = 1, 2, \ldots, p , where s_1 = \beta s_2 . Thus, D_2 is an {LHD}\left(s_1^2, p\right) from Step 6 of Algorithm 1.
(iii) Let a , b , and d be any columns of A , B^* , and D_2 , respectively. From Steps 3, 4, 5, and 6, let e and c be the columns corresponding to d in E and C , respectively. From Step 6 and Equation (2.1), \left(a, f\left(D_2, s \right)\right) = \left(a, e\right) , thus \left(a, f\left(D_2, s_1 \right)\right) is an OA\left(s_1^2, s_1^{2}, 2\right) . From Steps 4, 5, and 6 of Algorithm 1, it is clear that f\left(D_2, s_2 \right) = \beta e + c^{*} , where c^{*} = 1_{s_{1}} \otimes w , w is a random permutation of \left((c_{i, 1}^{*})^{T}, (c_{i, 2}^{*})^{T}, \cdots, (c_{i, \beta }^{*})^{T}\right)^{T} with c_{i, j}^{*} = (j-1) \textbf{1}_{s_{2}} . Since \left(b, e, c^{*} \right) is an OA\left(s_1^2, s_2^{1}s_1^{1}\beta ^{1}, 3\right) , \left(b, f\left(D_2, s_2 \right)\right) is an OA\left(s_1^2, s_2^{1}(\beta s_1)^{1}, 2\right) . Thus, D = \left(D_1, D_2\right) is an MCD\left(s_1^2, s_1^{k_1}s_2^{k_2}, p\right) from Lemma 3. □
Proof of Theorem {\rm2}. From Steps 1 and 2 of Algorithm 2, it is easy to check that D_{1} is an OA\left(2s, 2^{1}s^{1}, 2\right) and D_{2} is an LHD\left(2s, s!\right) . By Step 2, we can see that d_i is the i th column of D_2 for i = 1, 2, \ldots, s! . Since f\left(d_i, 2\right) = \left((u_{i})^{T}, \left((s-1)\textbf{1}_{s}- u_{i} \right)^{T} \right)^{T} , (L_1, f(d_i, 2)) is an OA\left(2s, 2^{1}s^{1}, 2\right) , where u_{i} is a random permutation of (0, 1, 2, \dots, s-1)^{T} , i = 1, 2, \ldots, s! . For 1\le i\le s! , let \xi_{1} = 2u_{i} and \xi_{2} = (2s-1)\textbf{1}_{s}- 2u_{i} , then
where e = (0, 1, \dots, s-1)^{T} . Obviously, the elements of f(\xi_{1}, s) and f(\xi_{2}, s) are all taken from \{0, 1\} . Since f(\xi_{2}, s) = \textbf{1}_{s}-f(\xi_{1}, s) , (L_2, f(d_i, s)) is an OA\left(2s, s^{1}2^{1}, 2\right) , i = 1, 2, \ldots, s! . From Lemma 3, the design D = (D_{1}, D_{2}) is an MCD\left(2s, 2^{1}s^{1}, s!\right) □
Proof of Theorem {\rm3}. The proof of Theorem 3 is similar to that of Theorem 2 and is therefore omitted here. □
Proof of Theorem {\rm4}. For i = 1, 2, \ldots, s_{1}-1 , j = 1, 2, \ldots, s_{2}-1 , let f_{0i} and u_{0j} be the i th and j th columns of F_{0} and U_{0} , respectively.
(i) Since F = (F_{0}, f_{1}, f_{2}) is an OA\left(s_{1}^{2}, s_{1}^{s_{1}+1}, 2\right) , U = (U_{0}, u_{1}, u_{2}) is an OA\left(s_{1}^{2}, s_{2}^{s_{2}+1}, 2\right) , (f_{0i}, u_{0j}) is an OA\left(s_{1}^{2}, s_{1}^{1}s_{2}^{1}, 2\right) , and (f_{0i}, u_{1}) is an OA\left(s_{1}^{2}, s_{1}^{1}s_{2}^{1}, 2\right) , i = 1, 2, \ldots, s_{1}-1 , j = 1, 2, \ldots, s_{2}-1 , thus D_1 is an OA\left(s_{1}^{2}, s_{1}^{s_{1}-1}s_{2}^{s_2}, 2\right) .
(ii) According to Proposition 1 of [24], we can obtain that (v_{i}, v_{j}, u_{1}, u_{2}) is an OA\left(s_{1}^{2}, s_{2}^{4}, 4\right) , where s_{1} = s_{2}^{2} , i\neq j , i, j = 1, 2, \ldots, 2k . Thus, D_2 is an LHD\left(s_{1}^{2}, 2k\right) , where if s_{2} is odd, k = (s_{2}+1)/2 ; if s_{2} is even, k = s_{2}/2 .
(iii) For h = 1, 2, \ldots, k , let d_{2h-1} and d_{2h} be the (2h-1) th and 2h th columns of D_{2} , respectively, then d_{2h-1} = s_{2}^{3}v_{2h-1}+s_{2}^{2}v_{2h}+s_{2}u_{2}+u_{1} and d_{2h} = s_{2}^{3}v_{2h}+s_{2}^{2}v_{2h-1}+s_{2}u_{2}+u_{1} . Obviously, for i = 1, 2, \ldots, s_{1}-1 , j = 1, 2, \ldots, s_{2}-1 , h = 1, 2, \ldots, k , (f_{0i}, f(d_{2h-1}, s_{1})) = (f_{0i}, s_{2}v_{2h-1}+v_{2h}) , (f_{0i}, f(d_{2h}, s_{1})) = (f_{0i}, s_{2}v_{2h}+v_{2h-1}) , (u_{0j}, f(d_{2h-1}, s_{1})) = (u_{0j}, s_{2}^{2}v_{2h-1}+s_{2}v_{2h}+u_{2}) , (u_{0j}, f(d_{2h}, s_{1})) = (u_{0j}, s_{2}^{2}v_{2h}+s_{2}v_{2h-1}+u_{2}) , (u_{1}, f(d_{2h-1}, s_{1})) = (u_{1}, s_{2}^{2}v_{2h-1}+s_{2}v_{2h}+u_{2}) , and (u_{1}, f(d_{2h}, s_{1})) = (u_{0j}, s_{2}^{2}v_{2h}+s_{2}v_{2h-1}+u_{2}) , where s_{1} = s_{2}^{2} . According to Proposition 1 of [24], for s_{1} = s_{2}^{2} , it is easy to obtain that (f_{0i}, v_{2h-1}, v_{2h}) is an OA\left(s_{1}^{2}, s_{1}^{1}s_{2}^{2}, 3\right) , and both (u_{1}, u_{2}, v_{2h-1}, v_{2h}) and (u_{2}, u_{0j}, v_{2h-1}, v_{2h}) are OA\left(s_{2}^{4}, s_{2}^{4}, 4\right) 's, where i = 1, 2, \ldots, s_{1}-1 , j = 1, 2, \ldots, s_{2}-1 , h = 1, 2, \ldots, k . Therefore, both (f_{0i}, f(d_{2h-1}, s_{1})) and (f_{0i}, f(d_{2h}, s_{1})) are OA\left(s_{1}^{2}, s_{1}^{2}, 2\right) 's, and (u_{0j}, f(d_{2h-1}, s_{2})) , (u_{0j}, f(d_{2h}, s_{2})) , (u_{1}, f(d_{2h-1}, s_{2})) , and (u_{1}, f(d_{2h}, s_{2})) are all OA\left(s_{2}^{4}, s_{2}^{1}(s_{2}^{3})^{1}, 2\right) 's, where s_{1} = s_{2}^{2} , i = 1, 2, \ldots, s_{1}-1 , j = 1, 2, \ldots, s_{2}-1 , h = 1, 2, \ldots, k . From Lemma 3, the design (D_1, D_2) is an MCD\left(s_{1}^{2}, s_{1}^{s_{1}-1}s_{2}^{s_2}, 2k\right) .
(iv) Since f\left (D_2, {s_2}^3 \right) = W_1 and W_1 is an OA\left(s_{1}^{2}, s_{2}^{2k}, 2\right) with s_1 = {s_2}^2 , thus any two distinct columns of D_2 achieve s_2 \times s_2 grids stratification. □
Proof of Theorem {\rm5}. The proof of Theorem 5 is similar to that of Theorem 4 and is therefore omitted here. □
Proof of Theorem {\rm6}. (i) Since f(D_{2}, s_{2}^{3}) = (W_{1}, X_{1}) and (W_{1}, X_{1}) is an OA\left(s_{1}^{2}, s_{2}^{4k}, 2\right) , thus Theorem 6 (i) is true.
(ii) For j, j' = 1, 2, \ldots, 2k , it is easy to see that f(d_1^{j}, s_{2}^{2}) = s_{2}v_{2j-1}+v_{2j} or f(d_1^{j}, s_{2}^{2}) = s_{2}v_{2j}+v_{2j-1} , and f(d_2^{j'}, s_{2}^{2}) = s_{2}z_{2j'-1}+z_{2j'} or f(d_2^{j'}, s_{2}^{2}) = s_{2}z_{2j'}+z_{2j'-1} . According to Proposition 1 of [24], it is easy to obtain that (v_{2j-1}, v_{2j}, z_{2j'-1}, z_{2j'}) is an OA\left(s_{2}^{4}, s_{2}^{4}, 4\right) . Thus, Theorem 6 (ii) is true.
(iii-iv) From Proposition 1 of [24], it is known that any two columns of V in Algorithm 4 and any two columns of Z in Algorithm 5 form an OA\left(s_{1}^{4}, s_{2}^{4}, 4\right) with s_1 = {s_2}^2 . Similar to the proof of (ii), thus (iii) and (iv) are true. □
Appendix B.
Tables