This vignette gives a high level overview on how to use the R package LHD to generate different types of efficient Latin hypercube designs (LHDs) via different algorithms. In addition, popular optimality criteria as well as some handy and helpful functions will be introduced as well.
Load the library first.
The overview starts with basic functions. The first function to be introduced is the “rLHD”, which generates a random LHD with dimension \(n\) by \(k\). Suppose we want a random LHD with 6 rows and 3 columns, and denote it as X. We can run the following code.
set.seed(1)
X=rLHD(n=6,k=3);X
#>      [,1] [,2] [,3]
#> [1,]    1    3    5
#> [2,]    4    2    2
#> [3,]    3    6    6
#> [4,]    6    4    4
#> [5,]    2    1    3
#> [6,]    5    5    1If we want to know the inter-site distance between row \(i\) and row \(j\) of X (\(i \neq j\)), we could use function “dij”. For example, the inter-site distance of the 2nd and the 4th row of X can be got via the following code.
dij(X=X,i=2,j=4) #The default setting is the rectangular distance.
#> [1] 6
#If Euclidean distance is desired, run the following
dij(X=X,i=2,j=4,q=2)
#> [1] 3.464102The formula of “dij” is given by the following
\[\begin{equation} d(\boldsymbol{x_i}, \boldsymbol{x_j})= (\sum_{l=1}^{k}|x_{il}-x_{jl}|^q)^{1/q}, \end{equation}\]
where \(\boldsymbol{x_i}\) and \(\boldsymbol{x_j}\) denote two design points from LHD matrix X. Having all the inter-site distances calculated, we could further calculate the \(\phi_p\) criterion of X according to the formula from Jin, Chen and Sudjianto (2005).
\[\begin{equation} \phi_{p}= \bigg\{\sum_{i=1}^{n-1}\sum_{j=i+1}^{n}d(\boldsymbol{x_i}, \boldsymbol{x_j})^{-p} \bigg\} ^{1/p}. \end{equation}\]
The function “phi_p” provides calculation for above criterion. We can run the following code.
phi_p(X=X,p=15)  
#> [1] 0.2517586
#If Euclidean distance is desired, run the following
phi_p(X=X,p=15,q=2)  
#> [1] 0.4102696\(\phi_p\) criterion is popular for maximin distance LHDs. When orthogonal or nearly orthogonal LHDs are desired, average absolute correlation and maximum absolute correlation are widely used (Georgiou (2009)). Their formula are given by the following
\[\begin{equation} ave(|q|) = \frac{2 \sum_{i=1}^{k-1} \sum_{j=i+1}^{k}|q_{ij}|}{k(k-1)}, \, \, max(|q|)= max_{ij} |q_{ij}|, \end{equation}\]
where \(q_{ij}\) is the columnwise correlation. The function “AvgAbsCor” and “MaxAbsCor” provide calculations for above criteria. We can run the following code.
AvgAbsCor(X=X)  #The average absolute correlation of X
#> [1] 0.3714286
MaxAbsCor(X=X)  #The maximum absolute correlation of X
#> [1] 0.4285714Another optimality criterion to be introduced is the maximum projection criterion (Joseph, Gul and Ba (2015)), which focuses on the peojection property of the design matrix, and its formula is given by the following
\[\begin{equation} \psi = \Bigg\{ \frac{1}{{n \choose 2}} \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \frac{1}{\Pi_{l=1}^{k}(x_{il}-x_{jl})^2} \Bigg\}^{1/k}. \end{equation}\]
The function “MaxProCriterion” provides calculation for above criterion. We can run the following code.
Usually, a random LHD does not guarantee good space-filling property, orthogonality, nor full projection property. Morris and Mitchell (1995) proposed a version of the simulated annealing algorithm (SA) for generating maximin distance LHDs. Our function “SA” implements their algorithm, and we also added OC (optimality criterion) input argument in the “SA” function. Therefore, not only does “SA” generate maximin distance LHDs, but also it generates orthogonal and maximum projection LHDs. This is also true for the other algorithms in our package.
The function “exchange” makes “SA” workable since it is capable of switching two random elements from a given column of X. The following code shows examples of both “exchange” and “SA”.
#Suppose we want to exchange two random elements from the 1st column of X.
Xnew=exchange(X=X,j=1)
#Look and compare
X;Xnew
#>      [,1] [,2] [,3]
#> [1,]    1    3    5
#> [2,]    4    2    2
#> [3,]    3    6    6
#> [4,]    6    4    4
#> [5,]    2    1    3
#> [6,]    5    5    1
#>      [,1] [,2] [,3]
#> [1,]    2    3    5
#> [2,]    4    2    2
#> [3,]    3    6    6
#> [4,]    6    4    4
#> [5,]    1    1    3
#> [6,]    5    5    1
#The SA function with default setting
set.seed(1)
trySA=SA(n=6,k=3)
trySA
#>      [,1] [,2] [,3]
#> [1,]    6    2    3
#> [2,]    1    6    4
#> [3,]    5    4    6
#> [4,]    4    5    2
#> [5,]    2    3    1
#> [6,]    3    1    5
#The phi_p of trySA is much smaller than the phi_p of X, which was 0.2517586
phi_p(X=trySA,p=15) 
#> [1] 0.2046486
#Now we try SA function with a different optimality criterion
set.seed(1)
trySA2=SA(n=6,k=3,OC="AvgAbsCor")
AvgAbsCor(trySA2)      #The average absolute correlation is about 0.05
#> [1] 0.04761905Leary, Bhaskar and Keane (2003) modified the work of Morris and Mitchell (1995). They showed that orthogonal-array-based LHD (OALHD) with SA converges faster and generates better designs. Our function “OASA” implements their algorithm.
The function “OA2LHD” makes “OASA” workable since it is capable of transfering an Orthogonal Array (OA) into an LHD with corresponding size, based on the work of Tang (1993). The following code shows examples of both “OA2LHD” and “OASA”.
#create an OA(9,2,3,2)
OA=matrix(c(rep(1:3,each=3),rep(1:3,times=3)),ncol=2,nrow=9,byrow = F);OA
#>       [,1] [,2]
#>  [1,]    1    1
#>  [2,]    1    2
#>  [3,]    1    3
#>  [4,]    2    1
#>  [5,]    2    2
#>  [6,]    2    3
#>  [7,]    3    1
#>  [8,]    3    2
#>  [9,]    3    3
#Transfer the OA above into a 9 by 2 LHD
tryOA=OA2LHD(OA=OA)
tryOA
#>       [,1] [,2]
#>  [1,]    3    1
#>  [2,]    1    6
#>  [3,]    2    9
#>  [4,]    6    2
#>  [5,]    4    5
#>  [6,]    5    7
#>  [7,]    8    3
#>  [8,]    7    4
#>  [9,]    9    8
#The OASA function
set.seed(1)
tryOASA=OASA(OA=OA)
tryOASA
#>       [,1] [,2]
#>  [1,]    3    3
#>  [2,]    1    5
#>  [3,]    2    9
#>  [4,]    5    1
#>  [5,]    6    4
#>  [6,]    4    7
#>  [7,]    8    2
#>  [8,]    9    6
#>  [9,]    7    8
#The phi_p of tryOASA is smaller than the phi_p of SA 
phi_p(X=tryOASA,p=15); phi_p(X=SA(n=9,k=2),p=15) 
#> [1] 0.2899805
#> [1] 0.3356489
#Now we try OASA function with a different optimality criterion
tryOASA2=OASA(OA=OA,OC="AvgAbsCor")
AvgAbsCor(tryOASA2)      #The average absolute correlation is 0
#> [1] 0Joseph and Hung (2008) modified the work of Morris and Mitchell (1995) in another way that is able to reduce both \(\phi_{p}\) and column-wise correlations at the same time. Our function “SA2008” implements their algorithm. See the following code and example.
set.seed(1)
trySA2008_63=SA2008(n=6,k=3)
trySA2008_63
#>      [,1] [,2] [,3]
#> [1,]    4    1    5
#> [2,]    1    2    3
#> [3,]    3    6    2
#> [4,]    6    4    4
#> [5,]    2    5    6
#> [6,]    5    3    1
phi_p(X=trySA2008_63,p=15) 
#> [1] 0.203588
#Another example with different n and k
trySA2008_92=SA2008(n=9,k=2)
trySA2008_92
#>       [,1] [,2]
#>  [1,]    8    2
#>  [2,]    3    6
#>  [3,]    5    9
#>  [4,]    1    8
#>  [5,]    4    1
#>  [6,]    7    7
#>  [7,]    9    5
#>  [8,]    6    4
#>  [9,]    2    3
phi_p(X=trySA2008_92,p=15) 
#> [1] 0.2899805
#Now we try SA2008 function with a different optimality criterion
set.seed(1)
trySA2008=SA2008(n=6,k=3,OC="AvgAbsCor")
AvgAbsCor(trySA2008)      #The average absolute correlation is about 0.03
#> [1] 0.02857143Ba, Myers and Brenneman (2015) extended the idea of Sliced Latin Hypercube Designs (SLHD) from Qian (2012) and had their own modifications of SA from Morris and Mitchell (1995). They called their algorithm “improved two-stage algorithm”. Our function “SLHD” implements their algorithm. See the following code and example.
set.seed(1)
trySLHD_63F=SLHD(n=6,k=3,t=2)   #The default setting (stage2 is FALSE)
trySLHD_63F
#>      [,1] [,2] [,3]
#> [1,]    6    4    5
#> [2,]    1    2    4
#> [3,]    3    5    2
#> [4,]    4    3    3
#> [5,]    2    6    6
#> [6,]    5    1    1
phi_p(X=trySLHD_63F,p=15) 
#> [1] 0.2517316
#If the second stage is desired, run the following
trySLHD_63T=SLHD(n=6,k=3,t=2,stage2=TRUE)
trySLHD_63T
#>      [,1] [,2] [,3]
#> [1,]    5    1    2
#> [2,]    4    4    4
#> [3,]    2    6    5
#> [4,]    3    5    1
#> [5,]    1    2    3
#> [6,]    6    3    6
phi_p(X=trySLHD_63T,p=15)       #Result is improved (phi_p is smaller than above)
#> [1] 0.21651
#Now we try SLHD function with a different optimality criterion
set.seed(1)
trySLHD=SLHD(n=6,k=3,OC="AvgAbsCor",stage2=TRUE)
AvgAbsCor(trySLHD)      #The average absolute correlation is about 0.07
#> [1] 0.06666667Chen et al. (2013) followed the structure of Particle Swarm Optimization (PSO) framework and proposed a version of it for constructing maximin distance LHDs, and their algorithm is called LaPSO. Our function “LaPSO” implements their algorithm. See the following code and example.
set.seed(1)
tryLaPSO_63=LaPSO(n=6,k=3)
tryLaPSO_63
#>      [,1] [,2] [,3]
#> [1,]    1    5    4
#> [2,]    5    6    1
#> [3,]    6    2    3
#> [4,]    4    4    6
#> [5,]    3    3    2
#> [6,]    2    1    5
phi_p(X=tryLaPSO_63,p=15) 
#> [1] 0.2052545
#Another example with different n and k
tryLaPSO_92=LaPSO(n=9,k=2)
tryLaPSO_92
#>       [,1] [,2]
#>  [1,]    1    9
#>  [2,]    7    8
#>  [3,]    2    2
#>  [4,]    9    6
#>  [5,]    6    5
#>  [6,]    5    1
#>  [7,]    4    7
#>  [8,]    3    4
#>  [9,]    8    3
phi_p(X=tryLaPSO_92,p=15) 
#> [1] 0.3361717
#Now we try LaPSO function with a different optimality criterion
set.seed(1)
tryLaPSO=LaPSO(n=6,k=3,OC="AvgAbsCor")
AvgAbsCor(tryLaPSO)      #The average absolute correlation is about 0.1
#> [1] 0.1047619Liefvendahl and Stocki (2006) proposed a version of the genetic algorithm (GA) which implemented an elite strategy to focus on the global best directly for constructing maximin distance LHDs. Our function “GA” implements their algorithm. See the following code and example.
set.seed(1)
tryGA_63=GA(n=6,k=3)
tryGA_63
#>      [,1] [,2] [,3]
#> [1,]    2    5    6
#> [2,]    5    3    1
#> [3,]    3    6    2
#> [4,]    6    4    4
#> [5,]    4    1    5
#> [6,]    1    2    3
phi_p(X=tryGA_63,p=15) 
#> [1] 0.203588
#Another example with different n and k.
tryGA_92=GA(n=9,k=2)
tryGA_92
#>       [,1] [,2]
#>  [1,]    8    6
#>  [2,]    2    1
#>  [3,]    3    7
#>  [4,]    5    9
#>  [5,]    6    5
#>  [6,]    1    4
#>  [7,]    4    3
#>  [8,]    7    2
#>  [9,]    9    8
phi_p(X=tryGA_92,p=15) 
#> [1] 0.3501968
#Now we try GA function with a different optimality criterion
set.seed(1)
tryGA=GA(n=6,k=3,OC="AvgAbsCor")
AvgAbsCor(tryGA)      #The average absolute correlation is about 0.07
#> [1] 0.06666667