
Chapter 4

Probabilistic analysis of soil-water characteristic curve using limited data

as vG parameters α and n. N SWCCs are therefore equivalent to N pairs of (α, n), and probabilistic characterization of a SWCC (a curve) in this study means evolving the joint distribution of α and n. It should be noted that in Chapter 3, Eq. 4.1 was not used with the constraint m = 1 − 1/n, i.e. probabilistic characterization of the SWCC in Chapter 3 meant evolving the joint distribution of three parameters, α, n and m. In Chapter 3, the best form of the vG constraint (independent m) was determined on the basis of AIC and BIC over the bentonite database. This chapter describes a general methodology intended to be applicable to various soil textures ranging from silt and loam to clay; therefore the most popular form of the vG model, with m = 1 − 1/n, was chosen. This is also in line with other prominent probabilistic studies (e.g. Carsel and Parrish, 1988; Phoon et al., 2010) which use the constraint m = 1 − 1/n.
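Eq. 4.1 is not repeated in this excerpt; as a minimal sketch, assuming the standard van Genuchten form Se = [1 + (αψ)^n]^(−m) with m = 1 − 1/n (ψ is matric suction, Se the effective saturation), the constrained SWCC can be evaluated as below. The suction range and the example parameter values (the loam means from Table 4.1) are illustrative only.

```python
import numpy as np

def vg_swcc(psi, alpha, n):
    """van Genuchten (1980) effective saturation with the constraint m = 1 - 1/n.
    psi   : matric suction (units consistent with alpha, e.g. kPa if alpha is in 1/kPa)
    alpha : vG parameter related to the inverse of the air-entry suction
    n     : vG pore-size distribution parameter (n > 1)
    """
    m = 1.0 - 1.0 / n
    return (1.0 + (alpha * psi) ** n) ** (-m)

# Illustrative example: mean loam parameters from Table 4.1 (alpha = 0.52 1/kPa, n = 1.51)
psi = np.logspace(-1, 5, 50)            # suction from 0.1 kPa to 100 MPa
Se = vg_swcc(psi, alpha=0.52, n=1.51)   # effective saturation, one deterministic SWCC
```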

For carrying out site-specific unsaturated RBD, the joint PDF of the vG parameters, f(α, n), is required.

The same, after observing N site-specific data, i.e. N pairs of (α, n), can be expressed using the theorem of total probability (Ang and Tang, 2007, p. 59, Eq. 2.19) as follows:

f(α, n|data) = ∫···∫_W f(α, n|W) f(W|data) dW     (4.2)

where f(α, n|W) is the joint PDF of α, n given its distribution parameters W = [Wα, Wn, Wα,n] and can be expressed using copula theory (Prakash et al., 2018b) as follows:

f(α, n|W) = f(α|Wα) f(n|Wn) D(u, v|Wα,n)     (4.3)

where f(α|Wα) and f(n|Wn) are the marginal univariate distributions of α and n respectively and can be expressed using the three-parameter log-normal distribution (Phoon et al., 2010; Prakash et al., 2018b) as follows:

fX(x|λ, ζ) = [1/((x − c) √(2π) ζ)] exp[ −(1/2) ((ln(x − c) − λ)/ζ)² ],   x > c     (4.4)

where WX = [λ, ζ] are the lognormal distribution parameters and c is the physical lower bound for the random variable X. The above-mentioned three-parameter lognormal distribution is the usual two-parameter log-normal distribution with its support bounded from the left at x = c. This is necessary as the SWCC parameter n has a theoretical lower bound c = 1, i.e. n > 1. Ignoring this aspect can result in unrealistic SWCCs during simulation (Phoon et al., 2010; Prakash et al., 2018b). For the SWCC parameter α, c = 0, i.e. α > 0, which leads to the conventional two-parameter log-normal distribution. The parameters WX for both α and n can be estimated from the population mean µ and standard deviation σ as follows:

λ = ln(µ − c) − ζ²/2     (4.5)

ζ² = ln[1 + σ²/(µ − c)²]     (4.6)
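As a minimal sketch of Eqs. 4.4-4.6, the snippet below converts an assumed population mean and standard deviation into the lognormal parameters (λ, ζ) and evaluates the three-parameter lognormal PDF; the function names and the fly-ash example values (taken from Table 4.1) are illustrative only.

```python
import numpy as np

def lognormal_params(mu, sigma, c=0.0):
    """Eqs. 4.5-4.6: lognormal parameters (lam, zeta) from mean mu, std dev sigma and lower bound c."""
    zeta2 = np.log(1.0 + sigma ** 2 / (mu - c) ** 2)
    lam = np.log(mu - c) - zeta2 / 2.0
    return lam, np.sqrt(zeta2)

def lognormal_pdf(x, lam, zeta, c=0.0):
    """Eq. 4.4: three-parameter lognormal PDF, defined for x > c."""
    x = np.atleast_1d(np.asarray(x, dtype=float))
    pdf = np.zeros_like(x)                      # zero density at or below the lower bound c
    valid = x > c
    z = (np.log(x[valid] - c) - lam) / zeta
    pdf[valid] = np.exp(-0.5 * z ** 2) / ((x[valid] - c) * np.sqrt(2.0 * np.pi) * zeta)
    return pdf

# Illustrative example: parameter n of the fly-ash database (Table 4.1), mean 2.16, COV 26.57 %, c = 1
lam_n, zeta_n = lognormal_params(mu=2.16, sigma=0.2657 * 2.16, c=1.0)
```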

D(u, v|Wα,n) in Eq. 4.3 is the copula density function accounting for the dependence between α and n and can be expressed using the Gaussian copula (Prakash et al., 2018b) as follows:

D(u, v|Wα,n) = [1/√(1 − θ²)] exp[ −((ξ₁θ)² + (ξ₂θ)² − 2θξ₁ξ₂) / (2(1 − θ²)) ]     (4.7)

where ξ₁ = Φ⁻¹(u) and ξ₂ = Φ⁻¹(v), and Φ is the standard normal cumulative distribution function (CDF). u = F(α) and v = F(n) can be obtained using the lognormal CDF transformation as follows:

F(x) = Φ[(ln(x − c) − λ)/ζ]     (4.8)

For the Gaussian copula, the dependence parameter Wα,n = θ ∈ [−1, 1] is related to the rank correlation coefficient, Kendall's τ, as follows:

τ = (2/π) arcsin(θ)     (4.9)
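Eqs. 4.7-4.9 translate directly into code; the sketch below (function names introduced here for illustration, using scipy.stats.norm for Φ and Φ⁻¹) evaluates the Gaussian copula density and recovers θ from an observed Kendall's τ.

```python
import numpy as np
from scipy.stats import norm

def gaussian_copula_density(u, v, theta):
    """Eq. 4.7: bivariate Gaussian copula density at (u, v) = (F(alpha), F(n))."""
    xi1, xi2 = norm.ppf(u), norm.ppf(v)      # map to standard normal space
    num = (xi1 * theta) ** 2 + (xi2 * theta) ** 2 - 2.0 * theta * xi1 * xi2
    return np.exp(-num / (2.0 * (1.0 - theta ** 2))) / np.sqrt(1.0 - theta ** 2)

def theta_from_tau(tau):
    """Eq. 4.9 inverted: Gaussian copula parameter theta from Kendall's tau."""
    return np.sin(np.pi * tau / 2.0)

# Illustrative example: the bentonite database has tau = -0.71 (Table 4.1)
theta = theta_from_tau(-0.71)   # approximately -0.90
```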

The f(W|data) term in Eq. 4.2 can be obtained using Bayes' rule (Ang and Tang, 2007, p. 63, Eq. 2.20) as follows:

f(W|data) = P(data|W) P(W) / ∫···∫_W P(data|W) P(W) dW     (4.10)

where P(data|W) represents the likelihood of the obtained SWCC data given the joint distribution parameters W. Assuming that the N site-specific SWCCs, i.e. N pairs of (α, n), are independent realizations from the joint PDF f(α, n|W), P(data|W) can be expressed using Eq. 4.3 as follows:

P(data|W) = ∏_{i=1}^N f(dataᵢ|W) = ∏_{i=1}^N f(αᵢ|Wα) f(nᵢ|Wn) D(uᵢ, vᵢ|Wα,n)     (4.11)

The P(W) term in Eq. 4.10 represents the prior knowledge about the parameters W. Assuming that all the components of W are independent, the joint PDF P(W) can be expressed as a product of marginal univariate distributions as follows:

P(W) = P(Wα) P(Wn) P(Wα,n)     (4.12)

Adopting uniform marginal distributions, each component in Eq. 4.12 can be expressed as:

P(Wα) = (λα,max − λα,min)⁻¹ × (ζα,max − ζα,min)⁻¹ ;   λα ∈ [λα,min, λα,max], ζα ∈ [ζα,min, ζα,max]
P(Wα) = 0 ;   otherwise
P(Wn) = (λn,max − λn,min)⁻¹ × (ζn,max − ζn,min)⁻¹ ;   λn ∈ [λn,min, λn,max], ζn ∈ [ζn,min, ζn,max]
P(Wn) = 0 ;   otherwise
P(Wα,n) = (θmax − θmin)⁻¹ ;   θ ∈ [θmin, θmax]
P(Wα,n) = 0 ;   θ ∉ [θmin, θmax]
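As an illustration of how the likelihood of Eq. 4.11 and the uniform prior of Eq. 4.12 can be evaluated in practice, the sketch below reuses lognormal_pdf and gaussian_copula_density from the earlier snippets; the function names and the handling of out-of-bound parameters are assumptions of this sketch, not prescriptions of the framework.

```python
import numpy as np
from scipy.stats import norm

def lognormal_cdf(x, lam, zeta, c=0.0):
    """Eq. 4.8: CDF transform giving u = F(alpha) or v = F(n)."""
    return norm.cdf((np.log(np.asarray(x, dtype=float) - c) - lam) / zeta)

def log_likelihood(W, alpha_data, n_data):
    """Eq. 4.11: log P(data|W) for N observed (alpha, n) pairs, W = [lam_a, zeta_a, lam_n, zeta_n, theta]."""
    lam_a, zeta_a, lam_n, zeta_n, theta = W
    u = lognormal_cdf(alpha_data, lam_a, zeta_a, c=0.0)   # u_i = F(alpha_i)
    v = lognormal_cdf(n_data, lam_n, zeta_n, c=1.0)       # v_i = F(n_i), lower bound c = 1 for n
    return (np.sum(np.log(lognormal_pdf(alpha_data, lam_a, zeta_a, c=0.0)))
            + np.sum(np.log(lognormal_pdf(n_data, lam_n, zeta_n, c=1.0)))
            + np.sum(np.log(gaussian_copula_density(u, v, theta))))

def log_prior(W, bounds):
    """Eq. 4.12 with uniform marginals: a constant inside the box defined by `bounds`, -inf outside."""
    inside = all(lo < w < hi for w, (lo, hi) in zip(W, bounds))
    return 0.0 if inside else -np.inf   # the constant (box volume)^-1 cancels in the M-H ratio
```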


Figure 4.1: van Genuchten (1980) SWCC parameters α and n for the databases of (a) loam (Nemes et al., 2001), (b) fly ash (Prakash et al., 2018b) and (c) bentonite (from Chapter 3) used in this study.

The denominator term ∫···∫_W P(data|W) P(W) dW in Eq. 4.10 is a normalizing term required to make the posterior distribution of parameters f(W|data) a valid PDF. It will be denoted as K hereafter, and it will be shown that calculation of this term is not required in the proposed framework. Substituting for the terms in Eq. 4.2, the joint PDF of the SWCC parameters α, n given the observed data can be expressed as follows:

f(α, n|data) = K⁻¹ ∫···∫_W f(α|Wα) × f(n|Wn) × D(u, v|Wα,n)
                           × ∏_{i=1}^N f(αᵢ|Wα) × f(nᵢ|Wn) × D(uᵢ, vᵢ|Wα,n)
                           × P(Wα) × P(Wn) × P(Wα,n) dW     (4.13)

where K = ∫···∫_W P(data|W) P(W) dW is the normalizing term in the denominator of Eq. 4.10, and W = [Wα, Wn, Wα,n] = [λα, ζα, λn, ζn, θ] is the distribution parameter space.

Eq. 4.13 is a five-dimensional integral over the parameter space (λα, λn, ζα, ζn, θ) and, after substituting for all the components in Eq. 4.13, it is difficult to directly use this PDF or to compute its statistical moments. This is not an uncommon feature of the Bayesian approach. This difficulty can be circumvented by a general class of acceptance-rejection sampling algorithms called Markov Chain Monte Carlo (MCMC) (Gelman et al., 2013, p. 285, sec. 11.2).

Table 4.1: Population statistics of van Genuchten (1980) soil-water characteristic curve (SWCC) parameters α and n for the databases of loam, fly ash and bentonite.

Statistic      Parameter    Loam           Fly ash        Bentonite#
Sample size                 52             54             49
Mean           α            0.52           0.03           0.19
               n            1.51           2.16           1.77
COV (%)        α            77.86          45.91          205.82
               n            13.13          26.57          20.30
Range          α            [0.09-2.10]    [0.01-0.07]    [0.01-1.26]
               n            [1.20-2.20]    [1.36-3.64]    [1.26-3.04]
ρ              α, n         -0.36          -0.56          -0.52
τ              α, n         -0.20          -0.49          -0.71

Unit for α is kPa⁻¹; # unit for α is MPa⁻¹.

MCMC algorithms allow us to sample from a complicated PDF such as the one in Eq. 4.13. As the name suggests, MCMC consists of two parts: a sequence called a Markov chain, and the traditional hit-and-trial Monte Carlo simulation.

Formally, a Markov chain is a stochastic process wherein the probability of an event depends only upon the previous event. Mathematically, it can be represented as follows:

P(Xj+1 = y | Xj = xj, Xj−1 = xj−1, ..., X0 = x0) = P(Xj+1 = y | Xj = xj)     (4.14)

The general idea behind MCMC is to simulate from a conventional distribution (easy to sample) and iteratively correct these simulated values until they reach the desired target distribution. The values at each iteration are generated sequentially, conditioned only on the previous value (hence Markov chain). Statistical moments can then be computed over the simulated samples. In this study, the Metropolis-Hastings (M-H) algorithm (Gelman et al., 2013, p. 289, sec. 11.4) is adopted for the MCMC simulation. The M-H algorithm also avoids calculating the K term and hence improves the computational efficiency of the Bayesian approach. Other algorithms exist, such as the Gibbs sampler (Gelman et al., 2013, p. 287, sec. 11.3); the M-H algorithm was chosen in this study as it does not require the conditional distributions of the target PDF to be sampled and is among the simplest to implement. The M-H algorithm used in this study is described below.

An initial state (i = 1) for the Markov chain of α and n, i.e. (αi, ni), was chosen first. While the length of the chain i < Ns, the following steps were performed:

• Calculate the target PDF f(αi, ni) in Eq. 4.13 without the normalizing constant term K⁻¹
• Generate a random candidate sample (α*, n*) conditioned on the current state (αi, ni) using a suitable proposal distribution
• Calculate the proposal PDFs f(αi, ni|α*, n*) and f(α*, n*|αi, ni)
• Calculate the target PDF f(α*, n*) in Eq. 4.13 without the normalizing constant term K⁻¹
• Calculate the acceptance ratio r = [f(α*, n*) f(αi, ni|α*, n*)] / [f(αi, ni) f(α*, n*|αi, ni)]
• Generate a random number U ~ Uniform(0, 1)
• If r > U, accept the candidate, i.e. (αi+1, ni+1) = (α*, n*); else (αi+1, ni+1) = (αi, ni)

It can be noted that even if the target distribution in Eq. 4.13 were calculated including the K⁻¹ term, it would get eliminated during the calculation of r in step 5. It can also be noted that the terms P(data|W) and P(W) need to be calculated only once; only the term f(α|Wα) f(n|Wn) D(u, v|Wα,n) needs to be recalculated at every iteration i of the Markov chain. For the proposal distribution in step 2, a bivariate normal was chosen for simplicity in this study. Instead of directly assigning a prior on W, it is convenient to assign a prior on µα, µn, σα, σn and τα,n; the corresponding prior on W can then be determined using Eqs. 4.5, 4.6 and 4.9 respectively. It should be noted that, for the proposed Bayesian approach, the SWCCs of the limited data were assumed to come from definite marginals and a definite copula.
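A compact sketch of the scheme described above is given below, reusing the helper functions from the earlier snippets (lognormal_pdf, lognormal_cdf, gaussian_copula_density, log_likelihood). The W-integral in Eq. 4.13 is approximated here by a Monte Carlo sample of W drawn once from the uniform prior and weighted by the likelihood of Eq. 4.11, so that P(data|W)P(W) is indeed computed only once; the placeholder data, prior bounds, proposal step sizes and chain length are illustrative assumptions, and a symmetric bivariate normal random walk is used so that the proposal densities cancel in r.

```python
import numpy as np

rng = np.random.default_rng(1)

# Placeholder site-specific data: N pairs of (alpha, n) fitted to measured SWCCs (illustrative values)
alpha_data = np.array([0.45, 0.61, 0.38, 0.52, 0.70])   # [1/kPa]
n_data     = np.array([1.42, 1.55, 1.48, 1.60, 1.51])

# Assumed prior bounds for W = [lam_a, zeta_a, lam_n, zeta_n, theta]
bounds = [(-2.0, 1.0), (0.1, 1.5), (-2.0, 1.0), (0.05, 1.0), (-0.99, 0.99)]

# Computed once: P(data|W)P(W) over a prior Monte Carlo sample of W (the uniform prior is constant)
W_samples = np.column_stack([rng.uniform(lo, hi, size=500) for lo, hi in bounds])
log_w = np.array([log_likelihood(W, alpha_data, n_data) for W in W_samples])
weights = np.exp(log_w - log_w.max())            # common rescaling cancels in the ratio r

def f_given_W(alpha, n, W):
    """f(alpha|W_a) f(n|W_n) D(u, v|W_an): the only term recomputed at every iteration."""
    lam_a, zeta_a, lam_n, zeta_n, theta = W
    u = lognormal_cdf(alpha, lam_a, zeta_a, c=0.0)
    v = lognormal_cdf(n, lam_n, zeta_n, c=1.0)
    return (lognormal_pdf(alpha, lam_a, zeta_a, c=0.0)
            * lognormal_pdf(n, lam_n, zeta_n, c=1.0)
            * gaussian_copula_density(u, v, theta))

def target(alpha, n):
    """Unnormalized f(alpha, n|data), Eq. 4.13, with the W-integral replaced by the weighted sum."""
    if alpha <= 0.0 or n <= 1.0:                 # respect the physical bounds alpha > 0, n > 1
        return 0.0
    vals = np.array([f_given_W(alpha, n, W) for W in W_samples]).ravel()
    return float(np.sum(weights * vals))

# Random-walk Metropolis-Hastings on (alpha, n)
Ns, step = 2000, np.array([0.05, 0.05])          # chain length and proposal std devs (assumed)
chain = np.empty((Ns, 2))
chain[0] = [alpha_data.mean(), n_data.mean()]    # initial state inside the support
f_curr = target(*chain[0])
for i in range(1, Ns):
    cand = chain[i - 1] + step * rng.standard_normal(2)   # symmetric bivariate normal proposal
    f_cand = target(*cand)
    r = f_cand / f_curr                                   # K^-1 and the proposal densities cancel
    if rng.uniform() < r:
        chain[i], f_curr = cand, f_cand                   # accept the candidate
    else:
        chain[i] = chain[i - 1]                           # retain the current state
# Statistical moments of (alpha, n|data) can now be estimated from `chain` after a burn-in.
```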

Jaynes (1957) asserted that the exponential-family model (e.g. Gaussian) is the maximum entropy model under sparse constraints. Ideally, Bayesian model comparison should be performed (e.g. Zhang et al., 2018) to designate the marginals and the dependence structure (copula) in the case of limited data.