The methods of Schall and Breslow and Clayton

Chapter 1 Introduction

5.2 The Generalized Linear Mixed Model

5.2.6 The methods of Schall and Breslow and Clayton

In this derivation u denotes the vector of random effects. If we assume the random effects are u∼ N(0,G), the normal “errors” linear mixed model is given by

y =Xβ +Zu+e.

the conditional mean of the observations given the random model effects is E[y|u] = Xβ+Zu

and the conditional variance

var[y|u] =R =var(e).

The observations can then be described as y =µ+e

whereµdenotes the conditional meanE[y|µ]. The Generalized Linear Mixed Model (GLMM) can also be described in terms of the conditional means. It takes the form:

η = Xβ+Zu

whereηis the linear predictor through the link function g(µ).Thus we could write the GLMM as

g{E[y|u]} = Xβ+Zu

As in the conventional mixed model, the random model effectsuare assumed to have a multivariate normal distribution with mean 0and variance covariance matrix G while as in the conventional generalized linear model, the underlying distribution of y is assumed to be a member of the exponential family (for any given u)

To see the difficulty that arises with GLMM’s, suppose that we have a cluster sampled survey data with a random sample of clusters indexed i= 1, . . . , N with elementary unitsj = 1, . . . , n_iwithin each cluster. The observations sat- isfy generalized linear models with common distributions and a link function.

We now introduce a random term u_i corresponding to the random cluster effect, and assume that the random effects u_i are normally distributed with a mean 0 and a variance σ_u². We put u_i =γz_i where z_i ∼N ID(0,1)

As an example we consider a logistic regression model for binary outcomes.

Let

p_ij = Prob(y_ij = 1)

so that

(p_ij|z_i) = Prob(y_ij = 1|z_i)

gives the probability of “success” for the j^th unit in clusteri, given the value of the random cluster effect Z_i in cluster i. Then the model can be written as

logit(p_ij|z_i) = x_ijβ+γz_i. (5.8) Overall therefore regardless of the cluster effect, p_ij is found by integrating over the random cluster effects to get

p_ij = Z ∞

zi=−∞

(p_ij|z_i)φ(z_i)dz_i (5.9) where φ is the density function the standard normal random variable (N(0,1)). For model Eq. (5.8), the joint likelihood for y_ij, j = 1, . . . , n_i and i= 1, . . . , N involves the integral in Eq. (5.9) since

l(β, γ) = X

{y_ijlogp_ij + (n_ij −y_ij)log(1−p_ij)}+ contsant. (5.10) The difficulty is that the integral has to be evaluated numerically. This can be done for example using the Gaussian Hermite formula for numerical quadrature, (discussed in detail in Section 5.7.10 of this chapter) but briefly under this method, an integral such as above is approximated by means of sums as follows:

Z ∞

−∞

f(u)e^−u²du≈

j=1

c_jf(s_j) (5.11)

where the values ofc_j ands_j are given in standard tables. When we integrate out (numerically) the random effects, we obtain the marginal likelihood l(β, γ) and thismarginallikelihood can be maximized to find the maximum likelihood estimates (MLEs) of β andγ. This approach was used by Hedeker

and Gibbons (1994) who proposed a random effects ordinal regression model for the analysis of clustered response data. Now a binary response can be considered a special case of an ordinal response with only two(ordered) out- come categories. They developed the model for both the probit and the logistic response functions using the “threshold” concept in which it is assumed that the observed ordered category is determined by the value of a latent unobservable continuous response that follows a linear model incorpo- rating random effects. Hedeker and Gibbons (1996) in addition developed a program called MIXOR (and an extended version MIXORE) to implement this method of marginal maximum likelihood estimation (MMLE). MIXORE is a public domain computer program that can be downloaded from the inter- net, together with a manual that describes how the data should be prepared for analysis, together with a specification file MIXORE.DEF which describes the setup of the the data in terms of which columns contain the random effects, which columns contain the fixed effects and which column contains the (ordinal) response. In this program, any given set of factors (i.e. categorical explanatory variables) should be first converted into the required number of indicator variables which are then stored as separate columns in the input data set. This could be seen as a drawback for analyses with a large number of factors each having a large number of levels (In the Kilifi data set, factors such as “visit”-44 levels). In addition, a constant term is required in the linear predictor of the model, thus the data file should have a column of 1’s as one of the explanatory variables. For these reasons We will not use the MIXORE program in the analysis of the current data set.

There are a number of issues associated with this approach, among them:

• It is in fact relatively easy to extend NLMIXED analyses for correlated random effects such as those found in random coefficient regression

models. Such models are quite easy to fit in WinBUGS.

• It may be computationally demanding

• It is not widely implemented in existing commercial software packages (it is only freely available in MIXORE and also available in Stata ver- sions 6 and 9)

One might wonder why we opt to use the likelihood conditional on the random effects u in the case of the non-normal response. The answer is that in the case of a normal response and the identity link, the random effects do not appear explicitly in the likelihood, but only appear through the variance covariance parameters σ²₁, σ₂², . . . , σ²_k in the case of k random effects or equivalently through the “ratios or gammas”γ_i = ^σ_σ²ⁱ2 whereσ² is the resid- ual variance. In the above example E(z_j) = 0, thus E[x_ijβ+γz_j] = x_ijβ.

However E[g(µ)]6=g[E(µ)] in the case of the non-identity link functiong, so taking the expectations will not cause the random terms to vanish. Hence we will consider a number of alternative approaches based on modifications to the mixed model equations.

5.2.7 Estimation approaches by Schall and by Breslow

Dalam dokumen Analysis of longitudinal binary data : an application to a disease process. (Halaman 126-130)