
Estimation in a random effects model with parallel polynomial growth curves

Takahisa Yokoyama

(Received January 5, 2001) (Revised April 24, 2001)

Abstract. In this paper we consider an extended growth curve model with two hierarchical within-individuals design matrices, which is useful in analyzing mean profiles of several groups with parallel polynomial growth curves. The covariance structure based on a random effects model is assumed. The maximum likelihood estimators (MLE's) are obtained under the random effects covariance structure. The efficiency of the MLE is discussed. A numerical example is also given.

1. Introduction

Suppose that a response variable $x$ has been measured at $p$ different occasions on each of $N$ individuals, and each individual belongs to one of $k$ groups.

Let $x_j^{(g)} = [x_{1j}^{(g)}, \ldots, x_{pj}^{(g)}]'$ be a $p$-vector of measurements on the $j$-th individual in the $g$-th group, and assume that the $x_j^{(g)}$'s are independently distributed as $N_p(\mu^{(g)}, \Sigma)$, where $\Sigma$ is an unknown $p \times p$ positive definite matrix, $j = 1, \ldots, N_g$, $g = 1, \ldots, k$. Further, we assume that the mean profiles of the $k$ groups are parallel polynomial growth curves, i.e.,

$$\mu^{(g)} = \xi^{(g)} 1_p + B'\xi_2, \qquad g = 1, \ldots, k, \tag{1.1}$$

where $1_p$ is a $p$-vector of ones,

$$B = \begin{bmatrix} 1_p' \\ B_2 \end{bmatrix} = \begin{bmatrix} 1 & \cdots & 1 \\ t_1 & \cdots & t_p \\ \vdots & & \vdots \\ t_1^{q-1} & \cdots & t_p^{q-1} \end{bmatrix} \tag{1.2}$$

is a $q \times p$ within-individuals design matrix of rank $q$ $(\le p)$. Yokoyama and Fujikoshi [10] considered a parallel profile model with

$$\mu^{(g)} = \xi^{(g)} 1_p + \mu, \qquad g = 1, \ldots, k.$$

2000 Mathematics Subject Classification. 62H12.

Key words and phrases. extended growth curve model, hierarchical within-individuals design matrices, random effects covariance structure, maximum likelihood estimator, efficiency.


Therefore, the model (1.1) means that $\mu$ has a linear structure. It may be noted that the mean structure (1.1) includes two hierarchical within-individuals design matrices. Without loss of generality, we may assume that $\xi^{(k)} = 0$. In the following we shall do this. Let

$$X = [x_1^{(1)}, \ldots, x_{N_1}^{(1)}, \ldots, x_1^{(k)}, \ldots, x_{N_k}^{(k)}]', \qquad N = N_1 + \cdots + N_k.$$

Then the model for $X$ can be written as

$$X \sim N_{N \times p}(A_1 \xi_1 1_p' + 1_N \xi_2' B, \; \Sigma \otimes I_N), \tag{1.3}$$

where

$$A_1 = \begin{bmatrix} 1_{N_1} & \cdots & 0 \\ \vdots & \ddots & \vdots \\ 0 & \cdots & 1_{N_{k-1}} \\ 0 & \cdots & 0 \end{bmatrix}$$

is an $N \times (k-1)$ between-individuals design matrix of rank $k-1$ $(\le N - p - 1)$, and $\xi_1 = [\xi^{(1)}, \ldots, \xi^{(k-1)}]'$ and $\xi_2$ are vectors of unknown parameters. The model (1.3) may be called a parallel growth curve model. This is a nested model based on the growth curve model with two different within-individuals design matrices. For a generalized nested model based on the growth curve model with several different within-individuals design matrices, see, e.g., von Rosen [9]. The model (1.3) with $B = I_p$ is a special case of the mixed MANOVA-GMANOVA models considered by Chinchilli and Elswick [2], Kshirsagar and Smith [4, p. 85], etc. The mean structure of (1.3) can be written as

$$E(X) = [A_1 \;\; 1_N] \begin{bmatrix} \xi_{11} & 0 \\ \xi_{21} & \xi_{22}' \end{bmatrix} B, \tag{1.4}$$

where $\xi_1 = \xi_{11}$ and $\xi_2' = [\xi_{21} \;\; \xi_{22}']$. We note that the model (1.3) is the ordinary growth curve model (Potthoff and Roy [5]) with a linear restriction on the mean parameters.

Fujikoshi and Satoh [3] obtained the MLE's in the growth curve model with two different within-individuals design matrices when $\Sigma$ has no structure, i.e., is an arbitrary unknown positive definite matrix. When there is no theoretical or empirical basis for assuming a special covariance structure, we need to treat $\Sigma$ as an arbitrary unknown positive definite matrix. However, for the analysis of repeated measures or growth curves it has been proposed to consider certain parsimonious covariance structures. As one such structure, we are interested in a random effects


covariance structure (see, e.g., Rao [6]). In our model, the structure can be expressed as

$$\Sigma = \delta^2 1_p 1_p' + \sigma^2 I_p, \tag{1.5}$$

where $\delta^2 \ge 0$ and $\sigma^2 > 0$. The covariance structure (1.5) can be introduced by assuming the following random effects model:

$$x_j^{(g)} = (\xi^{(g)} + \eta_j^{(g)}) 1_p + B'\xi_2 + \varepsilon_j^{(g)}, \tag{1.6}$$

where the $\eta_j^{(g)}$'s and $\varepsilon_j^{(g)}$'s are independently distributed as $N(0, \delta^2)$ and $N_p(0, \sigma^2 I_p)$, respectively. Therefore, the covariance matrix of $x_j^{(g)}$ is given by (1.5). This implies that

$$X \sim N_{N \times p}(A_1 \xi_1 1_p' + 1_N \xi_2' B, \; (\delta^2 1_p 1_p' + \sigma^2 I_p) \otimes I_N). \tag{1.7}$$
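As a quick illustration of how the random effects model (1.6) induces the structure (1.5), the following sketch (not taken from the paper) simulates many individuals from (1.6) and compares their sample covariance matrix with $\delta^2 1_p 1_p' + \sigma^2 I_p$. The time points, polynomial degree and parameter values are hypothetical choices made only for this example.

```python
import numpy as np

rng = np.random.default_rng(0)
p, q, n = 4, 2, 200_000                      # occasions, rows of B, simulated individuals
t = np.array([1.0, 2.0, 3.0, 4.0])           # hypothetical time points t_1, ..., t_p
B = np.vstack([t**r for r in range(q)])      # q x p within-individuals design matrix (1.2)
xi_g = 0.5                                   # hypothetical group level xi^(g)
xi2 = np.array([0.3, 0.1])                   # hypothetical polynomial coefficients xi_2
delta2, sigma2 = 0.2, 0.05                   # hypothetical variance components

eta = rng.normal(0.0, np.sqrt(delta2), size=(n, 1))   # individual random effects eta_j^(g)
eps = rng.normal(0.0, np.sqrt(sigma2), size=(n, p))   # measurement errors eps_j^(g)
X = (xi_g + eta) @ np.ones((1, p)) + B.T @ xi2 + eps  # each row follows model (1.6)

print(np.round(np.cov(X, rowvar=False), 3))                        # sample covariance
print(np.round(delta2 * np.ones((p, p)) + sigma2 * np.eye(p), 3))  # structure (1.5)
```

For a large number of simulated individuals the two matrices agree to a few decimal places, which is the structural fact exploited in the canonical reduction of §2.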

In this paper we consider the problem of estimating the unknown parameters $\xi_1$, $\xi_2$, $\delta^2$ and $\sigma^2$ when $\Sigma$ has the structure (1.5). In §2 we obtain a canonical form of (1.7). In §3 we obtain the MLE's in the model (1.7), using the canonical form. In §4 it is shown how much can be gained in the maximum likelihood estimation of $\xi_1$ by assuming a random effects covariance structure. In §5 we give a numerical example of the results of §4.

2. Transformation of the model

In order to transform (1.7) into a model which is easier to analyze, we use a canonical reduction. Let $H = [H_1 \;\; N^{-1/2} 1_N \;\; H_3]$ be an orthogonal matrix of order $N$ such that

$$[A_1 \;\; 1_N] = [H_1 \;\; N^{-1/2} 1_N] \begin{bmatrix} L_{11} & 0 \\ \ell_{21}' & N^{1/2} \end{bmatrix} = H_{(2)} L,$$

where $H_1 : N \times (k-1)$, and $L_{11} : (k-1) \times (k-1)$ is a lower triangular matrix.

Similarly, let $Q = [p^{-1/2} 1_p \;\; Q_2' \;\; Q_3']'$ be an orthogonal matrix of order $p$ such that

$$\begin{bmatrix} 1_p' \\ B_2 \end{bmatrix} = \begin{bmatrix} p^{1/2} & 0' \\ g_{21} & G_{22} \end{bmatrix} \begin{bmatrix} p^{-1/2} 1_p' \\ Q_2 \end{bmatrix} = G Q_{(2)},$$

where $Q_2 : (q-1) \times p$, and $G_{22} : (q-1) \times (q-1)$ is a lower triangular matrix.

Then the mean structure of (1.7) can be written as

$$A_1 \xi_1 1_p' + 1_N \xi_2' B = p^{-1/2} H_1 \psi_1 1_p' + N^{-1/2} 1_N \psi_2' Q_{(2)}, \tag{2.1}$$


where

$$\psi_1 = p^{1/2} L_{11} \xi_1, \qquad \psi_2' = N^{1/2} \xi_2' G + \ell_{21}' [\xi_1 \;\; 0] G.$$

Here we note that $(\xi_1, \xi_2)$ is an invertible function of $(\psi_1, \psi_2)$. In fact, $\xi_1$ and $\xi_2$ can be expressed in terms of $\psi_1$ and $\psi_2$ as

$$\xi_1 = p^{-1/2} L_{11}^{-1} \psi_1, \qquad \xi_2' = N^{-1/2} \psi_2' G^{-1} - N^{-1/2} \ell_{21}' [p^{-1/2} L_{11}^{-1} \psi_1 \;\; 0]. \tag{2.2}$$

Using the above transformation, we can write a canonical form of (1.7) as

$$Y = H'XQ' = \begin{bmatrix} y_{11} & Y_{12} & Y_{13} \\ y_{21} & y_{22}' & y_{23}' \\ y_{31} & Y_{32} & Y_{33} \end{bmatrix} \sim N_{N \times p}(E(Y), \; C \otimes I_N), \tag{2.3}$$

where the mean $E(Y)$ and the covariance matrix $C$ are given by

$$E(Y) = \begin{bmatrix} \psi_{11} & 0 & 0 \\ \psi_{21} & \psi_{22}' & 0' \\ 0 & 0 & 0 \end{bmatrix}, \qquad \begin{bmatrix} \psi_{11} & 0 \\ \psi_{21} & \psi_{22}' \end{bmatrix} = L \begin{bmatrix} \xi_{11} & 0 \\ \xi_{21} & \xi_{22}' \end{bmatrix} G, \tag{2.4}$$

$\psi_1 = \psi_{11}$, $\psi_2' = [\psi_{21} \;\; \psi_{22}']$, and

$$C = Q \Sigma Q' = \begin{bmatrix} p\delta^2 + \sigma^2 & 0' \\ 0 & \sigma^2 I_{p-1} \end{bmatrix}. \tag{2.5}$$

3. The MLE’s

In this section we obtain the MLE's of $\xi_1$, $\xi_2$, $\delta^2$ and $\sigma^2$ in the model (1.7), using (2.3). Let

$$U = [y_{11} \;\; Y_{12} \;\; Y_{13}]'[y_{11} \;\; Y_{12} \;\; Y_{13}] = \begin{bmatrix} u_{11} & u_{12}' & u_{13}' \\ u_{21} & U_{22} & U_{23} \\ u_{31} & U_{32} & U_{33} \end{bmatrix},$$

$$V = [y_{21} \;\; y_{22}' \;\; y_{23}']'[y_{21} \;\; y_{22}' \;\; y_{23}'] = \begin{bmatrix} v_{11} & v_{12}' & v_{13}' \\ v_{21} & V_{22} & V_{23} \\ v_{31} & V_{32} & V_{33} \end{bmatrix},$$

$$W = [y_{31} \;\; Y_{32} \;\; Y_{33}]'[y_{31} \;\; Y_{32} \;\; Y_{33}] = \begin{bmatrix} w_{11} & w_{12}' & w_{13}' \\ w_{21} & W_{22} & W_{23} \\ w_{31} & W_{32} & W_{33} \end{bmatrix}$$

and

$$T = U + W = \begin{bmatrix} t_{11} & t_{12}' & t_{13}' \\ t_{21} & T_{22} & T_{23} \\ t_{31} & T_{32} & T_{33} \end{bmatrix}.$$

It is easy to see that the MLE's of $\psi_1$ and $\psi_2$ are given by

$$\hat\psi_1 = y_{11}, \qquad \hat\psi_2' = y_{2(12)}',$$

where $y_{2(12)}' = [y_{21} \;\; y_{22}']$. Hence the MLE's of $\xi_1$ and $\xi_2$ are given by

$$\hat\xi_1 = p^{-1/2} L_{11}^{-1} y_{11}, \qquad \hat\xi_2' = N^{-1/2} y_{2(12)}' G^{-1} - N^{-1/2} \ell_{21}' [p^{-1/2} L_{11}^{-1} y_{11} \;\; 0]. \tag{3.1}$$

Using a technique similar to the one used in estimating variance components in a one-way random effects model by maximum likelihood (see, e.g., Searle, Casella and McCulloch [7, p. 148]), we can obtain the MLE's of $\delta^2$ and $\sigma^2$. Let $L(\psi_1, \psi_2, \sigma^2, \delta^2)$ be the likelihood function of $Y$. Then we have

$$g(\sigma^2, \delta^2) = -2 \log L(\hat\psi_1, \hat\psi_2, \sigma^2, \delta^2)
= Np\log(2\pi) + N\log(p\delta^2+\sigma^2) + \frac{w_{11}}{p\delta^2+\sigma^2} + N(p-1)\log\sigma^2 + \frac{1}{\sigma^2}(\operatorname{tr} T_{(23)(23)} + \operatorname{tr} V_{33}),$$

where

$$T_{(23)(23)} = \begin{bmatrix} T_{22} & T_{23} \\ T_{32} & T_{33} \end{bmatrix}.$$

The minimum of $g(\sigma^2, \delta^2)$ with respect to $\delta^2 \ge 0$ and $\sigma^2 > 0$ is achieved at

$$\hat\delta^2 = \max\left\{\frac{1}{p}\left[\frac{1}{N} w_{11} - \frac{1}{N(p-1)}(\operatorname{tr} T_{(23)(23)} + \operatorname{tr} V_{33})\right],\; 0\right\},$$
$$\hat\sigma^2 = \min\left\{\frac{1}{N(p-1)}(\operatorname{tr} T_{(23)(23)} + \operatorname{tr} V_{33}),\; \frac{1}{N+N(p-1)}(w_{11} + \operatorname{tr} T_{(23)(23)} + \operatorname{tr} V_{33})\right\} \tag{3.2}$$

(see, e.g., Arnold [1, p. 251]). Therefore, the MLE's of $\delta^2$ and $\sigma^2$ are given by (3.2).

Now we express the MLE’s given in (3.1) and (3.2) in terms of the original observations. Let Sw andSt be the matrices of the sums of squares and prod- ucts due to the within variation and total variation, i.e.,


$$S_w = X' H_3 H_3' X = \sum_{g=1}^{k} \sum_{j=1}^{N_g} (x_j^{(g)} - \bar x^{(g)})(x_j^{(g)} - \bar x^{(g)})',$$

$$S_t = X'(H_1 H_1' + H_3 H_3') X = \sum_{g=1}^{k} \sum_{j=1}^{N_g} (x_j^{(g)} - \bar x)(x_j^{(g)} - \bar x)',$$

where $\bar x^{(g)}$ and $\bar x$ are the sample mean vectors of the observations of the $g$-th group and of all the groups, respectively. Further, let

$$\tilde A_1 = \left(I_N - \frac{1}{N} 1_N 1_N'\right) A_1, \qquad \tilde B_2 = B_2\left(I_p - \frac{1}{p} 1_p 1_p'\right). \tag{3.3}$$

Then, from the definitions of $L$ and $G$ it is easily seen that

$$H_1 = \tilde A_1 L_{11}^{-1}, \qquad \ell_{21}' = \frac{1}{\sqrt{N}} 1_N' A_1, \qquad Q_2 = G_{22}^{-1} \tilde B_2, \qquad g_{21} = \frac{1}{\sqrt{p}} B_2 1_p.$$

Using these results, we have the following theorem.

Theorem 3.1. The MLE's of $\xi_1$, $\xi_2$, $\delta^2$ and $\sigma^2$ in the extended growth curve model (1.7) are given as follows:

$$\hat\xi_1 = \frac{1}{p}(\tilde A_1' \tilde A_1)^{-1} \tilde A_1' X 1_p,$$
$$\hat\xi_{21} = \frac{1}{p}\left[\bar x'\{I_p - \tilde B_2'(\tilde B_2 \tilde B_2')^{-1} B_2\} - \frac{1}{N} 1_N' A_1 (\tilde A_1' \tilde A_1)^{-1} \tilde A_1' X\right] 1_p,$$
$$\hat\xi_{22}' = \bar x' \tilde B_2'(\tilde B_2 \tilde B_2')^{-1},$$
$$\hat\delta^2 = \max\left\{\frac{1}{p}\left[\frac{1}{N} s_1^2 - \frac{1}{N(p-1)} s_2^2\right],\; 0\right\},$$
$$\hat\sigma^2 = \min\left\{\frac{1}{N(p-1)} s_2^2,\; \frac{1}{Np}(s_1^2 + s_2^2)\right\},$$

where $\tilde A_1$ and $\tilde B_2$ are given by (3.3), and $s_1^2$ and $s_2^2$ are defined by

$$s_1^2 = \frac{1}{p} 1_p' S_w 1_p$$

and

$$s_2^2 = \operatorname{tr} S_t - \frac{1}{p} 1_p' S_t 1_p + N \bar x'\left\{I_p - \frac{1}{p} 1_p 1_p' - \tilde B_2'(\tilde B_2 \tilde B_2')^{-1} \tilde B_2\right\} \bar x,$$

respectively.
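To make the closed-form expressions of Theorem 3.1 concrete, the following hedged sketch (my own transcription of the theorem, not code from the paper) evaluates them on data simulated from the random effects model (1.6). The design (two groups of six individuals, four time points, a linear trend) and all parameter values are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
N1, N2, p, q = 6, 6, 4, 2
N = N1 + N2
t = np.array([1.0, 2.0, 3.0, 4.0])
B = np.vstack([np.ones(p), t])
B2 = B[1:, :]                                            # the last q-1 rows of B
A1 = np.vstack([np.ones((N1, 1)), np.zeros((N2, 1))])

# Simulate one data matrix X from the random effects model (1.6)/(1.7).
xi1_true, xi2_true = np.array([[0.5]]), np.array([0.3, 0.1])
mean = A1 @ xi1_true @ np.ones((1, p)) + np.ones((N, 1)) * (B.T @ xi2_true)
X = mean + rng.normal(0, np.sqrt(0.2), (N, 1)) + rng.normal(0, np.sqrt(0.05), (N, p))

one_p, one_N = np.ones((p, 1)), np.ones((N, 1))
A1t = (np.eye(N) - one_N @ one_N.T / N) @ A1             # A~_1 of (3.3)
B2t = B2 @ (np.eye(p) - one_p @ one_p.T / p)             # B~_2 of (3.3)
xbar = X.mean(axis=0, keepdims=True).T                   # overall sample mean vector
group_means = np.vstack([np.tile(X[:N1].mean(0), (N1, 1)), np.tile(X[N1:].mean(0), (N2, 1))])
Sw = (X - group_means).T @ (X - group_means)             # within-groups SSP matrix
St = (X - xbar.T).T @ (X - xbar.T)                       # total SSP matrix

P = B2t.T @ np.linalg.inv(B2t @ B2t.T)                   # B~_2'(B~_2 B~_2')^{-1}
M = np.linalg.inv(A1t.T @ A1t) @ A1t.T @ X               # (A~_1'A~_1)^{-1} A~_1' X
xi1_hat = M @ one_p / p
xi21_hat = (xbar.T @ (np.eye(p) - P @ B2) - one_N.T @ A1 @ M / N) @ one_p / p
xi22_hat = xbar.T @ P
s1 = (one_p.T @ Sw @ one_p).item() / p                   # s_1^2
s2 = (np.trace(St) - (one_p.T @ St @ one_p).item() / p
      + N * (xbar.T @ (np.eye(p) - one_p @ one_p.T / p - P @ B2t) @ xbar).item())   # s_2^2
delta2_hat = max(s1 / N - s2 / (N * (p - 1)), 0.0) / p
sigma2_hat = min(s2 / (N * (p - 1)), (s1 + s2) / (N * p))
print(xi1_hat.ravel(), xi21_hat.ravel(), xi22_hat.ravel(), round(delta2_hat, 4), round(sigma2_hat, 4))
```

With a sample this small the estimates are noisy, but they fluctuate around the values used in the simulation, which is all the sketch is meant to show.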

We note that the MLE's $\hat\delta^2$ and $\hat\sigma^2$ are not unbiased. The usual unbiased estimators of $\delta^2$ and $\sigma^2$ may be defined by


$$\tilde\delta^2 = \frac{1}{p}\left[\frac{1}{N-k} w_{11} - \frac{1}{N(p-1)-(q-1)}(\operatorname{tr} T_{(23)(23)} + \operatorname{tr} V_{33})\right],$$
$$\tilde\sigma^2 = \frac{1}{N(p-1)-(q-1)}(\operatorname{tr} T_{(23)(23)} + \operatorname{tr} V_{33}), \tag{3.4}$$

respectively. There is the possibility that the use of $\tilde\delta^2$ can lead to a negative estimate of $\delta^2$, while $\hat\delta^2$ is non-negative. As a modification of the MLE's, we propose the estimators obtained from the MLE's by replacing $N$ and $N(p-1)$ by $N-k$ and $N(p-1)-(q-1)$, respectively, in (3.2). The modified MLE's, which are based on the joint distribution of $w_{11}$ and $(\operatorname{tr} T_{(23)(23)} + \operatorname{tr} V_{33})$ only, may be called restricted maximum likelihood estimators (REMLE's). These estimators can be expressed in terms of the original observations, again using the notations in Theorem 3.1.
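A minimal sketch of this modification, under my reading of the divisor replacement described above ($N \to N-k$ and $N(p-1) \to N(p-1)-(q-1)$ everywhere they appear in (3.2)); the values of $s_1^2$ and $s_2^2$ below are placeholders, not quantities from the paper.

```python
# p, q, N, k as in Theorem 3.1; s1 and s2 stand for s_1^2 and s_2^2.
p, q, N, k = 4, 2, 12, 2
s1, s2 = 0.77, 0.36                                      # placeholder values
delta2_reml = max(s1 / (N - k) - s2 / (N * (p - 1) - (q - 1)), 0.0) / p
sigma2_reml = min(s2 / (N * (p - 1) - (q - 1)),
                  (s1 + s2) / ((N - k) + N * (p - 1) - (q - 1)))
print(round(delta2_reml, 5), round(sigma2_reml, 5))
```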

4. Efficiency of $\hat\xi_1$

Next we consider the efficiency of the MLE of $\xi_1$ in the case when the covariance structure (1.5) is assumed. When no special assumptions about $\Sigma$ are made, the MLE of $\xi_1$ is given by

$$\tilde\xi_1 = (1_p' S_w^{-1} 1_p)^{-1} (\tilde A_1' \tilde A_1)^{-1} \tilde A_1' X S_w^{-1} 1_p. \tag{4.1}$$

The estimators $\hat\xi_1$ and $\tilde\xi_1$ have the following properties.

Theorem 4.1. In the extended growth curve model (1.7) it holds that both of the estimators $\hat\xi_1$ and $\tilde\xi_1$ are unbiased, and

$$\operatorname{Var}(\hat\xi_1) = \frac{1}{p}(p\delta^2+\sigma^2)(\tilde A_1'\tilde A_1)^{-1}, \qquad \operatorname{Var}(\tilde\xi_1) = \frac{1}{p}(p\delta^2+\sigma^2)\left(1 + \frac{p-1}{N-k-p}\right)(\tilde A_1'\tilde A_1)^{-1}.$$

Proof. From (2.2), (3.1) and $\tilde A_1'\tilde A_1 = L_{11}' L_{11}$, we obtain the result on $\hat\xi_1$. It can be shown that for any positive definite covariance matrix $\Sigma$,

$$E(\tilde\xi_1) = \xi_1 \qquad\text{and}\qquad \operatorname{Var}(\tilde\xi_1) = (1_p'\Sigma^{-1}1_p)^{-1}\left(1 + \frac{p-1}{N-k-p}\right)(\tilde A_1'\tilde A_1)^{-1}.$$

Under the assumption that $\Sigma = \delta^2 1_p 1_p' + \sigma^2 I_p$, it holds that

$$(1_p'\Sigma^{-1}1_p)^{-1} = \frac{1}{p}(p\delta^2 + \sigma^2),$$

which proves the desired result on $\tilde\xi_1$.


From Theorem 4.1, we obtain

$$\operatorname{Var}(\tilde\xi_1) - \operatorname{Var}(\hat\xi_1) = (p\delta^2+\sigma^2)\,\frac{p-1}{p(N-k-p)}(\tilde A_1'\tilde A_1)^{-1} > 0, \tag{4.2}$$

which implies that $\hat\xi_1$ is more efficient than $\tilde\xi_1$ in the model (1.7). This shows that we can obtain a more efficient estimator of $\xi_1$ by assuming a random effects covariance structure. In particular, when $p$ is large relative to $N$, the gain is greater.
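The relative efficiency implied by Theorem 4.1 is $\operatorname{Var}(\tilde\xi_1)/\operatorname{Var}(\hat\xi_1) = 1 + (p-1)/(N-k-p)$, which depends only on $N$, $k$ and $p$. The short sketch below (my own illustration with hypothetical designs, assuming $N - k - p > 0$) tabulates this ratio and shows how it grows when $p$ is large relative to $N$.

```python
# Relative efficiency 1 + (p-1)/(N-k-p) from Theorem 4.1 for some hypothetical designs.
k = 2
for N, p in [(12, 4), (12, 8), (20, 4), (20, 16)]:
    ratio = 1.0 + (p - 1) / (N - k - p)      # valid only when N - k - p > 0
    print(f"N={N:2d}, p={p:2d}: Var(xi_tilde)/Var(xi_hat) = {ratio:.3f}")
```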

5. Numerical example

In this section we give a numerical example to illustrate the efficiency of $\hat\xi_1$ obtained by assuming a random effects covariance structure. We apply the results of §4 to the data (see, e.g., Srivastava and Carter [8, p. 227]) on the price indices of hand soaps packaged in four ways, estimated by twelve consumers. For six of the consumers, the packages have been labeled with a well-known brand name. For the remaining six consumers, no label is used. Then, from the data we obtain

$$\bar x^{(1)} = [.31667,\; .45833,\; .47500,\; .64167]', \qquad \bar x^{(2)} = [.60000,\; .66667,\; .85000,\; .96667]',$$
$$\bar x = [.45833,\; .56250,\; .66250,\; .80417]',$$

$$S_w = \begin{bmatrix} .21833 & .15167 & .20500 & .08333 \\ & .25542 & .16375 & .21375 \\ & & .30375 & .17875 \\ & & & .29542 \end{bmatrix},$$

$$S_t = \begin{bmatrix} .45917 & .32875 & .52375 & .35958 \\ & .38563 & .39813 & .41688 \\ & & .72563 & .54438 \\ & & & .61229 \end{bmatrix}$$

(the matrices are symmetric; only the upper triangles are shown).

For the observation matrix $X : 12 \times 4$, we assume the model (1.7) with

$$E(X) = \begin{bmatrix} 1_6 \\ 0 \end{bmatrix}\xi_{11} 1_4' + 1_{12}[\xi_{21} \;\; \xi_{22}]\begin{bmatrix} 1 & 1 & 1 & 1 \\ t_1 & t_2 & t_3 & t_4 \end{bmatrix} = \begin{bmatrix} 1_6 & 1_6 \\ 0 & 1_6 \end{bmatrix}\begin{bmatrix} \xi_{11} & 0 \\ \xi_{21} & \xi_{22} \end{bmatrix}\begin{bmatrix} 1 & 1 & 1 & 1 \\ t_1 & t_2 & t_3 & t_4 \end{bmatrix} \tag{5.1}$$

and

$$\operatorname{Var}(\operatorname{vec}(X)) = (\delta^2 1_4 1_4' + \sigma^2 I_4) \otimes I_{12}. \tag{5.2}$$


Now we estimate how much can be gained in the maximum likelihood estimation of $\xi_{11}$ by assuming the covariance structure (5.2). Since $p = 4$, $N = 12$, $k = 2$, $\tilde A_1'\tilde A_1 = 3$, $\hat\delta^2 = .01353$ and $\hat\sigma^2 = .00976$, it follows from Theorem 4.1 and (4.2) that

$$\widehat{\operatorname{Var}}(\tilde\xi_{11}) - \widehat{\operatorname{Var}}(\hat\xi_{11}) = \frac{1}{24}(4\hat\delta^2 + \hat\sigma^2) = .00266.$$
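The final figure can be checked directly from the quantities reported in this section; the following short computation (my own arithmetic, not code from the paper) reproduces the value .00266 from (4.2).

```python
# Estimated efficiency gain via (4.2), using the quantities reported above.
p, N, k = 4, 12, 2
A1tA1 = 3.0                                  # A~_1'A~_1 for this design
delta2_hat, sigma2_hat = 0.01353, 0.00976    # MLE's reported above

gain = (p * delta2_hat + sigma2_hat) * (p - 1) / (p * (N - k - p)) / A1tA1
print(round(gain, 5))                        # 0.00266 = (4*delta2_hat + sigma2_hat)/24
```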

Acknowledgements

The author would like to thank the referee and Professor Y. Fujikoshi, Hiroshima University, for many helpful comments and discussions.

References

[ 1 ] S. F. Arnold, The Theory of Linear Models and Multivariate Analysis, New York: Wiley, 1981.

[ 2 ] V. M. Chinchilli and R. K. Elswick, A mixture of the MANOVA and GMANOVA models, Commun. Statist.—Theor. Meth. 14 (12), (1985), 3075–3089.

[ 3 ] Y. Fujikoshi and K. Satoh, Estimation and model selection in an extended growth curve model, Hiroshima Math. J. 26 (1996), 635–647.

[ 4 ] A. M. Kshirsagar and W. B. Smith, Growth Curves, New York: Dekker, 1995.

[ 5 ] R. F. Potthoff and S. N. Roy, A generalized multivariate analysis of variance model useful especially for growth curve problems, Biometrika 51 (1964), 313–326.

[ 6 ] C. R. Rao, The theory of least squares when the parameters are stochastic and its application to the analysis of growth curves, Biometrika 52 (1965), 447–458.

[ 7 ] S. R. Searle, G. Casella and C. E. McCulloch, Variance Components, New York: Wiley, 1992.

[ 8 ] M. S. Srivastava and E. M. Carter, An Introduction to Applied Multivariate Statistics, New York: North-Holland, 1983.

[ 9 ] D. von Rosen, Maximum likelihood estimators in multivariate linear normal models, J. Multivariate Anal. 31 (1989), 187–200.

[10] T. Yokoyama and Y. Fujikoshi, A parallel profile model with random-effects covariance structure, J. Japan Statist. Soc. 23 (1993), 83–89.

Department of Mathematics and Informatics Faculty of Education

Tokyo Gakugei University Koganei, Tokyo 184-8501, Japan

e-mail: [email protected]
