3.4 Properties of Linear Functions of Random Vectors
Note that $\hat{\beta}$, $\hat{Y}$, and $e$ are random vectors because they are functions of the random vector $Y$. In the previous sections, these vectors were expressed as linear functions $AY$ of $Y$. The matrix $A$ is

• $(X'X)^{-1}X'$ for $\hat{\beta}$,
• $P$ for $\hat{Y}$, and
• $(I - P)$ for $e$.

Before studying the properties of $\hat{\beta}$, $\hat{Y}$, and $e$, it is useful to study the general properties of linear functions of random vectors.
Let $Z = (z_1\ \cdots\ z_n)'$ be a random vector consisting of random variables $z_1, z_2, \ldots, z_n$. The mean $\mu_Z$ of the random vector $Z$ is defined as an $n \times 1$ vector with the $i$th coordinate given by $E(z_i)$. The variance–covariance matrix $V_Z$ for $Z$ is defined as an $n \times n$ symmetric matrix with the diagonal elements equal to the variances of the random variables (in order) and the $(i,j)$th off-diagonal element equal to the covariance between $z_i$ and $z_j$.
For example, if $Z$ is a $3 \times 1$ vector of random variables $z_1$, $z_2$, and $z_3$, then the mean vector of $Z$ is the $3 \times 1$ vector
$$E(Z) = \begin{pmatrix} E(z_1) \\ E(z_2) \\ E(z_3) \end{pmatrix} = \mu_Z = \begin{pmatrix} \mu_1 \\ \mu_2 \\ \mu_3 \end{pmatrix} \tag{3.15}$$
and the variance–covariance matrix is the $3 \times 3$ matrix
$$\operatorname{Var}(Z) = \begin{pmatrix} \operatorname{Var}(z_1) & \operatorname{Cov}(z_1, z_2) & \operatorname{Cov}(z_1, z_3) \\ \operatorname{Cov}(z_2, z_1) & \operatorname{Var}(z_2) & \operatorname{Cov}(z_2, z_3) \\ \operatorname{Cov}(z_3, z_1) & \operatorname{Cov}(z_3, z_2) & \operatorname{Var}(z_3) \end{pmatrix} = V_Z \tag{3.16}$$
$$= \begin{pmatrix} E[(z_1-\mu_1)^2] & E[(z_1-\mu_1)(z_2-\mu_2)] & E[(z_1-\mu_1)(z_3-\mu_3)] \\ E[(z_2-\mu_2)(z_1-\mu_1)] & E[(z_2-\mu_2)^2] & E[(z_2-\mu_2)(z_3-\mu_3)] \\ E[(z_3-\mu_3)(z_1-\mu_1)] & E[(z_3-\mu_3)(z_2-\mu_2)] & E[(z_3-\mu_3)^2] \end{pmatrix}$$
$$= E\{[Z - E(Z)][Z - E(Z)]'\}. \tag{3.17}$$
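As an illustration (not from the text), the following NumPy sketch estimates the mean vector $E(Z)$ and the variance–covariance matrix of equation (3.17) from simulated draws; the particular mean vector and covariance matrix are arbitrary choices for the demonstration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Arbitrary population mean vector and covariance matrix for a 3x1 Z
mu_z = np.array([1.0, 2.0, 3.0])
V_z = np.array([[2.0, 0.5, 0.3],
                [0.5, 1.0, 0.2],
                [0.3, 0.2, 1.5]])

# Simulate many independent realizations of Z (each row is one draw)
Z = rng.multivariate_normal(mu_z, V_z, size=200_000)

# Sample analogues of E(Z) and Var(Z) = E{[Z - E(Z)][Z - E(Z)]'}
print(Z.mean(axis=0))           # approximates mu_z
print(np.cov(Z, rowvar=False))  # approximates V_z
```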
Let $Z$ be an $n \times 1$ random vector with mean $\mu_Z$ and variance–covariance matrix $V_Z$. Let
$$A = \begin{pmatrix} a_1' \\ a_2' \\ \vdots \\ a_k' \end{pmatrix}$$
be a $k \times n$ matrix of constants. Consider the linear transformation $U = AZ$. That is, $U$ is a $k \times 1$ vector given by
$$U = \begin{pmatrix} a_1'Z \\ a_2'Z \\ \vdots \\ a_k'Z \end{pmatrix} = \begin{pmatrix} u_1 \\ u_2 \\ \vdots \\ u_k \end{pmatrix}. \tag{3.18}$$
Note that
$$\begin{aligned} E(u_i) &= E(a_i'Z) \\ &= E[a_{i1}z_1 + a_{i2}z_2 + \cdots + a_{in}z_n] \\ &= a_{i1}E(z_1) + a_{i2}E(z_2) + \cdots + a_{in}E(z_n) \\ &= a_i'\mu_Z, \end{aligned}$$
and hence
$$E[U] = \begin{pmatrix} E(u_1) \\ E(u_2) \\ \vdots \\ E(u_k) \end{pmatrix} = \begin{pmatrix} a_1'\mu_Z \\ a_2'\mu_Z \\ \vdots \\ a_k'\mu_Z \end{pmatrix} = A\mu_Z. \tag{3.19}$$
The $k \times k$ variance–covariance matrix for $U$ is given by
$$\operatorname{Var}(U) = V_U = E\{[U - E(U)][U - E(U)]'\}.$$
Substitution of $AZ$ for $U$ and factoring gives
$$\begin{aligned} V_U &= E\{[AZ - A\mu_Z][AZ - A\mu_Z]'\} \\ &= E\{A[Z - \mu_Z][Z - \mu_Z]'A'\} \\ &= A\,E\{[Z - \mu_Z][Z - \mu_Z]'\}A' \\ &= A[\operatorname{Var}(Z)]A' \\ &= AV_ZA'. \end{aligned} \tag{3.20}$$
The factoring of matrix products must be done carefully; remember that matrix multiplication is not commutative. Therefore, $A$ is factored to the left (from the first quantity in square brackets) and $A'$ to the right (from the transpose of the second quantity in square brackets). Remember that transposing a product reverses the order of multiplication: $(CD)' = D'C'$. Since $A$ is a matrix of constants, it can be factored outside the expectation operator. This leaves an inner matrix that, by definition, is $\operatorname{Var}(Z)$.
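A short simulation can make the result in equation (3.20) concrete. This sketch (with an arbitrarily chosen $\mu_Z$, $V_Z$, and $A$) checks numerically that $E(U) = A\mu_Z$ and $\operatorname{Var}(U) = AV_ZA'$:

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative (arbitrary) mean and covariance for a 3x1 Z
mu_z = np.array([1.0, 2.0, 3.0])
V_z = np.array([[2.0, 0.5, 0.3],
                [0.5, 1.0, 0.2],
                [0.3, 0.2, 1.5]])

# A 2x3 matrix of constants defining the linear transformation U = AZ
A = np.array([[1.0, -1.0, 0.0],
              [0.5,  0.5, 1.0]])

Z = rng.multivariate_normal(mu_z, V_z, size=200_000)
U = Z @ A.T  # each row is one realization of U = AZ

# Theoretical moments versus simulated moments
print(A @ mu_z, U.mean(axis=0))   # E(U) = A mu_z
print(A @ V_z @ A.T)              # Var(U) = A V_z A'
print(np.cov(U, rowvar=False))    # simulated Var(U)
```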
Note that, if $\operatorname{Var}(Z) = \sigma^2 I$, then
$$\begin{aligned} \operatorname{Var}(U) &= A[\sigma^2 I]A' \\ &= AA'\sigma^2. \end{aligned} \tag{3.21}$$
The $i$th diagonal element of $AA'$ is the sum of squares of the coefficients ($a_i'a_i$) of the $i$th linear function $u_i = a_i'Z$. This coefficient multiplied by $\sigma^2$ gives the variance of the $i$th linear function. The $(i,j)$th off-diagonal element is the sum of products of the coefficients ($a_i'a_j$) of the $i$th and $j$th linear functions and, when multiplied by $\sigma^2$, gives the covariance between the two linear functions $u_i = a_i'Z$ and $u_j = a_j'Z$.
Note that if $A$ is just a row vector $a'$, then $u = a'Z$ is a scalar linear function of $Z$. The variance of $u$ is expressed in terms of $\operatorname{Var}(Z)$ as
$$\sigma^2(u) = a'\operatorname{Var}(Z)a. \tag{3.22}$$
If $\operatorname{Var}(Z) = I\sigma^2$, then
$$\sigma^2(u) = a'(I\sigma^2)a = a'a\,\sigma^2. \tag{3.23}$$
Notice that $a'a = \sum a_i^2$ is the sum of squares of the coefficients of the linear function, which is the result given in Section 1.5.
Two examples illustrate the derivation of variances of linear functions using the preceding important results.
Example 3.5. Matrix notation is used to derive the familiar expectation and variance of a sample mean. Suppose $Y_1, Y_2, \ldots, Y_n$ are independent random variables with mean $\mu$ and variance $\sigma^2$. Then, for $Y = (Y_1\ Y_2\ \cdots\ Y_n)'$,
$$E(Y) = \begin{pmatrix} \mu \\ \mu \\ \vdots \\ \mu \end{pmatrix} = \mu\mathbf{1}$$
and $\operatorname{Var}(Y) = I\sigma^2$.

The mean of a sample of $n$ observations, $\overline{Y} = \sum Y_i/n$, is written in matrix notation as
$$\overline{Y} = \begin{pmatrix} \frac{1}{n} & \frac{1}{n} & \cdots & \frac{1}{n} \end{pmatrix} Y. \tag{3.24}$$
Thus, $\overline{Y}$ is a linear function of $Y$ with the vector of coefficients $a' = (\frac{1}{n}\ \frac{1}{n}\ \cdots\ \frac{1}{n})$. Then,
$$E(\overline{Y}) = a'E(Y) = a'\mathbf{1}\mu = \mu \tag{3.25}$$
and
$$\operatorname{Var}(\overline{Y}) = a'[\operatorname{Var}(Y)]a = a'(I\sigma^2)a = n\left(\frac{1}{n}\right)^2\sigma^2 = \frac{\sigma^2}{n}. \tag{3.26}$$
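A minimal sketch, with $n$, $\mu$, and $\sigma^2$ chosen arbitrarily, confirms equation (3.26) both by evaluating the quadratic form $a'(I\sigma^2)a$ and by Monte Carlo:

```python
import numpy as np

rng = np.random.default_rng(2)
n, mu, sigma2 = 10, 5.0, 4.0

# Coefficient vector a' = (1/n ... 1/n), so Ybar = a'Y
a = np.full(n, 1.0 / n)

# Var(Ybar) = a'(I sigma^2) a = sigma^2 / n, per equation (3.26)
print(a @ (np.eye(n) * sigma2) @ a)  # 0.4 = sigma^2 / n

# Monte Carlo check: variance of the sample mean across many samples
Y = rng.normal(mu, np.sqrt(sigma2), size=(100_000, n))
print(Y.mean(axis=1).var())
```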
Example 3.6. For the second example, consider two linear contrasts on a set of four treatment means with $n$ observations in each mean. The random vector in this case is the vector of the four treatment means. If the means have been computed from random samples from four populations with means $\mu_1$, $\mu_2$, $\mu_3$, and $\mu_4$ and equal variance $\sigma^2$, then the variance of each sample mean will be $\sigma^2/n$ (equation 3.26), and all covariances between the means will be zero. The mean of the vector of sample means $\overline{Y} = (\overline{Y}_1\ \overline{Y}_2\ \overline{Y}_3\ \overline{Y}_4)'$ is $\mu = (\mu_1\ \mu_2\ \mu_3\ \mu_4)'$. The variance–covariance matrix for the vector of means $\overline{Y}$ is $\operatorname{Var}(\overline{Y}) = I(\sigma^2/n)$. Assume that the two linear contrasts of interest are
$$c_1 = \overline{Y}_1 - \overline{Y}_2 \quad \text{and} \quad c_2 = \overline{Y}_1 - 2\overline{Y}_2 + \overline{Y}_3.$$
Notice that $\overline{Y}_4$ is not involved in these contrasts. The contrasts can be written as
$$C = A\overline{Y}, \tag{3.27}$$
where
$$C = \begin{pmatrix} c_1 \\ c_2 \end{pmatrix} \quad \text{and} \quad A = \begin{pmatrix} 1 & -1 & 0 & 0 \\ 1 & -2 & 1 & 0 \end{pmatrix}.$$
Then,
$$E(C) = A\,E(\overline{Y}) = A\mu = \begin{pmatrix} \mu_1 - \mu_2 \\ \mu_1 - 2\mu_2 + \mu_3 \end{pmatrix} \tag{3.28}$$
and
$$\begin{aligned} \operatorname{Var}(C) &= A[\operatorname{Var}(\overline{Y})]A' = A\left(I\,\frac{\sigma^2}{n}\right)A' \\ &= AA'\,\frac{\sigma^2}{n} \\ &= \begin{pmatrix} 2 & 3 \\ 3 & 6 \end{pmatrix}\frac{\sigma^2}{n}. \end{aligned} \tag{3.29}$$
Thus, the variance of $c_1$ is $2\sigma^2/n$, the variance of $c_2$ is $6\sigma^2/n$, and the covariance between the two contrasts is $3\sigma^2/n$.
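The multiplier matrix $AA'$ in equation (3.29) is easy to reproduce; this sketch uses the contrast matrix from Example 3.6:

```python
import numpy as np

# Contrast matrix from Example 3.6: rows give c1 = Ybar1 - Ybar2
# and c2 = Ybar1 - 2*Ybar2 + Ybar3 (Ybar4 has zero coefficient)
A = np.array([[1.0, -1.0, 0.0, 0.0],
              [1.0, -2.0, 1.0, 0.0]])

# Var(C) = AA' (sigma^2/n); AA' gives the multipliers of sigma^2/n
print(A @ A.T)
# [[2. 3.]
#  [3. 6.]]
```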
We now develop the multivariate normal distribution and present some properties of multivariate normal random vectors. We first define a multivariate normal random vector when the elements of the vector are mutually independent. We then extend the results to normal random vectors with a nonzero mean and a variance–covariance matrix that is not necessarily diagonal. Finally, we present a result for linear functions of normal random vectors.
Suppose $z_1, z_2, \ldots, z_n$ are independent normal random variables with mean zero and variance $\sigma^2$. Then, the random vector $Z = (z_1\ \cdots\ z_n)'$ is said to have a multivariate normal distribution with mean $\mathbf{0} = (0\ \cdots\ 0)'$ and variance–covariance matrix $V_Z = I\sigma^2$. This is denoted as
$$Z \sim N(\mathbf{0}, I\sigma^2).$$
The probability density function of $Z$ is given in equation (3.3) and can also be expressed as
$$(2\pi)^{-n/2}\,|I\sigma^2|^{-1/2}\,e^{-Z'(I\sigma^2)^{-1}Z/2}. \tag{3.30}$$
It is a general result that if $U$ is any linear function $U = AZ + b$, where $A$ is a $k \times n$ matrix of constants and $b$ is a $k \times 1$ vector of constants, then $U$ is itself normally distributed with mean $\mu_U = b$ (since $E(Z) = \mathbf{0}$) and variance–covariance matrix $\operatorname{Var}(U) = V_U = AA'\sigma^2$ (Searle, 1971). That is, the random vector $U$ has a multivariate normal distribution, denoted by
$$U \sim N(\mu_U, V_U). \tag{3.31}$$
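A simulation sketch (with an arbitrary $A$, $b$, and $\sigma^2$) illustrates this result: drawing $Z \sim N(\mathbf{0}, I\sigma^2)$ and forming $U = AZ + b$ yields sample moments that match $\mu_U = b$ and $V_U = AA'\sigma^2$.

```python
import numpy as np

rng = np.random.default_rng(3)
n, sigma2 = 4, 1.0

# Z ~ N(0, I sigma^2); then U = AZ + b should be N(b, AA' sigma^2)
A = np.array([[1.0, 0.0, 1.0, 0.0],
              [0.0, 2.0, 0.0, 1.0]])
b = np.array([1.0, -1.0])

Z = rng.normal(0.0, np.sqrt(sigma2), size=(200_000, n))
U = Z @ A.T + b  # each row is one realization of U

print(U.mean(axis=0))            # approximates mu_U = b
print(A @ A.T * sigma2)          # theoretical Var(U) = AA' sigma^2
print(np.cov(U, rowvar=False))   # simulated Var(U)
```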