Directory UMM :Journals:Journal_of_mathematics:EJQTDE:

(1)

ISSN1842-6298 (electronic), 1843-7265 (print) Volume5₍₂₀₁₀₎_{, 311 – 320}

IDENTIFIABILITY OF THE MULTIVARIATE

NORMAL BY THE MAXIMUM AND THE

MINIMUM

Arunava Mukherjea

Abstract_{. In this paper, we have discussed theoretical problems in statistics on identification of} parameters of a non-singular multi-variate normal when only either the distribution of the maximum or the distribution of the minimum is known.

1 Introduction

Let X1, X2, . . . , Xm, where Xi = (Xi1, Xi2, . . . , Xin) be m independent random n

-vectors each with an-variate non-singular density in some classF. LetY1, Y2, . . . , Yp

be another such independent family of randomn-vectors with each Yi having its n

-variate non-singular density inF. Suppose that

X0 = (X01, . . . , X0n), where X0j =max{Xij : 1≤i≤m},

and

Y₀= (Y₀1, . . . , Y0n), where Y0j =max{Yij : 1≤i≤p},

have the samen-dimensional distribution function. The problem is to determine ifm must equalp, and the distributions of_{X1, X2, . . . Xm}are simply a rearrangement

of those of_{Y₁, Y₂, . . . , Yp}.

This problem comes up naturally in the context of a supply-demand problem in econometrics, and as far as we know, was considered first in [2], where it was solved (in the affirmative ) for the class of univariate normal distributions, and also, for the class of bivariate normal distributions with positive correlations. In [18], it was solved in the affirmative for the class of bivariate normal distributions with positive or negative correlations. However, if such distributions with zero correlations are allowed, then unique factorization of product of such distributions no longer holds, and this can be verified by considering simply four univariate normal distributions

2010 Mathematics Subject Classification: 62H05; 62H10; 60E05.

(2)

F1(x), F2(x), F3(y), F4(y) and observing that for allx, y, we have the equality:

[F1(x)F3(y)][F2(x)F4(y)] = [F1(x)F4(y)][F2(x)F3(y)].

In [19], it was shown in the generaln-variate normal case, that when then-variate normal distributions of each Xi, 1≤i≤m, and each Yj, 1≤j ≤p, have positive

partial correlations (that is, the off-diagonal entries of the covariance matrix are all negative), then when Xo and Yo, as defined earlier, have the same distribution

function, we must have m =p, and the distributions of the Xi, 1 ≤i ≤m, must

be a permutation of those of the Yj, 1 ≤j ≤ m. In [17], this problem was solved

in the affirmative for the generaln-variate case when the covariance matrices of all then-variate normal distributions are of the form: Σij =ρσiσj fori6=j. As far as

we know, the general problem discussed above is still open.

The maximum problem above occurs in m-component systems where the com-ponents are connected in parallel. The system lifetime is then given by

max_{X₁, X₂, . . . , Xm},

and this is observable. There are instances where this maximum is observable, but the individualXis are not.

It may also be noted that if the distribution function of Yis F(x), and if X1

and X2 are two independent random variables each with distribution

p

F(x), then max_{X1, X2} andY have the same distribution function.

In [8], a corresponding minimum problem was discussed in the context of a prob-ability model describing the death of an individual from one of several competing causes. Let X1, X2, . . . , Xn be independent random variables with continuous

dis-tribution functions,Z=min_{X1, X2, . . . , Xi} andI =k iffZ =Xk. If theXi have

a common distribution function, then it is uniquely determined by the distribution function of Z. In [8], it was shown that when the distribution of the Xi are not

all the same, then the joint distribution function of the identified minimum (that is, that of (I, Z)) uniquely determines each Fi(x), the distribution function of Xi,

i= 1,2, . . . , n.

Since max_{X1, X2, . . . , Xn} =−min{−X1,−X2, . . . ,−Xn}, the distribution of

the identified maximum also uniquely determines the distribution of theXi. Notice

that if we considern= 2 and the case where the independent random variablesX₁ and X2 are both exponential such that their density functions are given by

f1(x) =λe−λx, x >0;

=0, otherwise,

and

f2(x) =µe−µx, x >0;

(3)

where λ and µ are both positive, then even though the joint distribution of their identified minimum (that is, that of (I, min_{X1, X2}) uniquely determines the

pa-rameters λ and µ, the min_{X1, X2}, by itself, does not identify uniquely the

pa-rameters λand µ. Thus, one natural question comes up: when does the minimum of an-variate random vector uniquely determine the parameters of the distribution of the random vector? In what follows, in the rest of this section, we discuss this problem.

As far as we know, the problem on identification of parameters of a random vector (X1, X2, . . . , Xn) by the distribution of the minimum (namely,min{Xi: 1≤i≤n})

is still unsolved even in the case of an-variate normal vector. In the bivariate normal case, the problem was solved in the affirmative (in the natural sense) in [5] and [16] independently. The problem was considered also in the tri-variate normal case in [5] in the context of an identified minimum, and solved partially. The general minimum problem in then-variate normal case, in the case of a common correlation, was solved in [9], and in the case of the tri-variate normal with negative correlations was solved in [12].

In the next section, we present asymptotic orders of certain tail probabilities for a multi-variate normal random vector that are useful in the context of the problems mentioned above. The purpose of this note here is to present some essential results which are useful in solving the identified minimum problem in the general tri-variate normal case.

2 Tail probabilities of a multivariate normal

Let (X1, X2, . . . , Xn) be an-variate normal random vector with a symmetric positive

definite covariance matrix Σ such that the vector 1Σ−1 _{= (}_α

1, α2, . . . , αn), where

1 = (1,1, . . . ,1), is positive (that is, each αi is positive). Then it was proven in [9]

that ast_{→ ∞}, the tail probability

P(X1 > t, X2 > t, . . . , Xn> t)

is of the same (asymptotic) orders as that of

C exp(₋1 2t

2_[1Σ−1₁T_])_,

where

1

c = (2π)

n

2p|detΣ|(α

1α2. . . αn)tn.

(4)

Lemma 1. _Let₍_X₁_{, X}₂₎_{be a bivariate normal with zero means and variances}_σ2

σ1 for the general bivariate normal (with zero means, for simplicity) considered in Lemma 1 In this case, we no longer have 1Σ−1>0, and thus, we need to use another idea. We can write

σ1, fortsufficiently large (for a pre-assigned positive δ), we can write: P(X₁ > t, X₂ > t)>(1₋δ)

σ1(<1) for the bivariate normal considered above, we have, similarly, for any ǫ >0 and all sufficiently larget,

(5)

Lemma 2. _Let ₍_X₁_{, X}₂₎ _{be a bivariate normal with zero means, variances each} ₁_,

Both Lemmas2and3follow from Lemma1and equation (2.1) above. In Lemma

(6)

Lemma 5. _Let ₍_X₁_{, X}₂₎ _{be as in Lemma} ₂_{. Let} _{α <}₀_{, β >}₀_,₋_{α > β. Then}

P(X₁ > αt, X₂ > βt)_∼ √ 1 2π βtexp

−1₂β2t2

as t_{→ ∞}

Proof. Let us write:

P(X1 > αt, X2 > βt)

=P(X2 > βt)−P(−X1 >(−α)t), X2 > βt)

Lett_{→ ∞}. Since the correlation of (₋X₁, X₂) is₋ρ , it follows from Lemma2that when ₋ρ <₋β_α,

P(₋X1>(−α)t, X2 > βt) =o

exp

−1₂β2t2

.

Also, it follows from Lemma3and inequalities in (2.3) that when₋ρ_≥ β_α, ast_{→ ∞},

P(₋X1 >(−α)t, X2> βt)

∼C(t)_√ 1

2π_|αt_|exp

−1

2α

2_t2

=o

exp

−1₂β2t2

.

where 1₂ _≤C(t)_≤1. The lemma now follows easily.

Lemma 6. _Let ₍_X₁_{, X}₂₎ _{be as in Lemma} ₂_{. Let} _{α <}₀_{. Then as} _t_{→ ∞}_,

P(X1> αt, X2>(−α)t)∼

1 √

2π_|α_|texp

−1₂α2t2

Proof. It is enough to observe that

P(X1 > αt, X2>(−α)t)

=P(X2 >(−α)t)−P(−X1 >(−α)t, X2>(−α)t)

(7)

3 The pdf of the identified minimum

joint distribution of (Y, I) such that

F(y,1) =P(Y _≤y, I= 1), Now by differentiating with respect to y, we obtain:

f1(y)

where (W21, W31) is a bivariate normal with zero means, variances each one, and

correlationρ23.1 = √ ρ23−ρ12ρ13 1−ρ122√1−ρ132

, andϕis the standard normal density. Similarly,

(8)

f3(y) = 1

σ3

ϕ

y σ3

P



W13≥

1₋ρ13

σ1

σ3

σ1

p

1₋ρ132

y, W23≥

1₋ρ23

σ2

σ3

σ2

p

1₋ρ232

y



 (3.3)

where (W12, W32) and (W13, W23) are both bivariate normals with zero means and

variances all ones, and correlations given, respectively, by ρ13.2 = √ ρ13−ρ12ρ23 1−ρ122

√

1−ρ232 and ρ12.3 = √ ρ12−ρ13ρ23

1−ρ132

√

1−ρ232

. Now the problem of identified minimum is the

follow-ing: we will assume that the functionsf1(y), f2(y) andf3(y) are given, and we need

to prove that there can be only a unique set of parameters σ₁2, σ2₂, σ2₃, ρ12, ρ23, ρ13

that can lead to the same three given functionsf1, f2 and f3.

The proof, besides other arguments, uses mainly the lemmas given in section2. The proof is rather involved and will appear elsewhere in full.

References

[1] T. Amemiya, A note on a Fair and Jaffee model, Econometrica 42 _(1974), 759-762.

[2] T. W. Anderson and S. G. Ghurye, Identification of parameters by the distri-bution of a maximum random variable, J. of Royal Stat. Society 39B _(1977), 337-342. MR0488416 (58 7960).Zbl 0378.62020.

[3] T. W. Adnerson and S. G. Ghurye,Unique factorization of products of bivariate normal cumulative distribution functions, Annals of the Institute of Statistical Math. 30_{(1978), 63-69.} _MR0507082_._{Zbl 0444.62020}_.

[4] A. P. Basu, Identifiability problems in the theory of competing and comple-mentary risks: a survey in Statistical Distributions in Scientific Work, vol. 5_, (C.Taillie, et. al, eds.), D. Reidel Publishing Co., Dordrecht, Holland, 1981, 335-347. MR0656347.Zbl 0472.62097.

[5] A. P. Basu and J. K. Ghosh,Identifiability of the multinormal and other distri-butions under competing risks model, J. Multivariate Analysis8_{(1978), 413-429.}

MR0512611.Zbl 0396.62032.

[6] A. P. Basu and J. K. Ghosh, Identifiablility of distributions under competing risks and complementary risks model, Commun. StatisticsA9_{(14) (1980),} 1515-1525. MR0583613.Zbl 0454.62086.

(9)

[8] S. M. Berman, Note on extreme values, competing risks and semi-Markov pro-cesses, Ann. Math. Statist.34_{(1963), 1104-1106.}_MR0152018_._{Zbl 0203.21702}_.

[9] M. Dai and A.Mukherjea, Identification of the parameters of a multivariate normal vector by the distribution of the maximum, J. Theoret. Probability 14 (2001), 267-298.MR1822905.Zbl 1011.62050.

[10] H. A. David,Estimation of means of normal population from observed minima, Biometrika 44 _{(1957), 283-286.}

[11] H. A. David and M. L. Moeschberger,The Theory of Competing Risks, Griffin, London, 1978. MR2326244.Zbl 0434.62076.

[12] J. Davis and A. Mukherjea, Identification of parameters by the distribution of the minimum, J. Multivariate Analysis 9 _{(2007), 1141-1159.} _MR0592960_. _Zbl

1119.60008.

[13] M. Elnaggar and A. Mukherjea, Identification of parameters of a tri-variate normal vector by the distribution of the minimum, J. Statistical Planning and Inference 78_{(1999), 23-37.}_MR1705540_._{Zbl 0928.62039}_.

[14] R. C. Fair and H. H. Kelejian, Methods of estimation for markets in disequi-librium: a further study, Econometrica 42 _{(1974), 177-190.} _MR0433788_. _Zbl

0284.90011.

[15] F.M. Fisher, The Identification Problem in Econometrics, McGraw Hill, New York, 1996.

[16] D. C. Gilliand and J. Hannan, Identification of the ordered bi-variate normal distribution by minimum variate, J. Amer. Statist. Assoc. 75_{(371) (1980),} 651-654. MR0590696.Zbl 0455.62089.

[17] J. Gong and A. Mukherjea, Solution of the problem on the identification of parameters by the distribution of the maximum random variable: A multivariate normal case, J. Theoretical Probability4_{(4) (1991), 783-790.} _MR1132138_._Zbl

0743.60021.

[18] A. Mukherjea, A. Nakassis and J. Miyashita,Identification of parameters by the distribution of the maximum random variable: The Anderson-Ghuiye theorem, J. Multivariate Analysis 18_{(1986), 178-186.}_MR0832994_._{Zbl 0589.60013}_.

(10)

[20] A. Mukherjea and R. Stephens, The problem of identification of parameters by the distribution of the maximum random variable: solution for the tri-variate normal case, J. Multivariate Anal. 34_{(1990), 95-115.} _MR1062550_._Zbl

0699.62009.

[21] A. Nadas, On estimating the distribution of a random vector when only the smallest coordinate is observable, Technometrics 12 _{(4) (1970), 923-924.} _Zbl

0209.50004.

[22] A. A. Tsiatis, A non-identifiability aspect of the problem of computing risks, Proc. Natl. Acad.Sci. (USA)72 _{(1975), 20-22.}_MR0356425_._{Zbl 0299.62066}_.

[23] A. A. Tsiatis,An example of non-identifiability in computing risks, Scand. Ac-tuarial Journal 1978_{(1978), 235-239.}_{Zbl 0396.62084}_.

Arunava Mukherjea

Department of Mathematics,

The University of Texas-Pan American,