QUASI-EMPIRICAL BAYES MODELING OF MEASUREMENT ERROR MODELS AND R-ESTIMATION OF THE REGRESSION PARAMETERS

A. K. Md. Ehsanes Saleh
Carleton University, Ottawa, Canada
Email: [email protected]
Summary

This paper deals with the R-estimation of the regression parameters of a measurement error model: $y_i = \beta_0 + \beta_1 x_i + e_i$ and $x_i^0 = x_i + u_i$, $i = 1, \dots, n$. By combining the two sets of information, an immaculate regression model is obtained using "quasi-empirical Bayes" estimates of the unknown covariates $x_1, \dots, x_n$. The model produces consistent estimates of the attenuated slope and intercept parameters and applies to a broad range of regression problems. Asymptotic properties of the R-estimators are provided based on the "quasi-Bayes regression model". Some simulation results are presented as evidence of the performance of the estimators.
Keywords and phrases: ME model; asymptotic relative efficiency; rank estimators; quasi-empirical Bayes model; Theil-Sen estimator; asymptotic normality.

AMS 2000 Subject Classification: 62G05, 62G20, 62G35.
1 Introduction
Consider the simple linear measurement errors (ME) model
$$y_i = \beta_0 + \beta_1 x_i + e_i, \qquad x_i^0 = x_i + u_i, \qquad i = 1, \dots, n, \tag{1.1}$$
where $e_i$ is the response error and $u_i$ is the measurement error in the regressor variable $x_i$, which is unobservable, while $x_i^0$ is the corresponding observed value. Our problem is the R-estimation of the intercept and slope parameters $\beta = (\beta_0, \beta_1)^T$ in the model (1.1).
The commonly used methods of estimating $(\beta_0, \beta_1)^T$ are the least squares and maximum likelihood methods. For robust alternatives, one follows the R-, L- and M-estimation methods. The literature is largely void of these estimation methods for measurement error models.
Some attempts have been made by Zamar (1989) and Cheng and Tsai (1995) to study M-estimators by robustifying the variance and covariance matrices following Huber's (1981)
technique. Recently, Jurečková, Picek and Saleh (2008) initiated testing of hypothesis problems with rank statistics in measurement error models. In this paper we attempt to develop the theory of R-estimation of $(\beta_0, \beta_1)^T$ for the measurement error model (1.1), in the spirit of the theoretical works of Hodges and Lehmann (1963), Adichie (1967), Jaeckel (1972), Saleh and Sen (1978, 1987), Sen (1968), and Jurečková and Sen (1996), among others. The least squares estimators for the measurement error model are recalled in Section 2, and the "quasi-empirical Bayes" regression model is proposed in Section 3. R-estimation of the slope is developed in Section 4, and the joint R-estimation of the slope and the intercept in Section 5, along with expressions for the joint asymptotic relative efficiency (JARE) of the LSE as well as the R-estimators for the ME model.
2 Least Squares Estimation
First we consider the traditional regression model
$$\mathbf{y} = \beta_0 \mathbf{1}_n + \beta_1 \mathbf{x} + \mathbf{e}, \tag{2.1}$$
where $\mathbf{y} = (y_1, \dots, y_n)'$, $\mathbf{x} = (x_1, \dots, x_n)'$ and $\mathbf{e} = (e_1, \dots, e_n)'$. In this case it is well known that the LSE of $\beta = (\beta_0, \beta_1)'$ is given by
$$\beta_n^{*(L)} = \begin{pmatrix} \beta_{0n}^{*(L)} \\ \beta_{1n}^{*(L)} \end{pmatrix} = \begin{pmatrix} \bar{y}_n - \beta_{1n}^{*(L)} \bar{x}_n \\[4pt] \dfrac{(\mathbf{x} - \bar{x}_n \mathbf{1}_n)'(\mathbf{y} - \bar{y}_n \mathbf{1}_n)}{(\mathbf{x} - \bar{x}_n \mathbf{1}_n)'(\mathbf{x} - \bar{x}_n \mathbf{1}_n)} \end{pmatrix}, \tag{2.2}$$
where $\bar{x}_n = n^{-1}\mathbf{1}_n'\mathbf{x}$ and $\bar{y}_n = n^{-1}\mathbf{1}_n'\mathbf{y}$. We impose the following assumptions:

(i) $\lim_{n\to\infty} \bar{x}_n = \mu_x$ and $\lim_{n\to\infty} \max_{1\le i\le n} \dfrac{(x_i - \bar{x}_n)^2}{(\mathbf{x} - \bar{x}_n\mathbf{1}_n)'(\mathbf{x} - \bar{x}_n\mathbf{1}_n)} = 0$; (2.3)

(ii) the Fisher information $I(f) = \displaystyle\int_{-\infty}^{\infty} \left(-\frac{f'(x)}{f(x)}\right)^2 f(x)\,dx < \infty$.
Then the asymptotic normality of $\sqrt{n}(\beta_n^{*(L)} - \beta)$ follows easily:
$$\begin{pmatrix} \sqrt{n}(\beta_{0n}^{*(L)} - \beta_0) \\ \sqrt{n}(\beta_{1n}^{*(L)} - \beta_1) \end{pmatrix} \sim \mathcal{N}_2\left( \begin{pmatrix} 0 \\ 0 \end{pmatrix},\ \sigma_e^2 \begin{pmatrix} 1 + \frac{\mu_x^2}{\sigma_x^2} & -\frac{\mu_x}{\sigma_x^2} \\[4pt] -\frac{\mu_x}{\sigma_x^2} & \frac{1}{\sigma_x^2} \end{pmatrix} \right), \tag{2.4}$$
where $S_{nxx} = \frac{1}{n}(\mathbf{x} - \bar{x}_n\mathbf{1}_n)'(\mathbf{x} - \bar{x}_n\mathbf{1}_n) \to \sigma_x^2$ as $n \to \infty$.
Now, consider the measurement error model (1.1), given by
$$\mathbf{y} = [\mathbf{1}_n, \mathbf{x}]\beta + \mathbf{e}, \qquad \mathbf{x}^0 = \mathbf{x} + \mathbf{u}, \tag{2.5}$$
which may be written as
$$\mathbf{y} = [\mathbf{1}_n, \mathbf{x}^0]\beta + \mathbf{w}, \tag{2.6}$$
where $\mathbf{y} = (y_1, \dots, y_n)'$, $\mathbf{x}^0 = (x_1^0, \dots, x_n^0)'$, $\mathbf{w} = \mathbf{e} - \beta_1\mathbf{u}$, $\mathbf{u} = (u_1, \dots, u_n)'$ and
$$[\mathbf{1}_n, \mathbf{x}^0] = \begin{pmatrix} 1 & x_1^0 \\ \vdots & \vdots \\ 1 & x_n^0 \end{pmatrix}_{n \times 2}. \tag{2.7}$$
Following Fuller (1987), we minimize
$$\left(\mathbf{y} - [\mathbf{1}_n, \mathbf{x}^0]\beta\right)'\left(\mathbf{y} - [\mathbf{1}_n, \mathbf{x}^0]\beta\right) \tag{2.8}$$
with respect to $\beta$ to obtain the least squares estimator (LSE) of $\beta$ as
$$\tilde{\nu}_n^{(L)} = \left( \begin{pmatrix} \mathbf{1}_n' \\ \mathbf{x}^{0\prime} \end{pmatrix} [\mathbf{1}_n, \mathbf{x}^0] \right)^{-1} \begin{pmatrix} \mathbf{1}_n' \\ \mathbf{x}^{0\prime} \end{pmatrix} \mathbf{y} = \begin{pmatrix} \tilde{\nu}_{0n}^{(L)} \\ \tilde{\nu}_{1n}^{(L)} \end{pmatrix}. \tag{2.9}$$
We observe that
$$\tilde{\nu}_n^{(L)} = \beta + \begin{pmatrix} 1 + \dfrac{n\bar{x}_n^{0\,2}}{S_{xx} + S_{uu}} & -\dfrac{n\bar{x}_n^{0}}{S_{xx} + S_{uu}} \\[6pt] -\dfrac{n\bar{x}_n^{0}}{S_{xx} + S_{uu}} & \dfrac{n}{S_{xx} + S_{uu}} \end{pmatrix} \begin{pmatrix} \bar{e}_n - \beta_1\bar{u}_n \\[4pt] \frac{1}{n}S_{x^0 e} - \beta_1\frac{1}{n}S_{x^0 u} + \bar{x}_n^0\bar{e}_n - \beta_1\bar{x}_n^0\bar{u}_n \end{pmatrix}, \tag{2.10}$$
where $S_{ab} = (\mathbf{a} - \bar{a}\mathbf{1}_n)'(\mathbf{b} - \bar{b}\mathbf{1}_n)$, $\mathbf{a} = (a_1, \dots, a_n)'$ and $\mathbf{b} = (b_1, \dots, b_n)'$. We make the following assumptions to obtain the relevant properties of $\tilde{\nu}_n^{(L)}$.
(A1) $\mathbf{e}$ and $\mathbf{u}$ are stochastically independent vectors such that $E(\mathbf{e}, \mathbf{u}) = (\mathbf{0}, \mathbf{0})$.

(A2) $\lim_{n\to\infty} \bar{x}_n = \mu_x \in \mathbb{R}^1$ and $\lim_{n\to\infty} \max_{1\le i\le n} \dfrac{(x_i - \bar{x}_n)^2}{(\mathbf{x} - \bar{x}_n\mathbf{1}_n)'(\mathbf{x} - \bar{x}_n\mathbf{1}_n)} = 0$.

(A3) The observed vector $\mathbf{x}^0$ is the vector $\mathbf{x}$ adjusted by an error vector $\mathbf{u}$ with a known distribution function, say $G_u$, assumed to possess moments up to the fourth order, and $\operatorname{plim}_{n\to\infty} S_{nuu} = \sigma_u^2 > 0$, where
$$S_{nuu} = \frac{1}{n}(\mathbf{u} - \bar{u}_n\mathbf{1}_n)'(\mathbf{u} - \bar{u}_n\mathbf{1}_n), \qquad \bar{u}_n = \frac{1}{n}\mathbf{u}'\mathbf{1}_n,$$
and $\operatorname{plim}\left\{\frac{1}{n}\sum_{i=1}^n (u_i - \bar{u}_n)^4\right\} = \mu_4 < \infty$. We assume $\sigma_u^2$ is known, as an identifiability condition for estimation.

(A4) $\mathbf{x}$ and $\mathbf{u}$ are asymptotically uncorrelated, i.e.,
$$\operatorname{plim}_{n\to\infty} S_{nxu} = \operatorname{plim}_{n\to\infty} \frac{1}{n}(\mathbf{x} - \bar{x}_n\mathbf{1}_n)'(\mathbf{u} - \bar{u}_n\mathbf{1}_n) = 0.$$
Then
$$\operatorname{plim}_{n\to\infty} S_{nx^0x^0} = \operatorname{plim}_{n\to\infty} \frac{1}{n}(\mathbf{x}^0 - \bar{x}_n^0\mathbf{1}_n)'(\mathbf{x}^0 - \bar{x}_n^0\mathbf{1}_n) = \sigma_x^2 + \sigma_u^2, \quad \text{and also} \quad \operatorname{plim}_{n\to\infty} \frac{1}{n}\mathbf{x}^{0\prime}\mathbf{1}_n = \mu_x. \tag{2.11}$$
Using the above assumptions, we have
$$\tilde{\nu}_n^{(L)} = \beta - \beta_1 \frac{S_{nuu}}{S_{nxx} + S_{nuu}} \begin{pmatrix} -\mu_x \\ 1 \end{pmatrix} + o_p(1), \tag{2.12}$$
where $\operatorname{plim}_{n\to\infty} \frac{S_{nxx}}{S_{nxx} + S_{nuu}} = \operatorname{plim}_{n\to\infty} \kappa_n = \kappa_x = \frac{\sigma_x^2}{\sigma_x^2 + \sigma_u^2}$. Observe that
$$\tilde{\nu}_n^{(L)} = \beta - \beta_1(1 - \kappa_x) \begin{pmatrix} -\mu_x \\ 1 \end{pmatrix} + o_p(1), \tag{2.13}$$
where $\kappa_x$ is the reliability ratio (RR). Hence,
$$\operatorname{plim}_{n\to\infty} \tilde{\nu}_n^{(L)} = \beta - \beta_1(1 - \kappa_x)\begin{pmatrix} -\mu_x \\ 1 \end{pmatrix} = \begin{pmatrix} \beta_0 + \beta_1(1 - \kappa_x)\mu_x \\ \kappa_x\beta_1 \end{pmatrix} = \begin{pmatrix} \nu_0 \\ \nu_1 \end{pmatrix} = \nu, \tag{2.14}$$
so $\tilde{\nu}_n^{(L)}$ estimates $\nu$ consistently but not $\beta$. Hence, a consistent estimator of $\beta$ follows by writing
$$\tilde{\beta}_{1n}^{(L)} = \hat{\kappa}_x^{-1}\tilde{\nu}_{1n}^{(L)} \quad \text{and} \quad \tilde{\beta}_{0n}^{(L)} = \bar{y}_n - \tilde{\beta}_{1n}^{(L)}\bar{x}_n^0, \tag{2.15}$$
where a consistent estimator of $\kappa_x$, if $\kappa_x$ is unknown, is given by
$$\hat{\kappa}_x = 1 - S_{nuu}(S_{nxx} + S_{nuu})^{-1}. \tag{2.16}$$
As for the asymptotic normality of $\tilde{\beta}_n^{(L)} = (\tilde{\beta}_{0n}^{(L)}, \tilde{\beta}_{1n}^{(L)})'$, we have the following result from Fuller (1987) and Schneeweiss (1976):
$$\begin{pmatrix} \sqrt{n}(\tilde{\beta}_{0n}^{(L)} - \beta_0) \\ \sqrt{n}(\tilde{\beta}_{1n}^{(L)} - \beta_1) \end{pmatrix} \sim \mathcal{N}_2\left( \begin{pmatrix} 0 \\ 0 \end{pmatrix},\ (\sigma_e^2 + \kappa_x\beta_1^2\sigma_u^2) \begin{pmatrix} 1 + \mu_x^2 G & -\mu_x G \\ -\mu_x G & G \end{pmatrix} \right) \tag{2.17}$$
as $n \to \infty$, where
$$G = \frac{1}{\kappa_x\sigma_x^2} + \frac{\beta_1^2 E[u^4 - 2\sigma_u^4]}{(\sigma_e^2 + \kappa_x\beta_1^2\sigma_u^2)\sigma_x^4}.$$
If there is no measurement error, we recover the asymptotic distribution (2.4) of the LSE of the standard model. Using (2.4) and (2.17), and remembering that $0 \le \kappa_x \le 1$ and writing $\Delta^2 = \sigma_e^{-2}\kappa_x\beta_1^2\sigma_u^2$, the joint asymptotic relative efficiency (JARE) is given by
$$\mathrm{JARE}\left(\tilde{\beta}_n^{(L)};\ \beta_n^{*(L)}\right) = \left[(1 + \Delta^2)G\right]^{-1}.$$
If there is no measurement error, the JARE reduces to unity.
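To make the correction in (2.15)-(2.16) concrete, here is a minimal Python sketch (ours, not from the paper) that contrasts the naive LSE on the observed covariates with the attenuation-corrected estimator. It assumes the measurement error variance $\sigma_u^2$ is known (assumption (A3)), and the observable $S_{nx^0x^0}$ stands in for $S_{nxx} + S_{nuu}$; all parameter values are illustrative.

```python
import numpy as np

rng = np.random.default_rng(42)

# Illustrative parameters (not from the paper's simulation study)
n, beta0, beta1, sigma_u2 = 500, 1.0, 3.0, 0.5

x = rng.uniform(-2, 10, n)                 # true (unobserved) covariates
u = rng.normal(0, np.sqrt(sigma_u2), n)    # measurement errors, variance known
e = rng.normal(0, 1, n)                    # response errors
x0 = x + u                                 # observed covariates
y = beta0 + beta1 * x + e

# Naive LSE on (x0, y): consistent for nu1 = kappa_x * beta1, not beta1 (eq. 2.14)
S_x0x0 = np.mean((x0 - x0.mean())**2)
S_x0y = np.mean((x0 - x0.mean()) * (y - y.mean()))
nu1_hat = S_x0y / S_x0x0

# Consistency correction via the estimated reliability ratio (eqs. 2.15-2.16):
# kappa_hat = 1 - S_nuu / (S_nxx + S_nuu); since S_nxx + S_nuu converges to the
# same limit as S_nx0x0, we use the observable S_x0x0 with the known sigma_u2.
kappa_hat = 1 - sigma_u2 / S_x0x0
beta1_hat = nu1_hat / kappa_hat
beta0_hat = y.mean() - beta1_hat * x0.mean()

print(f"attenuated slope {nu1_hat:.3f}, corrected slope {beta1_hat:.3f}")
```

With these values the attenuated slope settles near $\kappa_x\beta_1$ while the corrected slope settles near $\beta_1 = 3$.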
3 Quasi-Empirical Bayes Regression Models
In this section, we develop an immaculate regression model using the component information of the ME model. Let us consider the general case of the ME model under non-normal errors,
$$y_i = \beta_0 + \beta_1 x_i + e_i, \qquad x_i^0 = x_i + u_i, \qquad i = 1, \dots, n, \tag{3.1}$$
where $e_1, \dots, e_n$ are i.i.d. random variables with a symmetric pdf having finite Fisher information $I(f_0)$,
$$f_e(e) = f_0(y - \beta_0 - \beta_1 x), \tag{3.2}$$
such that
$$E(y_i) = \beta_0 + \beta_1 x_i, \quad i = 1, \dots, n, \quad \text{and} \quad \operatorname{Var}(y_i) = \sigma_e^2. \tag{3.3}$$
We assume that
(i) $x_1^0, x_2^0, \dots, x_n^0$ are independently distributed with the symmetric pdf
$$f_{x^0}(x_i^0) = \frac{1}{\lambda_u} f_1\!\left(\frac{x_i^0 - x_i}{\lambda_u}\right) = \frac{1}{\lambda_u} f_1\!\left(\frac{u_i}{\lambda_u}\right) \tag{3.4}$$
with $E(x_i^0 \mid x_i) = x_i$ and the known variance $\lambda_u^2\sigma_u^2$;

(ii) the unknown covariates $x_1, \dots, x_n$ are independently distributed with the symmetric pdf
$$f_x(x_i) = \frac{1}{\lambda_x} f_2\!\left(\frac{x_i - \mu_x}{\lambda_x}\right)$$
with $E(x_i) = \mu_x$ and variance $\lambda_x^2\sigma_x^2$;

(iii) $S_{nxu} = \frac{1}{n}\sum_{i=1}^n (x_i - \bar{x}_n)(u_i - \bar{u}_n) \xrightarrow{P} 0$.

Further, we have

(a) $S_{nuu} = \frac{1}{n}\sum_{i=1}^n (u_i - \bar{u}_n)^2 \xrightarrow{P} \lambda_u^2\sigma_u^2$,

(b) $S_{nxx} = \frac{1}{n}\sum_{i=1}^n (x_i - \bar{x}_n)^2 \xrightarrow{P} \lambda_x^2\sigma_x^2$, and

(c) $S_{nx^0x^0} = \frac{1}{n}\sum_{i=1}^n (x_i^0 - \bar{x}_n^0)^2 \xrightarrow{P} \lambda_x^2\sigma_x^2 + \lambda_u^2\sigma_u^2 = \sigma_{x^0}^2$ (say).
Firstly, we note that the model (2.6) has the inherent problem that $x_i^0$ is not uncorrelated with $e_i - \beta_1 u_i$. As a result, the LSE $\tilde{\nu}_{1n}^{(L)}$ is not consistent for $\beta_1$, though it is consistent for the attenuated slope $\nu_1 = \kappa_x\beta_1$. Now we consider the model (where the slope $b$ has to be determined)
$$y_i = \beta_0 + b x_i^0 + z_i, \qquad i = 1, \dots, n, \tag{3.5}$$
so that for some $b$ we have
$$\operatorname{plim}_{n\to\infty}\left\{\frac{1}{n}\sum_{i=1}^n (x_i^0 - \bar{x}_n^0)(z_i - \bar{z}_n)\right\} = 0. \tag{3.6}$$
Accordingly, for some $b$, the expression in braces in (3.6) may be written as
$$(\beta_1 - b)\frac{1}{n}\sum_{i=1}^n (x_i - \bar{x}_n)^2 - b\frac{1}{n}\sum_{i=1}^n (u_i - \bar{u}_n)^2 + (\beta_1 - 2b)\frac{1}{n}\sum_{i=1}^n (x_i - \bar{x}_n)(u_i - \bar{u}_n)$$
$$\qquad + \frac{1}{n}\sum_{i=1}^n (x_i - \bar{x}_n)(e_i - \bar{e}_n) + \frac{1}{n}\sum_{i=1}^n (u_i - \bar{u}_n)(e_i - \bar{e}_n). \tag{3.7}$$
Under the assumptions (A3)-(A4) and (i)-(iii) above, as $n \to \infty$ this reduces to
$$(\beta_1 - b)\lambda_x^2\sigma_x^2 - b\lambda_u^2\sigma_u^2 = 0. \tag{3.8}$$
Solving for $b$, we obtain $b = \kappa_x\beta_1$, where $\kappa_x = \lambda_x^2\sigma_x^2/(\lambda_x^2\sigma_x^2 + \lambda_u^2\sigma_u^2)$. Thus, the immaculate regression model representing the ME model may be written as
$$y_i = \nu_0 + \nu_1 x_i^0 + z_i, \qquad i = 1, \dots, n, \tag{3.9}$$
with
$$\nu_0 = \beta_0 + \beta_1(1 - \kappa_x)\mu_x, \qquad \nu_1 = \kappa_x\beta_1, \qquad z_i = e_i + \beta_1\{(1 - \kappa_x)(x_i - \mu_x) - \kappa_x u_i\}.$$
Note that the model (3.9) may also be obtained by a Bayesian approach, as discussed by Whittemore (1989). For the general error distributions given by (3.2) and (3.4), we follow the "quasi-empirical Bayes" method (see Saleh (2006), Ch. 4) for estimating the unknown $x_i$'s in (1.1). Following this method, we obtain the "Stein-type estimates" of the unknown covariates $x_i$ as
$$x_i^s = (n - 3)\bar{x}_n^0 L_n^{-1} + x_i^0\left(1 - (n - 3)L_n^{-1}\right), \tag{3.10}$$
where $L_n = nS_{nx^0x^0}/\lambda_u^2\sigma_u^2$ is the test statistic for testing the null hypothesis
$$H_0: x_i = \mu_x \ \forall\, i \quad \text{vs.} \quad H_1: x_i \ne \mu_x \text{ for at least one } i \tag{3.11}$$
under (3.4). We may rewrite $x_i^s$ as
$$x_i^s = (1 - \hat{\kappa}_x)\bar{x}_n^0 + \hat{\kappa}_x x_i^0, \qquad i = 1, \dots, n, \tag{3.12}$$
where $\hat{\kappa}_x = 1 - (n - 3)\lambda_u^2\sigma_u^2/(nS_{nx^0x^0})$ is a consistent estimator of $\kappa_x$. Further, from the facts that (i) $\bar{x}_n^0 \xrightarrow{P} \mu_x$ and (ii) $S_{nx^0x^0} \xrightarrow{P} \lambda_x^2\sigma_x^2 + \lambda_u^2\sigma_u^2$, it follows that $\operatorname{plim}\hat{\kappa}_x = \kappa_x = \lambda_x^2\sigma_x^2/(\lambda_x^2\sigma_x^2 + \lambda_u^2\sigma_u^2)$ (the reliability ratio), so that we may write the parametric version of (3.12) as
$$E(x_i \mid x_i^0) \equiv (1 - \kappa_x)\mu_x + \kappa_x x_i^0, \qquad i = 1, \dots, n. \tag{3.13}$$
The expression (3.13) allows us to write the "quasi-empirical Bayes regression model" corresponding to the ME model (1.1) as
$$y_i = \beta_0 + \beta_1(1 - \kappa_x)\mu_x + \kappa_x\beta_1 x_i^0 + z_i, \tag{3.14}$$
with $z_i = e_i + \beta_1\{(1 - \kappa_x)(x_i - \mu_x) - \kappa_x u_i\}$, $i = 1, \dots, n$, as in (3.9). This model is the conditional expectation of $y_i$ given $x_i^0$ under normal errors.
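The shrinkage step in (3.10)-(3.12) is easy to state in code. The following Python sketch (an illustration under assumed values, not the paper's implementation) computes $L_n$, $\hat{\kappa}_x$ and the Stein-type covariate estimates $x_i^s$ from the observed $x_i^0$ with known error variance $\lambda_u^2\sigma_u^2$:

```python
import numpy as np

def stein_covariates(x0, lam_u2_sigma_u2):
    """Stein-type estimates x_i^s = (1 - kappa_hat)*mean(x0) + kappa_hat*x0_i,
    per eqs. (3.10)-(3.12), with known measurement-error variance."""
    n = len(x0)
    S_nx0x0 = np.mean((x0 - x0.mean())**2)       # (1/n) * sum (x0_i - xbar0)^2
    L_n = n * S_nx0x0 / lam_u2_sigma_u2          # test statistic for H0: x_i = mu_x
    kappa_hat = 1 - (n - 3) * lam_u2_sigma_u2 / (n * S_nx0x0)
    xs = (1 - kappa_hat) * x0.mean() + kappa_hat * x0
    return xs, kappa_hat, L_n

# Example with assumed values
rng = np.random.default_rng(0)
x = rng.normal(4.0, 2.0, 200)                    # unknown covariates
x0 = x + rng.normal(0.0, 0.7, 200)               # observed; error variance 0.49
xs, kappa_hat, _ = stein_covariates(x0, 0.49)
print(f"kappa_hat = {kappa_hat:.3f}")            # near 4/(4 + 0.49) ~ 0.891
```

Each $x_i^s$ shrinks the observed value toward the grand mean by exactly the estimated reliability ratio, mirroring the parametric form (3.13).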
Writing
$$z_i = e_i - \beta_1\{\kappa_x u_i - (1 - \kappa_x)(x_i - \mu_x)\} = e_i - v_i, \qquad v_i = \nu_1 u_i - \beta_1(1 - \kappa_x)(x_i - \mu_x) \text{ (say)},$$
we define the c.d.f. and p.d.f. of the $z_i$'s as a convolution of $e_i$ and $v_i$:
$$F^*(x) = \int_{-\infty}^{\infty} F(x - \kappa_x\beta_1 v)\, h_\theta(v)\,dv \quad \text{and} \quad f^*(x) = \kappa_x\beta_1 \int_{-\infty}^{\infty} f(x - \kappa_x\beta_1 v)\, h_\theta(v)\,dv, \tag{3.15}$$
where $h_\theta(v)$ is the p.d.f. of the $v_i$'s, depending on $\theta = (\kappa_x, \beta_1)$ and given by $v_i = \{u_i - (1 - \kappa_x)\kappa_x^{-1}(x_i - \mu_x)\}$, $i = 1, \dots, n$. Now assume $f_1 = f_2 = f_0$ in (i) and (ii) of Section 3. Then
$$I(f^*) = \int_{-\infty}^{\infty} \left\{-\frac{f^{*\prime}(x)}{f^*(x)}\right\}^2 f^*(x)\,dx \le \kappa_x\beta_1 I(f). \tag{3.16}$$
Under this model with the associated assumptions, the LSE (2.15) of $\beta$ has the asymptotic distribution
$$\sqrt{n}(\hat{\beta}_n^{(L)} - \beta) \approx \mathcal{N}_2\left(\mathbf{0},\ (\sigma_e^2 + \kappa_x\lambda_u^2\beta_1^2\sigma_u^2)\,\Sigma\right), \tag{3.17}$$
where
$$\Sigma = \begin{pmatrix} 1 + \mu_x^2 G^* & -\mu_x G^* \\ -\mu_x G^* & G^* \end{pmatrix} \quad \text{with} \quad G^* = \frac{1}{\kappa_x\sigma_x^2} + \frac{\beta_1^2 E[u^4 - 2\sigma_u^4]}{(\sigma_e^2 + \kappa_x\lambda_u^2\beta_1^2\sigma_u^2)\sigma_x^4}.$$
Thus, $\mathrm{JARE}(\tilde{\beta}_n^{(L)} : \beta_n^{*(L)}) = [(1 + \Delta^{*2})G^*]^{-1}$ with $\Delta^{*2} = \sigma_e^{-2}\kappa_x\lambda_u^2\beta_1^2\sigma_u^2$. Further, define the Fisher scores
$$\varphi(u, f^*) = -\frac{f^{*\prime}(F^{*-1}(u))}{f^*(F^{*-1}(u))}, \quad 0 < u < 1, \qquad \text{and} \qquad \gamma^*(\varphi, f^*) = \int_0^1 \varphi(u)\varphi(u, f^*)\,du,$$
based on the distribution $f^*$ of the errors $z_i$.

In the next section we develop the R-estimators of $\beta$.
4 R-estimators of Regression Parameters
Consider the traditional model (2.1) again, with the associated assumptions in (2.3). We consider the R-estimators of $\beta = (\beta_0, \beta_1)'$ using the linear rank statistics of Saleh and Sen (1978):
$$L_n^0(b) = n^{-1/2}\sum_{i=1}^n (x_i - \bar{x}_n)\, a_n^\varphi(R_i(b))$$
and
$$T_n^0(a, b) = n^{-1/2}\sum_{i=1}^n a_n^{\varphi^+}(R_{ni}^+(a, b))\operatorname{sgn}(y_i - a - bx_i), \tag{4.1}$$
where $R_i(b)$ is the rank of $y_i - bx_i$ among $y_1 - bx_1, \dots, y_n - bx_n$ and $R_{ni}^+(a, b)$ is the rank of $|y_i - a - bx_i|$ among $|y_1 - a - bx_1|, \dots, |y_n - a - bx_n|$. Now, consider the scores $a_n^\varphi(i)$ (resp. $a_n^{\varphi^+}(i)$) generated by a nondecreasing, square-integrable score function $\varphi$ (resp. $\varphi^+$) $: (0,1) \to \mathbb{R}^1$ in either of the two following ways:
$$a_n^\varphi(i) = E\varphi(U_{(i)}) \quad \text{or} \quad \varphi\!\left(\frac{i}{n+1}\right), \qquad i = 1, \dots, n, \tag{4.2}$$
$$a_n^{\varphi^+}(i) = E\varphi^+(U_{(i)}) \quad \text{or} \quad \varphi^+\!\left(\frac{i}{n+1}\right), \qquad i = 1, \dots, n,$$
where $U_{(1)} \le \cdots \le U_{(n)}$ are the order statistics of a sample of size $n$ from the uniform distribution $U(0,1)$. Here $\varphi^+(u) = \varphi\!\left(\frac{1+u}{2}\right)$ with the condition $\varphi(u) + \varphi(1-u) = 0$. We define
$$A_\varphi^2 = \int_0^1 \varphi^2(u)\,du - \left(\int_0^1 \varphi(u)\,du\right)^2, \tag{4.3}$$
$$\gamma(\varphi, f) = \int_0^1 \varphi(u)\varphi(u, f)\,du, \qquad \varphi(u, f) = -\frac{f'(F^{-1}(u))}{f(F^{-1}(u))}, \quad 0 < u < 1.$$
Following Saleh and Sen (1978), we define the R-estimator of $\beta_1$ as
$$\beta_{1n}^{*(R)} = \tfrac{1}{2}\left\{\sup[b : L_n^0(b) > 0] + \inf[b : L_n^0(b) < 0]\right\} \tag{4.4}$$
and that of $\beta_0$ by
$$\beta_{0n}^{*(R)} = \tfrac{1}{2}\left\{\sup[a : T_n^0(a, \beta_{1n}^{*(R)}) > 0] + \inf[a : T_n^0(a, \beta_{1n}^{*(R)}) < 0]\right\}, \tag{4.5}$$
where
$$T_n^0(a, \beta_{1n}^{*(R)}) = n^{-1/2}\sum_{i=1}^n a_n^{\varphi^+}\!\left(R_{ni}^+(a, \beta_{1n}^{*(R)})\right)\operatorname{sgn}(y_i - a - \beta_{1n}^{*(R)} x_i) \tag{4.6}$$
is the aligned rank statistic. Now we use the asymptotic linearity results of Jurečková (1971), given by
$$\sup_{|\delta| < c}\left\{ n^{1/2}\left| L_n^0(n^{-1/2}\delta) - L_n^0(0) + \gamma(\varphi, f)\sigma_x^2\delta \right| \right\} = O_p(1) \tag{4.7}$$
and
$$\sup_{|\delta_i| < c_i,\, i = 1, 2}\left\{ n^{1/2}\left| T_n^0(n^{-1/2}\delta_1, n^{-1/2}\delta_2) - T_n^0(0, 0) + \gamma(\varphi, f)[\delta_1 + \delta_2\mu_x] \right| \right\} = O_p(1). \tag{4.8}$$
From the nonincreasing property of $L_n^0(b)$ and the linearity results (4.7) and (4.8), we obtain $n^{1/2}|\beta_{0n}^{*(R)} - \beta_0| = O_p(1)$ and $n^{1/2}|\beta_{1n}^{*(R)} - \beta_1| = O_p(1)$, and the asymptotic normality given by Saleh (2006):
$$\begin{pmatrix} \sqrt{n}(\beta_{0n}^{*(R)} - \beta_0) \\ \sqrt{n}(\beta_{1n}^{*(R)} - \beta_1) \end{pmatrix} \sim \mathcal{N}_2\left( \begin{pmatrix} 0 \\ 0 \end{pmatrix},\ \frac{A_\varphi^2}{\gamma^2(\varphi, f)} \begin{pmatrix} 1 + \frac{\mu_x^2}{\sigma_x^2} & -\frac{\mu_x}{\sigma_x^2} \\[4pt] -\frac{\mu_x}{\sigma_x^2} & \frac{1}{\sigma_x^2} \end{pmatrix} \right). \tag{4.9}$$
Thus, the ARE of $\beta_n^{*(R)}$ compared to $\beta_n^{*(L)}$ is given by
$$\mathrm{ARE}\left(\beta_n^{*(R)}, \beta_n^{*(L)}\right) = \frac{\sigma_e^2\gamma^2(\varphi, f)}{A_\varphi^2}. \tag{4.10}$$
If $f$ is normal and we use the Wilcoxon score $\varphi(u) = 2u - 1$, then
$$\mathrm{ARE}\left(\beta_n^{*(R)}, \beta_n^{*(L)}\right) = \frac{3}{\pi}. \tag{4.11}$$
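As a quick check of (4.11) (a worked computation added here, not in the original), take the Wilcoxon score $\varphi(u) = 2u - 1$ and normal $f$ with variance $\sigma_e^2$. Using (4.3) and the identity $\gamma(\varphi, f) = 2\int f^2(x)\,dx$ from (4.23),
$$A_\varphi^2 = \int_0^1 (2u - 1)^2\,du = \frac{1}{3}, \qquad \gamma(\varphi, f) = 2\int_{-\infty}^{\infty} f^2(x)\,dx = \frac{2}{2\sigma_e\sqrt{\pi}} = \frac{1}{\sigma_e\sqrt{\pi}},$$
so that (4.10) gives
$$\mathrm{ARE} = \frac{\sigma_e^2\gamma^2(\varphi, f)}{A_\varphi^2} = \frac{\sigma_e^2 \cdot \frac{1}{\pi\sigma_e^2}}{1/3} = \frac{3}{\pi} \approx 0.955.$$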
Now, we consider the measurement error model (3.1) together with the "quasi-empirical Bayes" model (3.14), along with the assumptions (i)-(iii) of Section 3. Let $\varphi : (0,1) \to \mathbb{R}^1$ be a score function belonging to the class of non-constant, nondecreasing and square-integrable functions, and consider the scores $a_n^\varphi(1), \dots, a_n^\varphi(n)$ generated by $\varphi$ as defined in (4.2). Define the linear rank statistic (LRS) for the slope parameter of the model (3.14), based on the observations $\{(x_i^0, y_i) \mid i = 1, \dots, n\}$, as
$$L_n(b) = n^{-1/2}\sum_{i=1}^n (x_i^0 - \bar{x}_n^0)\, a_n^\varphi(R_i(b)), \tag{4.12}$$
where $R_i(b)$ is the rank of $y_i - bx_i^0$ among $y_1 - bx_1^0, \dots, y_n - bx_n^0$. Notice that $y_1 - bx_1^0, \dots, y_n - bx_n^0$ are independently distributed random variables with pdf $f^*$. Hence,
$$P\{R_1(b), \dots, R_n(b)\} = \frac{1}{n!}, \tag{4.13}$$
so (4.12) is a legitimate linear rank statistic for the estimation of the slope parameter of the model. Now, setting $L_n(b) = 0$, we define the solution as the estimator of $\nu_1 = \kappa_x\beta_1$, the slope of the "quasi-empirical Bayes" model. Due to the nonincreasing nature of $L_n(b)$ as a function of $b$, we define the R-estimator $\hat{\nu}_{1n}^{(R)}$ routinely as
$$\hat{\nu}_{1n}^{(R)} = \tfrac{1}{2}\left\{\sup[b : L_n(b) > 0] + \inf[b : L_n(b) < 0]\right\}, \tag{4.14}$$
which satisfies the asymptotic linearity result of Jurečková (1971) given by
$$\sup_{|\delta^*| < c}\left\{\left| L_n(\nu_1 + n^{-1/2}\delta^*) - L_n(\nu_1) + \gamma(\varphi, f^*)\sigma_{x^0}^2\delta^* \right|\right\} = O_p(1), \tag{4.15}$$
where $\delta^* = \kappa_x\delta$ with $\kappa_x = \lambda_x^2\sigma_x^2/(\lambda_x^2\sigma_x^2 + \lambda_u^2\sigma_u^2)$. To justify its consideration, we look at the "quasi-empirical Bayes" model in vector form:
$$\mathbf{y} = [\beta_0 + \beta_1(1 - \kappa_x)\mu_x]\mathbf{1}_n + \kappa_x\beta_1\mathbf{x}^0 + \mathbf{z},$$
where $\mathbf{z}$ is the error vector of independent components, so that the R-estimator defined by (4.14) estimates $\nu_1 = \kappa_x\beta_1$. The proof of (4.15) may be found in Jurečková and Sen (1996), Hájek et al. (1999) and Koul (2000). Since $L_n(b)$ is a nonincreasing function of $b$ and (4.15) holds, we conclude that $n^{1/2}(\hat{\nu}_{1n}^{(R)} - \nu_1) = O_p(1)$, where $\nu_1 = \kappa_x\beta_1$. Hence $\hat{\nu}_{1n}^{(R)}$ is a consistent estimator of $\nu_1$, and therefore $\kappa_x^{-1}\hat{\nu}_{1n}^{(R)}$ is also a consistent estimator of $\beta_1$. Hence, inserting $\delta^* = n^{1/2}(\hat{\nu}_{1n}^{(R)} - \nu_1)$ in (4.15), we obtain
$$n^{1/2}(\hat{\nu}_{1n}^{(R)} - \nu_1) = \frac{1}{\gamma(\varphi, f^*)} L_n^*(\nu_1) + o_p(1), \tag{4.16}$$
where
$$L_n^*(\nu_1) = n^{-1/2}\sum_{i=1}^n (x_i^0 - \bar{x}_n^0)\varphi(F^*(z_i)) + o_p(1) \approx \mathcal{N}(0, A_\varphi^2\sigma_{x^0}^2), \tag{4.17}$$
where $\sigma_{x^0}^2 = \lambda_x^2\sigma_x^2 + \lambda_u^2\sigma_u^2$. Thus, we obtain
$$n^{1/2}(\hat{\nu}_{1n}^{(R)} - \nu_1) \sim \mathcal{N}\left(0,\ \frac{A_\varphi^2}{\gamma^2(\varphi, f^*)\sigma_{x^0}^2}\right). \tag{4.18}$$
Consequently,
$$n^{1/2}(\hat{\beta}_{1n}^{(R)} - \beta_1) \sim \mathcal{N}\left(0,\ \frac{A_\varphi^2}{\gamma^2(\varphi, f^*)\kappa_x\lambda_x^2\sigma_x^2}\right). \tag{4.19}$$
The asymptotic relative efficiency (ARE) of $\hat{\beta}_{1n}^{(R)}$ relative to $\hat{\beta}_{1n}^{(L)}$ is given by
$$\mathrm{ARE}(\hat{\beta}_{1n}^{(R)} : \hat{\beta}_{1n}^{(L)}) = \frac{(\sigma_e^2 + \kappa_x\lambda_u^2\beta_1^2\sigma_u^2)\,\gamma^2(\varphi, f^*)}{A_\varphi^2}\left(1 + \frac{\kappa_x\beta_1^2 E[u^4 - 2\sigma_u^4]}{(\sigma_e^2 + \kappa_x\lambda_u^2\sigma_u^2\beta_1^2)\lambda_x^2\sigma_x^2}\right). \tag{4.20}$$
If $\varphi(u) = 2u - 1$, i.e., the Wilcoxon score, the ARE turns out to be
$$48\left(\sigma_e^2 + \kappa_x\lambda_u^2\beta_1^2\sigma_u^2\right)\left(1 + \frac{\kappa_x\beta_1^2 E[u^4 - 2\sigma_u^4]}{(\sigma_e^2 + \kappa_x\lambda_u^2\sigma_u^2\beta_1^2)\lambda_x^2\sigma_x^2}\right) B^2(F^*), \tag{4.21}$$
and the ARE of $\hat{\beta}_{1n}^{(R)}$ with respect to $\hat{\beta}_{1n}^{*(R)}$ is simply given by
$$\frac{\kappa_x\gamma^2(\varphi, f^*)}{\gamma^2(\varphi, f)} \le \frac{\kappa_x B^2(F^*)}{B^2(F)} \le \int h_\theta^2(t)\,dt, \tag{4.22}$$
where
$$B^2(F) = \left(\int f^2(x)\,dx\right)^2 \quad \text{and} \quad B^2(F^*) = \left(\int_{-\infty}^{\infty} f^{*2}(x)\,dx\right)^2.$$
The above result follows since, with $\varphi(u) = 2u - 1$,
$$\gamma(\varphi, f) = \int \varphi'(F(x))f^2(x)\,dx = 2\int f^2(x)\,dx, \tag{4.23}$$
$$\gamma^*(\varphi, f^*) = 2\int f^{*2}(x)\,dx, \tag{4.24}$$
where
$$f^*(x) = \int f(x - t)h_\theta(t)\,dt \quad \text{and} \quad \frac{\gamma^*(\varphi, f^*)}{\gamma(\varphi, f)} = \frac{\int\left(\int f(x - t)h_\theta(t)\,dt\right)^2 dx}{\int f^2(x)\,dx} \le \int h_\theta^2(t)\,dt.$$
5 Joint R-estimation of Intercept and Slope
In this section, we consider the estimation of the intercept and slope parameters jointly, using the two linear rank statistics as in Saleh and Sen (1978):
$$L_n(b) = n^{-1/2}\sum_{i=1}^n (x_i^0 - \bar{x}_n^0)\, a_n^\varphi(R_i(b)) \tag{5.1}$$
and
$$T_n(a, b) = n^{-1/2}\sum_{i=1}^n a_n^{\varphi^+}(R_i^+(a, b))\operatorname{sgn}(y_i - a - bx_i^0), \tag{5.2}$$
where $R_i^+(a, b)$ is the rank of $|y_i - a - bx_i^0|$ among $|y_1 - a - bx_1^0|, \dots, |y_n - a - bx_n^0|$, with the scores $a_n^{\varphi^+}(1), \dots, a_n^{\varphi^+}(n)$ defined by
$$a_n^{\varphi^+}(i) = E\varphi^+(U_{(i)}) \quad \text{or} \quad \varphi^+\!\left(\frac{i}{n+1}\right), \tag{5.3}$$
with
$$\varphi^+(u) = \varphi\!\left(\frac{1+u}{2}\right), \qquad \varphi(u) + \varphi(1-u) = 0, \quad 0 < u < 1. \tag{5.4}$$
Let $\hat{\beta}_{1n}^{(R)}$ be the R-estimator of the slope parameter obtained from $\hat{\nu}_{1n}^{(R)}$, which estimates $\nu_1 = \kappa_x\beta_1$. Then, to obtain the R-estimator of the intercept parameter, we consider the aligned statistic
$$T_n(a, \hat{\nu}_{1n}^{(R)}) = n^{-1/2}\sum_{i=1}^n a_n^{\varphi^+}(R_i^+(a, \hat{\nu}_{1n}^{(R)}))\operatorname{sgn}(y_i - a - \hat{\nu}_{1n}^{(R)} x_i^0). \tag{5.5}$$
This allows us to obtain the R-estimator as
$$\hat{\nu}_{0n}^{(R)} = \tfrac{1}{2}\left\{\sup\left[a : T_n(a, \hat{\nu}_{1n}^{(R)}) > 0\right] + \inf\left[a : T_n(a, \hat{\nu}_{1n}^{(R)}) < 0\right]\right\}. \tag{5.6}$$
Now, recall that $F^*$ is symmetric about 0, so that $T_n(0, \nu_1)$ is symmetric about 0, and we obtain (see Saleh and Sen (1978))
$$\sup\left\{ n^{1/2}\left| T_n(n^{-1/2}t_1, \nu_1 + n^{-1/2}t_2) - T_n(0, \nu_1) + \gamma(\varphi, f^*)[t_1 + t_2\mu_x] \right|\right\} = o_p(1), \tag{5.7}$$
where the supremum is taken over $|t_1| \le c_1$ and $|t_2| \le c_2$. This, together with (4.15), implies that we can use the statistics
$$L_n^*(\nu_1) = n^{-1/2}\sum_{i=1}^n (x_i^0 - \bar{x}_n^0)\varphi(F^*(z_i)) + o_p(1) \tag{5.8}$$
and
$$T_n^*(0, \nu_1) = n^{-1/2}\sum_{i=1}^n \varphi^+(F^*(z_i))\operatorname{sgn}(z_i) + o_p(1) \tag{5.9}$$
to represent the estimator of $\nu_0$ as
$$\sqrt{n}(\hat{\nu}_{0n}^{(R)} - \nu_0) = \gamma^{-1}(\varphi, f^*)\left[T_n^*(0, \nu_1) - \mu_x L_n^*(\nu_1)\right] + o_p(1), \tag{5.10}$$
since
$$\begin{pmatrix} T_n(0, \nu_1) \\ L_n(\nu_1) \end{pmatrix} \sim \mathcal{N}_2\left( \begin{pmatrix} 0 \\ 0 \end{pmatrix},\ A_\varphi^2 \begin{pmatrix} 1 & 0 \\ 0 & \lambda_x^2\sigma_x^2 + \lambda_u^2\sigma_u^2 \end{pmatrix} \right), \tag{5.11}$$
and we have
$$\begin{pmatrix} \sqrt{n}(\hat{\beta}_{0n}^{(R)} - \beta_0) \\ \sqrt{n}(\hat{\beta}_{1n}^{(R)} - \beta_1) \end{pmatrix} \sim \mathcal{N}_2\left( \begin{pmatrix} 0 \\ 0 \end{pmatrix},\ \frac{A_\varphi^2}{\gamma^2(\varphi, f^*)} \begin{pmatrix} 1 + \frac{\mu_x^2}{\kappa_x\lambda_x^2\sigma_x^2} & -\frac{\mu_x}{\kappa_x\lambda_x^2\sigma_x^2} \\[4pt] -\frac{\mu_x}{\kappa_x\lambda_x^2\sigma_x^2} & \frac{1}{\kappa_x\lambda_x^2\sigma_x^2} \end{pmatrix} \right). \tag{5.12}$$
Hence, the JARE of $\hat{\beta}_n^{(R)}$ compared to $\beta_n^{*(R)}$ is given by
$$\mathrm{JARE}(\hat{\beta}_n^{(R)} : \beta_n^{*(R)}) = \frac{\kappa_x\gamma^2(\varphi, f^*)}{\gamma^2(\varphi, f)} = \frac{\kappa_x B^2(F^*)}{B^2(F)}, \tag{5.13}$$
and the JARE of $\hat{\beta}_n^{(R)}$ compared to the LSE is given by
$$48\,\sigma_e^2(1 + \Delta^{*2})G^* B^2(F^*) \tag{5.14}$$
if the Wilcoxon score is used in both cases.
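A companion sketch (again ours, with hypothetical helper names) for the intercept: with the Wilcoxon choice, (5.4) gives $\varphi^+(u) = u$, and (5.6) can be solved by bisection on the sign of the aligned statistic, which is nonincreasing in $a$:

```python
import numpy as np
from scipy.stats import rankdata

def T_n(a, b, x0, y):
    """Aligned sign-rank statistic (5.5) with Wilcoxon scores
    phi+(u) = u, i.e. a+(i) = i/(n+1)."""
    n = len(y)
    r = y - a - b * x0
    ranks_abs = rankdata(np.abs(r))          # R_i^+(a, b)
    return np.sum((ranks_abs / (n + 1)) * np.sign(r)) / np.sqrt(n)

def intercept_r_estimate(x0, y, nu1_hat, lo=-50.0, hi=50.0, tol=1e-8):
    """R-estimate of nu0 per (5.6): T_n(a, nu1_hat) is nonincreasing in a,
    so bisect on its sign; [lo, hi] is assumed to bracket the crossing."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if T_n(mid, nu1_hat, x0, y) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

Since $\nu_0 = \beta_0 + \beta_1(1 - \kappa_x)\mu_x$ by (3.9), one may then recover $\hat{\beta}_{0n}^{(R)}$ from $\hat{\nu}_{0n}^{(R)}$, $\hat{\beta}_{1n}^{(R)}$, $\hat{\kappa}_x$ and $\bar{x}_n^0$ (the latter estimating $\mu_x$).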
6 Jaeckel’s Estimator of Slope
In this section we consider Jaeckel's (1972) estimator of the slope $\nu_1$ of the model (3.9). The dispersion $D(\mathbf{z})$ of $z_1, \dots, z_n$ is minimized, where
$$z_i = y_i - \nu_1 x_i^0, \qquad i = 1, \dots, n. \tag{6.1}$$
Accordingly, let $a_n^\varphi(i)$, $i = 1, \dots, n$, be a nondecreasing set of scores, not all equal, satisfying the condition
$$\sum_{i=1}^n a_n^\varphi(i) = 0. \tag{6.2}$$
Based on the observations $\{(x_i^0, y_i)' \mid i = 1, \dots, n\}$ from (3.9), consider $z_{(1)} \le \cdots \le z_{(n)}$ to be the ordered values of the residuals (6.1). Let us write $z_{(k)} = y_{i(k)} - \nu_1 x_{i(k)}^0$, where $i(k)$ is the index of the observation giving rise to the $k$-th ordered residual. The dispersion of the residuals as a function of $\nu_1$ is then given by
$$D(\mathbf{y} - \nu_1\mathbf{x}^0) = \sum_{k=1}^n a_n^\varphi(k)\left[y_{i(k)} - \nu_1 x_{i(k)}^0\right]. \tag{6.3}$$
By Theorem 1 of Jaeckel (1972), $D(\mathbf{y} - \nu_1\mathbf{x}^0)$ is a non-negative, continuous and convex function of $\nu_1$. The minimization of (6.3) with respect to $\nu_1$ yields the R-estimator of $\nu_1$, although the estimator may not be unique; any choice among the minimizers is acceptable. There is an asymptotic equivalence between Jaeckel's estimator based on (6.3) and the estimators discussed in Sections 3 and 4; they solve the same equation, since
$$\frac{\partial}{\partial\Delta} D(\mathbf{y} - \Delta\mathbf{x}^0) = -L_n(\mathbf{y} - \Delta\mathbf{x}^0), \qquad \Delta = n^{1/2}\nu_1. \tag{6.4}$$
In other words, the derivative of Jaeckel's dispersion criterion is the negative of the linear rank statistic, subject to (6.2). Let $\hat{\nu}_{1n}^{(J)}$ denote the consistent Jaeckel estimator of $\nu_1$, obtained by minimizing (6.3). Then the R-estimator is obtained as
$$\hat{\beta}_{1n}^{(J)} = \hat{\kappa}_x^{-1}\hat{\nu}_{1n}^{(J)} = \left[1 - (S_{nxx} + S_{nuu})^{-1}S_{nuu}\right]^{-1}\hat{\nu}_{1n}^{(J)}. \tag{6.5}$$
Now, considering the AUL results given at (4.15) and (4.16), we investigate the quadratic form
$$Q(\delta) = \tfrac{1}{2}\gamma(\varphi, f^*)(S_{nxx} + S_{nuu})\delta^2 - L_n^*(\nu_1)\delta + O_p(1). \tag{6.6}$$
The derivative of $Q(\delta)$ with respect to $\delta$ is given by
$$\frac{\partial Q(\delta)}{\partial\delta} = \gamma(\varphi, f^*)(S_{nxx} + S_{nuu})\delta - L_n^*(\nu_1), \tag{6.7}$$
where $L_n^*(\nu_1)$ is given by (4.17). This leads to the asymptotic representation
$$\sqrt{n}(\hat{\nu}_{1n}^{(J)} - \nu_1) = \gamma^{-1}(\varphi, f^*)(S_{nxx} + S_{nuu})^{-1}L_n^*(\nu_1) + o_p(1), \tag{6.8}$$
yielding the asymptotic distribution of $\sqrt{n}(\hat{\nu}_{1n}^{(J)} - \nu_1)$ as
$$\mathcal{N}\left(0,\ \frac{A_\varphi^2}{\gamma^2(\varphi, f^*)}\,[\sigma_{x^0}^2]^{-1}\right), \tag{6.9}$$
as given in (4.18). Hence,
$$\sqrt{n}(\hat{\beta}_{1n}^{(J)} - \beta_1) \sim \mathcal{N}\left(0,\ \frac{A_\varphi^2}{\gamma^2(\varphi, f^*)}\,(\kappa_x\lambda_x^2\sigma_x^2)^{-1}\right). \tag{6.10}$$
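Since $D(\mathbf{y} - \nu_1\mathbf{x}^0)$ is convex and piecewise linear in $\nu_1$, a direct numerical minimization is straightforward. A minimal Python sketch (ours, for illustration, with an assumed search bracket) using centered Wilcoxon scores:

```python
import numpy as np
from scipy.optimize import minimize_scalar

def jaeckel_dispersion(nu1, x0, y):
    """Jaeckel's dispersion (6.3) with centered Wilcoxon scores
    a(k) = k/(n+1) - 1/2, which satisfy the zero-sum condition (6.2)."""
    n = len(y)
    a = np.arange(1, n + 1) / (n + 1) - 0.5
    return np.sum(a * np.sort(y - nu1 * x0))     # sum of scored ordered residuals

# Example with assumed values; D is convex in nu1 (Jaeckel 1972, Thm 1),
# so bounded scalar minimization over a bracket containing nu1 suffices.
rng = np.random.default_rng(1)
x = rng.uniform(-2, 10, 200)
x0 = x + rng.normal(0, 0.7, 200)
y = 1.0 + 3.0 * x + rng.normal(0, 1, 200)
res = minimize_scalar(jaeckel_dispersion, args=(x0, y),
                      bounds=(-10, 10), method="bounded")
print(f"nu1_hat(J) = {res.x:.3f}")               # estimates nu1 = kappa_x * beta1
```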
We now consider Jaeckel's (1972) estimator using the Wilcoxon scores
$$a_n^W(i) = \frac{i}{n+1} - \frac{1}{2}, \qquad i = 1, \dots, n. \tag{6.11}$$
Then Jaeckel's estimator based on the residuals (6.1) is given by the median of the divided differences $\left\{(y_j - y_i)/(x_j^0 - x_i^0) \mid 1 \le i < j \le n\right\}$, i.e.,
$$\hat{\nu}_{1n}^{(J)} = \operatorname{med}_{1 \le i < j \le n}\left\{\frac{y_j - y_i}{x_j^0 - x_i^0}\right\}. \tag{6.12}$$
Consequently, the slope estimator is given by
$$\hat{\beta}_{1n}^{(J)} = \hat{\kappa}_x^{-1}\left\{\operatorname{med}_{1 \le i < j \le n}\frac{y_j - y_i}{x_j^0 - x_i^0}\right\}, \tag{6.13}$$
where $\hat{\kappa}_x$ is given by (3.12). Thus, it is easily proved using (6.8) that
$$\sqrt{n}(\hat{\nu}_{1n}^{(J)} - \nu_1) \sim \mathcal{N}\left(0,\ \frac{1}{48\left(\int f^{*2}(x)\,dx\right)^2\sigma_{x^0}^2}\right) \tag{6.14}$$
and
$$\sqrt{n}(\hat{\beta}_{1n}^{(J)} - \beta_1) \sim \mathcal{N}\left(0,\ \frac{1}{48\left(\int f^{*2}(x)\,dx\right)^2\kappa_x\lambda_x^2\sigma_x^2}\right). \tag{6.15}$$
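The estimator (6.12)-(6.13) is simply the Theil-Sen slope computed on the observed pairs and then de-attenuated. A minimal self-contained sketch (ours, with the consistency correction of (3.12) assuming known error variance):

```python
import numpy as np
from itertools import combinations

def theil_sen_me(x0, y, lam_u2_sigma_u2):
    """De-attenuated Theil-Sen slope for the ME model, per (6.12)-(6.13)."""
    n = len(x0)
    # median of pairwise divided differences (6.12); O(n^2) pairs
    slopes = [(y[j] - y[i]) / (x0[j] - x0[i])
              for i, j in combinations(range(n), 2) if x0[j] != x0[i]]
    nu1_J = np.median(slopes)
    # consistent reliability-ratio estimate, as in (3.12)
    S_nx0x0 = np.mean((x0 - x0.mean())**2)
    kappa_hat = 1 - (n - 3) * lam_u2_sigma_u2 / (n * S_nx0x0)
    return nu1_J / kappa_hat, nu1_J
```

For large $n$, one would replace the explicit pairwise enumeration by an $O(n\log n)$ median-of-slopes algorithm or by subsampling of pairs.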
The ARE of this estimator compared to the least squares estimator is then
$$48\left(\sigma_e^2 + \kappa_x\lambda_u^2\sigma_u^2\beta_1^2\right)\left(1 + \frac{\kappa_x\beta_1^2 E[u^4 - 2\sigma_u^4]}{(\sigma_e^2 + \kappa_x\lambda_u^2\sigma_u^2\beta_1^2)\lambda_x^2\sigma_x^2}\right) B^2(F^*), \tag{6.16}$$
and the ARE of $\hat{\beta}_{1n}^{(J)}$ compared to $\beta^{*(J)}$ is given by
$$\kappa_x\frac{\gamma^2(\varphi, f^*)}{\gamma^2(\varphi, f)} = \kappa_x\frac{B^2(F^*)}{B^2(F)} \le \kappa_x B^2(H_\theta). \tag{6.17}$$
We remind the reader that Jaeckel's estimators $\hat{\nu}_{1n}^{(J)}$ and $\hat{\beta}_{1n}^{(J)}$ may also be obtained using Kendall's tau test, resulting in the Theil-Sen type estimator (Sen 1968) in the model (3.1), with the ME version in Sen and Saleh (2010). In the next section we provide some simulation studies on the performance of the R-estimators for the ME models.
7 Simulation Study
We have conducted a simulation study to check how the proposed estimation procedures perform in finite-sample situations. We considered the model
$$y_i = \beta_0 + \beta_1 x_i + e_i, \qquad x_i^0 = x_i + u_i, \qquad i = 1, \dots, n, \tag{7.1}$$
where the errors $e_i$, $i = 1, \dots, n$, were simulated from the normal $N(0,1)$, Laplace $L(0,1)$ and Cauchy distributions. The measurement errors $u_i$, $i = 1, \dots, n$, were generated independently from the normal $N(0, 0.5)$, $N(0, 2)$ and uniform $U(-1, 1)$ distributions. We considered design points $x_1, \dots, x_n$ generated from the uniform distribution on the interval $(-2, 10)$.
The following parameter values of the model were used:

• sample sizes: $n = 20, 100$;

• $\beta_0 = 1$, $\beta_1 = 3$;

• Wilcoxon score function $\varphi(u) = 2u - 1$, $0 \le u \le 1$, with $A_\varphi^2 = \frac{1}{12}$.
For each combination of the parameters and each distribution of the measurement errors, 10,000 replications of the model were simulated. The slope parameter was estimated by the least squares, R- and Jaeckel's estimators. For the sake of comparison, the mean square error (MSE), the mean, the median, and the 2.5% and 97.5% sample quantiles were computed and summarized in Tables 1-9. These tables compare the slope estimators under different conditions of sample size, error distribution and measurement errors. These results are reproduced from the paper by Saleh et al. (2009).
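To illustrate how such a table is produced, here is a self-contained Python sketch of one simulation cell (our reconstruction of the described setup, not the authors' code), comparing the de-attenuated LSE and Theil-Sen slopes under normal errors; the R-estimator sketch from Section 4 can be dropped into the same loop.

```python
import numpy as np

rng = np.random.default_rng(2024)
n, beta0, beta1, sigma_u2 = 20, 1.0, 3.0, 0.5
reps = 1000                                   # reduced from the paper's 10,000

est = {"LSE": [], "Theil-Sen": []}
for _ in range(reps):
    x = rng.uniform(-2, 10, n)
    x0 = x + rng.normal(0, np.sqrt(sigma_u2), n)
    y = beta0 + beta1 * x + rng.normal(0, 1, n)
    S = np.mean((x0 - x0.mean())**2)
    kappa = 1 - sigma_u2 / S                  # reliability-ratio estimate
    lse = np.mean((x0 - x0.mean()) * (y - y.mean())) / S
    i, j = np.triu_indices(n, k=1)
    ts = np.median((y[j] - y[i]) / (x0[j] - x0[i]))
    est["LSE"].append(lse / kappa)            # de-attenuated, eq. (2.15)
    est["Theil-Sen"].append(ts / kappa)       # de-attenuated, eq. (6.13)

for name, vals in est.items():
    v = np.asarray(vals)
    print(f"{name}: MSE={np.mean((v - beta1)**2):.4f}, "
          f"mean={v.mean():.3f}, median={np.median(v):.3f}")
```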
Conclusion
The simulation study indicates:
(i) The R-estimator and Jaeckel's estimator give very good results, even though one would intuitively expect the least squares estimator to be more favorable under the normal distribution.

(ii) Even a small sample size does not have a strong influence on the results.

(iii) The R-estimator and Jaeckel's estimator perform well for measurement errors with a small variance.

We have also carried out more extensive simulation experiments, considering various score functions, design vectors, and other underlying distributions of the error terms and of the measurement errors with a small variance. The corresponding estimates appeared to be only very slightly influenced by these choices.
Estimator   u_j        MSE      mean     2.5%-q.  median   97.5%-q.
LSE         -          0.00466  2.99942  2.8647   2.99924  3.13477
            N(0,0.5)   0.02585  3.00118  2.68445  2.99926  3.31555
            N(0,2)     0.08827  3.00094  2.41449  2.99593  3.59150
            U(-1,1)    0.01838  2.99971  2.73392  3.00007  3.26213
R           -          0.00507  2.99913  2.86009  2.99834  3.14291
            N(0,0.5)   0.02777  3.00045  2.67288  2.99786  3.32993
            N(0,2)     0.09290  3.00336  2.41420  2.99910  3.60857
            U(-1,1)    0.01798  2.99558  2.73435  2.99541  3.25841
Jaeckel     -          0.00529  2.99920  2.85672  2.99907  3.14351
            N(0,0.5)   0.02921  3.01801  2.67955  3.01528  3.36205
            N(0,2)     0.09979  3.04651  2.44197  3.03935  3.65707
            U(-1,1)    0.01868  3.00963  2.74470  3.01258  3.27215

Table 1: Sample statistics of 10,000 values of the estimated slope parameter in model (7.1) for the least squares, R- and Jaeckel's estimators, various distributions of the measurement errors $u_j$, $j = 1, \dots, n$, sample size $n = 20$ and the standard normal distribution of the errors $e_i$, $i = 1, \dots, n$.
Estimator   u_j        MSE      mean     2.5%-q.  median   97.5%-q.
LSE         -          0.00899  3.00142  2.81441  3.00100  3.19353
            N(0,0.5)   0.02970  3.00438  2.67894  3.00094  3.35343
            N(0,2)     0.09093  3.00475  2.41145  3.00343  3.58918
            U(-1,1)    0.02284  3.00298  2.70889  3.00071  3.30307
R           -          0.00707  3.00088  2.83494  3.00134  3.17287
            N(0,0.5)   0.03101  3.00327  2.66215  2.99926  3.34867
            N(0,2)     0.09622  3.00468  2.39988  3.00659  3.61228
            U(-1,1)    0.02238  2.99949  2.71051  2.99607  3.29889
Jaeckel     -          0.00733  3.00069  2.82944  3.00115  3.17347
            N(0,0.5)   0.03287  3.02051  2.67842  3.01766  3.38284
            N(0,2)     0.10324  3.04708  2.44339  3.04161  3.67898
            U(-1,1)    0.02332  3.01260  2.71744  3.00966  3.31475

Table 2: Sample statistics of 10,000 values of the estimated slope parameter in model (7.1) for the least squares, R- and Jaeckel's estimators, various distributions of the measurement errors $u_j$, $j = 1, \dots, n$, sample size $n = 20$ and the Laplace distribution of the errors $e_i$, $i = 1, \dots, n$.