Overdispersi dan Underdispersi

(1)

(2)

 Dispersi adalah ketidaksesuaian antara

ragam pengamatan dengan ragam dari model binomial yang diduga

 Dalam regresi logistik, diasumsikan

bahwa peubah respon (Y_i ) berdistribusi

(3)

 Overdispersi adalah ragam pengamatan lebih besar

dari ragam harapan

 Underdispersi adalah ragam pengamatan lebih

kecil dari ragam harapan

 beberapa hal yang menjadi penyebab terjadinya

overdispersi dan underdispersi adalah: 1. Menghilangkan kovariat yang penting.

2. Kesalahan dalam penentuan link function. 3. Adanya korelasi antar pengamatan.

(4)

 Parameter dispersi disimbolkan dengan 

, dan pemodelan dispersi dirumuskan

melalui model berikut (Williams, 1982):

 Jika  bernilai 0 maka tidak terjadi

dispersi dan respon mengikuti sebaran binomial

 Selain itu maka terjadi overdispersi atau

underdispersi

 

Y_i n_i_i



1 _i





1



n_i  1







(5)

 Pemeriksaan terjadinya overdispersi dan

underdispersi dapat dideteksi dengan

menggunakan nilai statistik 2

Pearson

 Statistik 2

Pearson

merupakan fungsi

(6)

 overdispersi terjadi jika:

 Sedangkan underdispersi terjadi apabila:

 dengan db = m-k, dimana m adalah

banyaknya pengamatan dan k adalah banyaknya parameter yang diduga

(7)

William Method

 Parameter estimate of  denoted by is

obtained by equating 2 statistic of the

model to its approximate expected value, written as :

and

(8)

Where

wi is the weight and vi is diagonal element of the

variance-covariance matrix of the linear predictor,

 The value of 2 statistic depends on , so iteration

process is needed to find optimum value.

 This procedure was firstly introduced by William

(1982), and then called William method

(9)

The algorithm of William method is described as follow:

1. Assumed  = 0 , calculate parameter

estimate of logistic regression model, , using maximum likelihood method.

Calculate the 2 statistics of fitted

model.

2. Compared 2 statistics to distribution. If

2 statistic is too large, conclude that 

> 0 and calculated initial estimates of  using following formula :

(10)

3. Using initial weight re calculate the value of and 2 statistic.

4. If 2 statistic closes to its number of degree of freedom, k − p,

estimated value of is sufficient. If not, re-estimate  using following expression:

 If 2 statistic remain large, return to step 3 until optimum value

of estimated  obtained. Once  has been estimated by ,

could be used as weights in fitting new model

(11)

(12)

(13)