pertemuan 8 estimasi satu populasi compress

(1)

Estimasi

(2)

Statistical Inference

Inferensia statistik adalah proses di mana kita mendapatkan informasi dan menarik kesimpulan tentang populasi dari sampel

Parameter Population

Sample

Statistic Inference

Data

Statistics

Information

Supaya dapat

melakukan inferensia, kemampuan dan

pengetahuan descriptive statistics, distribusi

peluang, dan distribusi

(3)

Statistical Inference

Ada dua jenis inferensi: estimasi dan uji hipotesis;

estimasi diperkenalkan terlebih dahulu.

Tujuan dari estimasi adalah untuk menentukan nilai pendekatan dari parameter populasi

berdasarkan sampel statistik.

E.g., rata-rata sampel ( ) digunakan untuk mengestimasi rata-rata populasi ( μ ).

X

(4)

Estimation

Tujuan dari estimasi adalah untuk menentukan nilai pendekatan dari parameter populasi

berdasarkan sampel statistik.

Ada dua jenis estimator:

1. Estimator Titik

2. Estimator Interval

(5)

Estimasi Titik dan Interval

 Estimasi titik adalah angka tunggal,

 Estimasi interval (Confidence interval)

memberikan informasi tambahan tentang variabilitas

Point Estimate

Lower

Confidence Limit

Upper

Width of

confidence interval

(6)

Estimasi Titik

Kita dapat mengestimasi parameter population

Dengan statistik Sample

(Estimasi titik)

Mean

Proportion

p

μ x

p

(7)

Estimasi Titik

Suatu estimator titik menarik kesimpulan tentang populasi dengan mengestimasi nilai parameter yang tidak diketahui dengan menggunakan suatu nilai tunggal atau titik.

Telah diketahui sebelumnya bahwa peluang dari suatu titik pada distribusi kontinu adalah hampir nol. Dan juga

diharapkan bahwa suatu estimator titik akan semakin dekat dengan nilai parameter ketika ukuran sampel meningkat, tetapi estimator titik tidak mencerminkan pengaruh dari ukuran sampel yang membesar. Oleh karena itu kita akan menggunakan estimator interval untuk memperkirakan parameter populasi.

(8)

Formula Umum



Formula umum untuk semua interval kepercayaan adalah:

Point Estimate  (Critical Value)(Standard Error)

(9)

Confidence Intervals

Population Mean

σ Unknown

Confidence Intervals

Population Proportion

σ Known

(10)

Confidence Interval for μ (σ Known)

 Assumptions

 Population standard deviation σ is known

 Population is normally distributed

 If population is not normal, use large sample

 Confidence interval estimate

n z σ

x  _α/2

(11)

Finding the Critical Value

 Consider a 95% confidence interval:

z_.025= -1.96 z_.025= 1.96 .95

1 

2 .025

α  .025

2 α 

Point Estimate Lower

Upper

z units:

x units: Point Estimate

0

1.96 z_α/2  

(12)

Common Levels of Confidence



Commonly used confidence levels are 90%, 95%, and 99%

Confidence Level

Confidence

Coefficient, z value,

1.28 1.645 1.96 2.33 2.57 3.08 3.27 .80

.90 .95 .98 .99 .998 .999 80%

90%

95%

98%

99%

99.8%

99.9%





1

z

_^/2

(13)

μ μ_x 

Interval and Level of Confidence

Confidence Intervals

Intervals extend from

to

100(1-)%

of intervals constructed contain μ;

100% do not.

Sampling Distribution of the Mean

n z σ

x  __/2

n z σ

x  __/2

x

x₁ x₂

/2 1  /2

(14)

Contoh

 Sampel 11 sirkuit dari populasi normal yang besar memiliki rata-rata ketahanan 2,20

ohm. Dari pengujian terakhir diketahui

bahwa standar deviasi populasi adalah 0,35 ohm

 Tentukan 95% confidence interval bagi

rata-rata yang sebenarnya dari ketahanan populasi.

(15)

2.4068 ...

...

1.9932

.2068 2.20

) 11 (.35/

1.96 2.20

n z σ

x _/2









 _

Contoh

 Sampel 11 sirkuit dari populasi normal yang besar memiliki rata-rata ketahanan 2,20 ohm.

Dari pengujian terakhir diketahui bahwa standar deviasi populasi adalah 0,35 ohm.

 Solution:

(continued)

(16)

Interpretation

 Kita 95% percaya bahwa rata-rata sebenarnya dari ketahanan adalah antara 1,9932 dan

2,4068 ohms

(17)

Confidence Intervals

σ Unknown

σ Known

(18)



If the population standard deviation σ is unknown, we can substitute the sample standard deviation, s



This introduces extra uncertainty, since s is variable from sample to sample



So we use the t distribution instead of the normal distribution

Confidence Interval for μ

(σ Unknown)

(19)

 Assumptions

 Population standard deviation is unknown

 Population is normally distributed

 If population is not normal, use large sample

 Use Student’s t Distribution

 Confidence Interval Estimate

Confidence Interval for μ (σ Unknown)

n t s

x 

_

_/2

(continued)

(20)

Student’s t Distribution



The t is a family of distributions



The t value depends on degrees of freedom (d.f.)

 Number of observations that are free to vary after sample mean has been calculated

d.f. = n - 1

(21)

If the mean of these three values is 8.0,

then x₃ must be 9

(i.e., x₃ is not free to vary)

Degrees of Freedom (df)

Idea: Number of observations that are free to vary after sample mean has been calculated

Example: Suppose the mean of 3 numbers is 8.0 Let x₁ = 7

Let x₂ = 8 What is x₃?

Here, n = 3, so degrees of freedom = n -1 = 3 – 1 = 2

(2 values can be any numbers, but the third is not free to vary for a given mean)

(22)

Student’s t Distribution

0

t

t (df = 5) t (df = 13)

t-distributions are bell-

shaped and symmetric, but have ‘fatter’ tails than the normal

Standard Normal (t with df = )

Note: t z as n increases

(23)

Student’s t Table

Upper Tail Area df .25 .10 .05

1 1.000 3.078 6.314

2 0.817 1.886 2.920 3 0.765 1.638 2.353

0 _2.920 t

The body of the table contains t values, not probabilities

Let: n = 3 df = n - 1 = 2

 = .10

/2 =.05

^{/2 = .05}

(24)

t distribution values

With comparison to the z value

Confidence t t t z Level (10 d.f.) (20 d.f.) (30 d.f.) ____

.80 1.372 1.325 1.310 1.28 .90 1.812 1.725 1.697 1.64 .95 2.228 2.086 2.042 1.96 .99 3.169 2.845 2.750 2.57

Note: t z as n increases

(25)

Example

A random sample of n = 25 has x = 50 and s = 8. Form a 95% confidence interval for μ

 d.f. = n – 1 = 24, so

The confidence interval is

2.0639 t

t__/2_,_n_₁  _.025,24 

25 (2.0639) 8

n 50 t s

x 

__/2

 

46.698 ……….. 53.302

(26)

Approximation for Large Samples

 Since t approaches z as the sample size increases, an approximation is sometimes used when n  30:

n t s

x 

__/2

n z s

x 

__/2

Technically correct

Approximation for large n

(27)

Determining Sample Size



The required sample size can be found to reach a desired margin of error (e) and

level of confidence (1 -

)

 Required sample size, σ known:

2 /2

e

σ z

e

σ

n z 



 



 



^² ₂ ^

(28)

Required Sample Size Example

If  = 45, what sample size is needed to be 90% confident of being correct within ± 5?

(Always round up)

219.19 5

1.645(45) e

σ n z

2 2

/2

 



 



 

 

 



 

^

So the required sample size is n = 220

(29)

Confidence Intervals

σ Unknown

σ Known

(30)

Confidence Intervals for the Population Proportion, p



An interval estimate for the population

proportion ( p ) can be calculated by

adding an allowance for uncertainty to

the sample proportion ( p )

^

(31)

Confidence Intervals for the Population Proportion, p

 Recall that the distribution of the sample proportion is approximately normal if the

sample size is large, with standard deviation

 We will estimate this with sample data:

(continued)

ˆ

ˆ ˆ

p(1 p)

s

^p

n

 

n

p) σ

_p

p(1 



(32)

Confidence interval endpoints

 Upper and lower confidence limits for the

population proportion are calculated with the formula

 where

 z is the standard normal value for the level of confidence desired

 p is the sample proportion

 n is the sample size

/2

ˆ (1 ˆ )

ˆ p p

p z



n

 



(33)

Contoh



Sampel acak 100 orang menunjukkan bahwa 25 diantaranya adalah kidal.



Bentuklah 95% interval kepercayaan

untuk proporsi sebenarnya dari orang

kidal.

(34)

Contoh

 A random sample of 100 people shows that 25 are left-handed. Form a 95%

confidence interval for the true proportion of left-handers.

1.

2.

3.

ˆ

ˆ 25/100 0.25

ˆ ˆ

S

_p

(1 )/n 0.25(0.75)/n 0.0433 p

p p

 

   

0.3349 .

. . . . 0.1651

(.0433) 1.96

.25 

(continued)

(35)

Interpretation

 We are 95% confident that the true

percentage of left-handers in the population is between

16.51% and 33.49%.

(36)

Changing the sample size



Increases in the sample size reduce the width of the confidence interval.

Example:

 If the sample size in the above example is

doubled to 200, and if 50 are left-handed in the sample, then the interval is still centered at .25, but the width shrinks to

.19 …… .31

(37)

Finding the Required Sample Size for proportion problems

n

) p (

z p

e

_/2





_

1

Solve for n:

Define the

margin of error:

2 /2

e

) p (

p

n z 



^²

1

p can be estimated with a pilot sample, if necessary (or conservatively use p = .50)

 

(38)

What sample size...?



How large a sample would be necessary to estimate the true proportion defective in a large population within 3%, with 95%

confidence?

(Assume a pilot sample yields p = .12)

^

(39)

What sample size...?

Solution:

For 95% confidence, use Z = 1.96 e = .03

p = .12, so use this to estimate p

So use n = 451

2 2

/2

2 2

ˆ (1 ˆ ) (1.96) (.12)(1 .12)

450.74 (.03)

z p p

n e



 

  

(continued)



(40)

SOAL LATIHAN

(41)

Soal 1

Suatu perusahaan listrik yang membuat bola lampu yang panjang umurnya

berdistribusi hampir normal dengan

simpangan baku 40 jam. Bila sample 30 bola lampu berumur rata-rata 780 jam, hitunglah selang kepercayaan 96% untuk rata-rata populasi bola lampu yang

dihasilkan perusahaan tersebut

(42)

Soal 2

Berapa besarkah sampel diperlukan pada soal 1 bila diinginkan kepercayaan 96%

agar rata-rata sampel paling banyak meleset 10 jam dari rata-rata

sesungguhnya?

(43)

Soal 3

Sebuah mesin menghasilkan potongan logam yang berbentuk silinder. Sampel beberapa potongan diukur dan ternyata

diameternya 1,01 ; 0,97 ; 1,03 ; 1,04 ; 0,99 ; 0,98 ; 0,99 ; 1,01 ; dan 1,03 cm. hitunglah

selang kepercayaan 99% untuk rata-rata diameter potongan yang dihasilkan mesin tersebut bila dimisalkan distribusinya

hampir normal

(44)

Soal 4

Sampel acak 12 lulusan suatu sekolah

sekretaris mengetik rata-rata 79,3 kata per

menit dengan standar deviasi 7,8 kata per

menit. Anggap jumlah kata yang diketik per

menit berdistribusi normal. Buatlah selang

kepercayaan 95% untuk rata-rata jumlah

kata yang diketik per menit oleh semua

lulusan sekolah tadi

(45)

Soal 5

Suatu sampel acak 200 pemilih diambil dan ternyata 114 daripadanya mendukung

calon A. Hitunglah selang kepecayaan 96%

untuk proporsi populasi pemilih yang

mendukung calon A.

(46)

Soal 6

Dari suatu sampel acak 1000 rumah di

suatu kota ternyata 228 menggunakan gas elpiji. Cari selang kepercayaan 99% untuk proporsi rumah di kota tadi yang

menggunakan gas elpiji.

(47)

Soal 7

Suatu sistem peluncur roket tertentu sedang dipertimbangkan untuk dipakai meluncurkan sejumlah roket jarak pendek. Sistem yang sekarang mempunyai peluang berhasil

meluncurkan sebuat roket p=0,8. Sampel 40

peluncuran percobaan dengan system yang baru menunjukkan 34 yang berhasil.

a. Buatlah selang kepercayaan 95% untuk p.

b. Apakah kenyataannya cukup besar untuk

mendukung bahwa sistem yang baru ini lebih baik? Jelaskan.

pertemuan 8 estimasi satu populasi compress

Estimasi

Statistical Inference

Statistical Inference

X

Estimation

Estimasi Titik dan Interval

Estimasi Titik

p

μ x

p

Estimasi Titik

Formula Umum

Formula umum untuk semua interval kepercayaan adalah:

Confidence Intervals

Confidence Interval for μ (σ Known)

n z σ

x  α/2

Finding the Critical Value

Common Levels of Confidence

Commonly used confidence levels are 90%, 95%, and 99%

z

Interval and Level of Confidence

Contoh

Contoh

Interpretation

Confidence Intervals

If the population standard deviation σ is unknown, we can substitute the sample standard deviation, s

This introduces extra uncertainty, since s is variable from sample to sample

So we use the t distribution instead of the normal distribution

Confidence Interval for μ

(σ Unknown)

Confidence Interval for μ (σ Unknown)

n t s

x 

/2

Student’s t Distribution

The t is a family of distributions

The t value depends on degrees of freedom (d.f.)

d.f. = n - 1

Degrees of Freedom (df)

Student’s t Distribution

t

Student’s t Table

t distribution values

Example

25 (2.0639) 8

n 50 t s

x 

 

Approximation for Large Samples

n t s

x 

n z s

x 

Determining Sample Size

The required sample size can be found to reach a desired margin of error (e) and

level of confidence (1 -

e

σ z

e

σ

n z 



 



 



Required Sample Size Example

219.19 5

1.645(45) e

σ n z

 



 



 

 

 



x  _α/2

_/2