

In document PREFACE - Spada UNS (Pages 81-84)

REGRESSION MODEL: THE PROBLEM OF ESTIMATION

3.3 PRECISION OR STANDARD ERRORS OF LEAST-SQUARES ESTIMATES

From Eqs. (3.1.6) and (3.1.7), it is evident that least-squares estimates are a function of the sample data. But since the data are likely to change from sample to sample, the estimates will change ipso facto. Therefore, what is needed is some measure of "reliability" or precision of the estimators β̂1 and β̂2. In statistics the precision of an estimate is measured by its standard error (se).17 Given the Gaussian assumptions, it is shown in Appendix 3A, Section 3A.3, that the standard errors of the OLS estimates can be obtained


17 The standard error is nothing but the standard deviation of the sampling distribution of the estimator, and the sampling distribution of an estimator is simply a probability or frequency distribution of the estimator, that is, a distribution of the set of values of the estimator obtained from all possible samples of the same size from a given population. Sampling distributions are used to draw inferences about the values of the population parameters on the basis of the values of the estimators calculated from one or more samples. (For details, see App. A.)

as follows:

\[
\operatorname{var}(\hat{\beta}_2) = \frac{\sigma^2}{\sum x_i^2} \tag{3.3.1}
\]
\[
\operatorname{se}(\hat{\beta}_2) = \frac{\sigma}{\sqrt{\sum x_i^2}} \tag{3.3.2}
\]
\[
\operatorname{var}(\hat{\beta}_1) = \frac{\sum X_i^2}{n \sum x_i^2}\,\sigma^2 \tag{3.3.3}
\]
\[
\operatorname{se}(\hat{\beta}_1) = \sqrt{\frac{\sum X_i^2}{n \sum x_i^2}}\;\sigma \tag{3.3.4}
\]

where var = variance and se = standard error and where σ² is the constant or homoscedastic variance of uᵢ of Assumption 4.
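As a concrete check on these formulas, the estimates and their standard errors can be computed directly from a sample. The sketch below is illustrative only: it uses NumPy and a small made-up data set (any X, Y pairs would do), with the unknown σ² replaced by its estimate σ̂² from Eq. (3.3.5) below, as is done in practice.

```python
import numpy as np

def ols_precision(X, Y):
    """Return (b1, b2, se_b1, se_b2) for the two-variable model
    Y = beta1 + beta2*X + u, using Eqs. (3.3.1)-(3.3.5) with the
    unknown sigma^2 replaced by its estimate sigma_hat^2."""
    n = X.size
    x = X - X.mean()                       # deviations x_i = X_i - Xbar
    y = Y - Y.mean()
    b2 = (x * y).sum() / (x**2).sum()      # slope estimate, Eq. (3.1.6)
    b1 = Y.mean() - b2 * X.mean()          # intercept estimate, Eq. (3.1.7)
    rss = ((Y - b1 - b2 * X) ** 2).sum()   # residual sum of squares
    sigma2_hat = rss / (n - 2)             # Eq. (3.3.5)
    var_b2 = sigma2_hat / (x**2).sum()                       # Eq. (3.3.1)
    var_b1 = (X**2).sum() * sigma2_hat / (n * (x**2).sum())  # Eq. (3.3.3)
    return b1, b2, np.sqrt(var_b1), np.sqrt(var_b2)  # se: Eqs. (3.3.4), (3.3.2)

# Hypothetical data, for illustration only.
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.0, 4.0, 5.0, 4.0, 5.0])
b1, b2, se_b1, se_b2 = ols_precision(X, Y)
```

These values should agree with the coefficient and standard-error columns reported by any standard regression routine applied to the same data.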

All the quantities entering into the preceding equations except σ² can be estimated from the data. As shown in Appendix 3A, Section 3A.5, σ² itself is estimated by the following formula:

\[
\hat{\sigma}^2 = \frac{\sum \hat{u}_i^2}{n-2} \tag{3.3.5}
\]

where σ̂² is the OLS estimator of the true but unknown σ² and where the expression n − 2 is known as the number of degrees of freedom (df), Σû²ᵢ being the sum of the residuals squared or the residual sum of squares (RSS).18

Once Σû²ᵢ is known, σ̂² can be easily computed. Σû²ᵢ itself can be computed either from (3.1.2) or from the following expression (see Section 3.5 for the proof):

\[
\sum \hat{u}_i^2 = \sum y_i^2 - \hat{\beta}_2^2 \sum x_i^2 \tag{3.3.6}
\]

Compared with Eq. (3.1.2), Eq. (3.3.6) is easy to use, for it does not require computing ûᵢ for each observation, although such a computation will be useful in its own right (as we shall see in Chapters 11 and 12).
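The equivalence of the observation-by-observation route (3.1.2) and the shortcut (3.3.6) is easy to verify numerically. A minimal sketch, using made-up data:

```python
import numpy as np

# Hypothetical sample, for illustration only.
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
Y = np.array([2.0, 4.0, 5.0, 4.0, 5.0])

x, y = X - X.mean(), Y - Y.mean()          # deviations from sample means
b2 = (x * y).sum() / (x**2).sum()          # slope estimate
b1 = Y.mean() - b2 * X.mean()              # intercept estimate

rss_long = ((Y - b1 - b2 * X) ** 2).sum()       # Eq. (3.1.2): residual by residual
rss_short = (y**2).sum() - b2**2 * (x**2).sum() # Eq. (3.3.6): the shortcut
```

Both routes give the same residual sum of squares, up to floating-point rounding.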

Since

\[
\hat{\beta}_2 = \frac{\sum x_i y_i}{\sum x_i^2}
\]

18 The term number of degrees of freedom means the total number of observations in the sample (= n) less the number of independent (linear) constraints or restrictions put on them. In other words, it is the number of independent observations out of a total of n observations. For example, before the RSS (3.1.2) can be computed, β̂1 and β̂2 must first be obtained. These two estimates therefore put two restrictions on the RSS. Therefore, there are n − 2, not n, independent observations to compute the RSS. Following this logic, in the three-variable regression RSS will have n − 3 df, and for the k-variable model it will have n − k df. The general rule is this: df = (n − number of parameters estimated).

an alternative expression for computing Σû²ᵢ is

\[
\sum \hat{u}_i^2 = \sum y_i^2 - \frac{\left(\sum x_i y_i\right)^2}{\sum x_i^2} \tag{3.3.7}
\]

In passing, note that the positive square root of σ̂²,

\[
\hat{\sigma} = \sqrt{\frac{\sum \hat{u}_i^2}{n-2}} \tag{3.3.8}
\]

is known as the standard error of estimate or the standard error of the regression (se). It is simply the standard deviation of the Y values about the estimated regression line and is often used as a summary measure of the "goodness of fit" of the estimated regression line, a topic discussed in Section 3.5.

Earlier we noted that, given Xᵢ, σ² represents the (conditional) variance of both uᵢ and Yᵢ. Therefore, the standard error of the estimate can also be called the (conditional) standard deviation of uᵢ and Yᵢ. Of course, as usual, σY² and σY represent, respectively, the unconditional variance and unconditional standard deviation of Y.

Note the following features of the variances (and therefore the standard errors) of β̂1 and β̂2.

1. The variance of β̂2 is directly proportional to σ² but inversely proportional to Σxᵢ². That is, given σ², the larger the variation in the X values, the smaller the variance of β̂2 and hence the greater the precision with which β2 can be estimated. In short, given σ², if there is substantial variation in the X values (recall Assumption 8), β2 can be measured more accurately than when the Xᵢ do not vary substantially. Also, given Σxᵢ², the larger σ², the larger the variance of β̂2. Note that as the sample size n increases, the number of terms in the sum Σxᵢ² will increase. As n increases, the precision with which β2 can be estimated also increases. (Why?)

2. The variance of β̂1 is directly proportional to σ² and ΣXᵢ² but inversely proportional to Σxᵢ² and the sample size n.
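Point 1 can be illustrated with two hypothetical regressor designs of the same size and the same (assumed) error variance, one with little spread in X and one with substantial spread:

```python
import numpy as np

sigma2 = 1.0  # assumed homoscedastic error variance, common to both designs

def var_b2(X):
    """Variance of the slope estimator per Eq. (3.3.1)."""
    x = X - X.mean()
    return sigma2 / (x**2).sum()

X_narrow = np.array([4.0, 4.5, 5.0, 5.5, 6.0])  # little variation in X
X_wide   = np.array([1.0, 3.0, 5.0, 7.0, 9.0])  # substantial variation in X

v_narrow = var_b2(X_narrow)   # Sum x_i^2 = 2.5  -> variance 0.4
v_wide   = var_b2(X_wide)     # Sum x_i^2 = 40.0 -> variance 0.025
```

With the same error variance, the widely spread design estimates the slope far more precisely, exactly as the inverse proportionality to Σxᵢ² implies.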

3. Since β̂1 and β̂2 are estimators, they will not only vary from sample to sample but in a given sample they are likely to be dependent on each other, this dependence being measured by the covariance between them. It is shown in Appendix 3A, Section 3A.4, that

\[
\operatorname{cov}(\hat{\beta}_1, \hat{\beta}_2) = -\bar{X}\,\operatorname{var}(\hat{\beta}_2) = -\bar{X}\,\frac{\sigma^2}{\sum x_i^2} \tag{3.3.9}
\]

Since var(β̂2) is always positive, as is the variance of any variable, the nature of the covariance between β̂1 and β̂2 depends on the sign of X̄. If X̄ is positive, then as the formula shows, the covariance will be negative. Thus, if the slope coefficient β2 is overestimated (i.e., the slope is too steep), the intercept coefficient β1 will be underestimated (i.e., the intercept will be too small).
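Equation (3.3.9) can be checked by simulation. In the sketch below, the X values, the "true" parameters, and the error variance are all made up for illustration; drawing many samples of the same size and re-estimating β1 and β2 each time, the sampling covariance of the two estimators should come out close to −X̄σ²/Σxᵢ², and negative since X̄ > 0:

```python
import numpy as np

rng = np.random.default_rng(42)
X = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # fixed regressors; Xbar = 3 > 0
x = X - X.mean()
beta1, beta2, sigma = 2.0, 0.5, 1.0       # assumed "true" values for the simulation

# Draw many samples of the same size and re-estimate beta1, beta2 in each.
reps = 200_000
Y = beta1 + beta2 * X + rng.normal(0.0, sigma, size=(reps, X.size))
y = Y - Y.mean(axis=1, keepdims=True)
b2 = (y * x).sum(axis=1) / (x**2).sum()   # slope estimate, sample by sample
b1 = Y.mean(axis=1) - b2 * X.mean()       # intercept estimate, sample by sample

emp_cov = np.cov(b1, b2)[0, 1]                    # empirical sampling covariance
theo_cov = -X.mean() * sigma**2 / (x**2).sum()    # Eq. (3.3.9): -3 * (1/10) = -0.3
```

The empirical covariance is negative and agrees with the theoretical value up to Monte Carlo error, confirming the overestimated-slope/underestimated-intercept trade-off described above.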

Later on (especially in the chapter on multicollinearity, Chapter 10), we will see the utility of studying the covariances between the estimated regression coefficients.

How do the variances and standard errors of the estimated regression coefficients enable one to judge the reliability of these estimates? This is a problem in statistical inference, and it will be pursued in Chapters 4 and 5.
