Non-parametric confidence intervals of instantaneous forward rates
Jacques F. Carriere
∗Department of Mathematical Sciences, University of Alberta, 632 Central Academic Building, Edmonton, Alta., Canada T6G 2G1
Received 17 March 1998; accepted 19 January 2000
Abstract
Using the price of US Treasury Strips, we show how to estimate forward rates with spline models. Confidence intervals on these rates are constructed with bootstrap methods. An unusual feature of the data is the heteroscedastic and correlated error structure. © 2000 Elsevier Science B.V. All rights reserved.
Keywords: Term structure of interest rates; Instantaneous forward rates; Strip bonds; Bootstrapping; Splines; Correlation; Heteroscedasticity
1. Introduction
Let b_m denote the price of a default-free zero-coupon bond at a fixed time that matures m years in the future with
a redemption value of $1. Next, let f_m denote the instantaneous forward rate for a maturity of m, thus

f_m = −(∂/∂m) log_e[b_m]. (1.1)
In this paper we will focus on the estimation of f_m and the construction of confidence intervals for it. A large
body of research exists on methods of estimation for f_m, b_m and the yield rate −log_e(b_m)/m. Consult Anderson
et al. (1996) for a comprehensive book on this topic. The estimation methods can be grouped into parametric and non-parametric techniques. An example of a good-fitting parametric formula is the one presented by Svensson (1994). However, our view is that the best-fitting formulas are non-parametric ones like splines. Some examples of papers that have used splines are Shea (1985), Vasicek and Fong (1982) and Delbaen and Lorimier (1992). In all these papers special attention was given to the estimation problem, but little attention was given to the inference issues. The estimation of f_m is a relatively easy exercise; however, the construction of confidence intervals and the
characterization of the distribution of the estimator are not trivial because of heteroscedasticity and correlation in the observed values.
We will estimate f_m by using price data on Treasury Strips from the Wall Street Journal (WSJ). We prefer to use
the prices for US Treasury Strips because the ask and bid prices on a strip are approximately equal to the price function
b_m. The WSJ gives the bid and asked prices for “stripped coupon interest”, “stripped Treasury Bond principal”,
∗ Tel.: +1-403-492-3396; fax: +1-403-492-6826.
E-mail address: jacques [email protected] (J.F. Carriere)
and “stripped Treasury Note principal” at various maturities. Let N denote the total number of distinct maturities in the published data at a particular time and let m_k > 0 for k = 1, 2, ..., N denote the observed maturities at that
time. Observed maturities were always < 30 years. Also, let B_k denote the average of the bid and asked prices for
all strips with an observed maturity of m_k, thus B_k ≈ b_{m_k}. It will be convenient to define B_0 = 1 and m_0 = 0. Using B_k, we can calculate the observed forward rates as follows:
F_k = −(log_e[B_k] − log_e[B_{k−1}])/(m_k − m_{k−1}) (1.2)

for k = 1, 2, ..., N. This observed rate corresponds to a maturity of

m̄_k = (m_k + m_{k−1})/2. (1.3)
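The computation in Eqs. (1.2) and (1.3) can be sketched as follows; the maturities and prices below are hypothetical illustrations, not the WSJ data used in the paper.

```python
import numpy as np

# Hypothetical maturities (in years) and averaged bid/ask strip prices B_k.
m = np.array([0.5, 1.0, 2.0, 5.0, 10.0, 30.0])
B = np.array([0.96, 0.93, 0.86, 0.68, 0.47, 0.12])

# Prepend B_0 = 1 at m_0 = 0, per the convention in the text.
m_full = np.concatenate(([0.0], m))
B_full = np.concatenate(([1.0], B))

# Eq. (1.2): observed forward rates from log-price differences.
F = -np.diff(np.log(B_full)) / np.diff(m_full)

# Eq. (1.3): each observed rate is attributed to the midpoint maturity.
m_bar = (m_full[1:] + m_full[:-1]) / 2.0
```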
Fig. 1 is a plot of F_k versus m̄_k for data taken from the October 11, 1990 issue of the WSJ. In this case, there were
N = 121 observations. Theoretically, f_m > 0, but note that F_k < 0 at two maturities. Also note that the variance
does not look constant as a function of maturity. All the graphs and calculations in this paper were done with GAUSS.
2. A spline model
Our model for the forward-rate curve is the truncated-power spline

f_m = Σ_{i=0}^{q} φ_i m^i + Σ_{j=1}^{n−1} ξ_j (m − κ_j)^q 1(m > κ_j), (2.1)

where κ_1 < ··· < κ_{n−1} are interior knots and 1(e) is an indicator function that is equal to 1 if the event e is true and 0 otherwise. This spline is a piecewise polynomial of degree q with q − 1 continuous derivatives at the interior knots. The parameter q = 1, 2, ... is fixed and it controls the smoothness. The parameter n = 1, 2, ... controls the fit and it is also fixed. In this paper, we will let q = 2 and n = 12. The unknown parameters of this spline are φ_0, ..., φ_q and ξ_1, ..., ξ_{n−1}. Consult Seber
and Wild (1989) for more information about splines in regression analysis.
At this point, it will be convenient to introduce some matrix notation. Let p = q + n denote the total number of parameters in the spline model and let β denote a p × 1 parameter vector that is defined as

β = [φ_0, ..., φ_q, ξ_1, ..., ξ_{n−1}]′. (2.2)

Next, let x(m) denote the p × 1 vector of basis functions

x(m) = [1, m, ..., m^q, (m − κ_1)^q 1(m > κ_1), ..., (m − κ_{n−1})^q 1(m > κ_{n−1})]′, (2.3)

so that f_m = x′(m)β. Our statistical model for the observed forward rates is

F_k = x′(m̄_k)β + ε_k, (2.4)

where ε_1, ε_2, ..., ε_N are random variables that are not necessarily independent and identically distributed. Let
ε = (ε_1, ..., ε_N)′, let F = (F_1, ..., F_N)′ and let

X = [x(m̄_1), ..., x(m̄_N)]′ (2.5)

denote the N × p design matrix. These definitions allow us to write our linear model in matrix notation. Specifically,

F = Xβ + ε. (2.6)
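Under the truncated-power representation of the spline, the basis vector x(m) and the design matrix X can be built as below. This is a sketch, not the author's GAUSS code; the function names are illustrative, and the basis ordering mirrors the parameter vector β.

```python
import numpy as np

def x_of_m(m, q=2, n=12, horizon=30.0):
    """Truncated-power spline basis at maturity m.

    Knots are kappa_j = horizon * j / n for j = 1, ..., n - 1, as in the text.
    Returns a vector of length p = q + n.
    """
    kappa = horizon * np.arange(1, n) / n
    powers = [m**i for i in range(q + 1)]             # phi_0, ..., phi_q terms
    truncated = [max(m - k, 0.0)**q for k in kappa]   # xi_1, ..., xi_{n-1} terms
    return np.array(powers + truncated)

def design_matrix(m_bar, q=2, n=12):
    """Stack the basis rows x'(m_bar_k) into the N x p matrix X of Eq. (2.6)."""
    return np.vstack([x_of_m(mk, q, n) for mk in m_bar])
```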
In this paper, we will estimate β by a weighted least squares (WLS) method. Let W denote an N × N diagonal matrix where all the off-diagonal elements are equal to zero. Using our matrix notation, we can write the WLS loss function as

L(β) = [F − Xβ]′ W [F − Xβ]. (2.7)

Note that in this definition, W is assumed to be fixed. Usually, we want to minimize L(β) subject to a linear constraint Aβ = µ. An example of a desirable linear constraint is 1′Xβ = 1′F, where 1 is a column vector of ones. In this case A = 1′X and µ = 1′F. The constrained WLS estimator is the value that minimizes L(β) subject to the constraint. That value is equal to
β̂ = (X′WX)^{−1}[X′WF + A′λ], (2.8)

where

λ = [A(X′WX)^{−1}A′]^{−1}[µ − A(X′WX)^{−1}X′WF]. (2.9)

Using this estimator, we can estimate the forward rates with f̂_m = x′(m)β̂. We found a very good-fitting
curve by using the constraint 1′Xβ = 1′F and a weighting matrix equal to the identity matrix. The fixed parameters were q = 2, n = 12 and κ_j = 30j/n for j = 1, 2, ..., n − 1. The residuals are defined as

e_k = F_k − f̂_{m̄_k}. (2.10)
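A minimal numerical sketch of the constrained estimator in Eqs. (2.8) and (2.9); `constrained_wls` is a hypothetical helper name, not a routine from the paper.

```python
import numpy as np

def constrained_wls(X, F, W, A, mu):
    """Constrained WLS: minimize (F - Xb)'W(F - Xb) subject to Ab = mu."""
    XtWX = X.T @ W @ X
    XtWF = X.T @ W @ F
    G = np.linalg.inv(XtWX)
    # Eq. (2.9): Lagrange multiplier enforcing the linear constraint.
    lam = np.linalg.solve(A @ G @ A.T, mu - A @ G @ XtWF)
    # Eq. (2.8): constrained estimator.
    return G @ (XtWF + A.T @ lam)
```

With the constraint of the text, A = 1′X (a 1 × p row) and µ = 1′F (a scalar), so the fitted rates reproduce the sample total of the observed rates.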
These residuals were unusual because they exhibited some heteroscedasticity and correlation.
3. Estimation of the variance structure
Traditionally, statisticians will assume that ε_1, ε_2, ..., ε_N are independent and identically distributed random
variables. However, the residuals in our case do not exhibit that behaviour. We will assume that Var(ε_k) = σ²_{g(k)}. In
this definition, g(k) is a map of {1, 2, ..., N} onto {1, 2, ..., n} that is defined as

g(k) = j whenever m̄_k ∈ (κ_{j−1}, κ_j], (3.1)

with κ_0 = 0 and κ_n = 30. Thus, the variance is σ²_j on the interval (κ_{j−1}, κ_j] and we get an estimable heteroscedastic error structure. It will be
useful to define the standard deviation matrix S. This is a diagonal matrix where the diagonal elements are equal to σ_{g(k)} and all the off-diagonal elements are zero. Now let us describe how S can be estimated. The number of
observations in the interval (κ_{j−1}, κ_j] is

N_j = Σ_{k=1}^{N} 1(g(k) = j), (3.2)

and a natural estimate of the group variance is σ̂²_j = N_j^{−1} Σ_{k: g(k)=j} e²_k.
In Section 2, we showed how an estimator of β can be constructed for a fixed weighting matrix W, without discussing what W should be. Ideally, we would let

W = [SS]^{−1}. (3.3)
However, S is not known. Thus, we recommend that β and S be estimated by iterated weighted least squares. This method will generate a sequence of estimators, denoted as β̂_1, β̂_2, β̂_3, ... and Ŝ_1, Ŝ_2, Ŝ_3, .... As a starting value,
we suggest that Ŝ_1 be set to the identity matrix. Let β̂_1 be the estimator in this case. Using β̂_1, we can calculate the
residuals and estimate Ŝ_2. In turn, β̂_2 is found by letting W = [Ŝ_2 Ŝ_2]^{−1}. The process is repeated until we have
convergence in successive estimates of the parameters. Table 1 shows σ̂_j for j = 1, 2, ..., n = 12. Note that the
estimates have essentially converged by the fifth iteration.
Table 1
Estimates of the standard deviations

Parameter   Iteration 1   Iteration 2   Iteration 3   Iteration 4   Iteration 5
σ̂_1         1.00          0.00280       0.00279       0.00280       0.00280
σ̂_2         1.00          0.00413       0.00390       0.00385       0.00384
σ̂_3         1.00          0.01006       0.01009       0.01009       0.01009
σ̂_4         1.00          0.00889       0.00898       0.00900       0.00900
σ̂_5         1.00          0.00598       0.00548       0.00544       0.00543
σ̂_6         1.00          0.00982       0.00979       0.00978       0.00978
σ̂_7         1.00          0.01237       0.01273       0.01285       0.01287
σ̂_8         1.00          0.06292       0.06299       0.06301       0.06301
σ̂_9         1.00          0.00587       0.00492       0.00482       0.00480
σ̂_10        1.00          0.02288       0.02282       0.02281       0.02280
σ̂_11        1.00          0.03222       0.03263       0.03275       0.03277
σ̂_12        1.00          0.04663       0.04663       0.04664       0.04664
Fig. 2. Spline estimates of the forward rates. This is a plot of F_k, f̂^1_{m̄_k} and f̂^4_{m̄_k}.
Let f̂^i_{m̄_k} = x′(m̄_k)β̂_i denote the estimated forward rates at iteration i = 1, 2, .... We suggest that the iterations
should stop whenever the average relative error (ARE) is less than 0.001, i.e.,

ARE_{i+1} = (1/N) Σ_{k=1}^{N} |f̂^{i+1}_{m̄_k}/f̂^{i}_{m̄_k} − 1| < 0.001, (3.4)
where N = 121. Note that the ARE threshold of 0.001 could be made smaller, but the effect on the estimated values would be minimal. In our application we found that ARE_2 = 0.0274, ARE_3 = 0.0047 and ARE_4 = 0.0007. Thus, we stopped
after the fourth iteration. Fig. 2 is a plot of the observed rates F_k against the estimated rates from the first and fourth
iterations.
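The iteration of Section 3 together with the stopping rule of Eq. (3.4) can be sketched as below. For brevity the sketch uses unconstrained WLS and takes the within-interval mean squared residual as the group variance estimate; both are assumptions standing in for the paper's exact recipe.

```python
import numpy as np

def iterated_wls(X, F, m_bar, n=12, horizon=30.0, tol=1e-3, max_iter=20):
    """Alternate between beta-hat and the diagonal S-hat until the ARE is small."""
    N = len(F)
    kappa = horizon * np.arange(n + 1) / n               # kappa_0 = 0, ..., kappa_n = 30
    group = np.digitize(m_bar, kappa[1:-1], right=True)  # g(k) - 1, in {0, ..., n - 1}
    s = np.ones(N)                                       # S-hat_1 = identity
    f_prev = None
    for _ in range(max_iter):
        W = np.diag(1.0 / s**2)                          # W = [SS]^{-1}
        beta = np.linalg.solve(X.T @ W @ X, X.T @ W @ F)
        f_hat = X @ beta
        e = F - f_hat
        # Re-estimate sigma_j from the residual spread within each knot interval.
        sigma = np.array([np.sqrt(np.mean(e[group == j] ** 2)) for j in range(n)])
        s = sigma[group]
        # Eq. (3.4): stop once the average relative error falls below tol.
        if f_prev is not None and np.mean(np.abs(f_hat / f_prev - 1.0)) < tol:
            break
        f_prev = f_hat
    return beta, sigma
```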
Next, let σ̂_{g(k)} denote the estimated standard deviations and let Ŝ denote the associated matrix from the last
iteration. Also, let e = (e_1, e_2, ..., e_N)′ denote the residuals from the last iteration. Our statistical model suggests
that the standardized errors

ẽ = Ŝ^{−1}e (3.5)

should not exhibit any heteroscedasticity. If ẽ_k is the kth element of ẽ, then ẽ_k = e_k/σ̂_{g(k)}. Fig. 3 is a plot of ẽ_k
versus m̄_k. Note the lack of heteroscedasticity.
4. Estimation of the correlation structure
Traditionally, statisticians will assume that ε = (ε_1, ε_2, ..., ε_N)′ is a vector of uncorrelated random variables. As we will
show, the residuals in our case do not exhibit that behaviour. Let R = {ρ_{|i−j|}}_{i,j=1,...,N} denote a positive definite
correlation matrix where the diagonal elements are equal to ρ_0 = 1 and the off-diagonal elements are in the interval (−1, 1), i.e., −1 < ρ_{|i−j|} < 1. We assume that there exists a lag, denoted as l, such that ρ_{|i−j|} = 0 if |i−j| ≥ l + 1.
Thus, R is a banded matrix with elements in the lower-left and upper-right corners equal to zero. In this definition, the correlation structure corresponds to a moving-average process with a lag of l. We will assume that

Var(ε) = SRS. (4.1)

Next, define ε̃ = S^{−1}ε. Thus Var(ε̃) = R. Next, let R_ch be the lower triangular matrix from a Choleski decomposition
of R, thus R = R_ch R′_ch. It is instructive to note that Var(R_ch^{−1}ε̃) is simply equal to the identity matrix.
Now let us describe how R can be estimated. Let ẽ = (ẽ_1, ..., ẽ_N)′ denote the standardized residuals from our
last iteration. Then, a standard non-parametric estimator of ρ_u for u ≥ 1 is

ρ̂_u = Σ_{k=1}^{N−u} ẽ_k ẽ_{k+u} / Σ_{k=1}^{N} ẽ²_k. (4.2)

Under the null hypothesis that ρ_u = 0, the test statistic

T = √N ρ̂_u (4.3)

is approximately normal with a mean of zero and a variance of one. At the 1% level of significance, we would reject the null hypothesis if |T| > 2.57. Table 2 is a summary of ρ̂_u and T for u = 1, 2, ..., 10 and N = 121.
Note that we cannot reject the null hypothesis of no correlation for lags of two or more. However, we can reject the hypothesis that ρ_1 = 0. Therefore, we will assume that l = 1 throughout the rest of the paper.
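The estimator and test statistic can be sketched as follows; the demeaning step and the √N scaling are the usual conventions, assumed here to match Eqs. (4.2) and (4.3).

```python
import numpy as np

def rho_hat(e_tilde, u):
    """Sample lag-u autocorrelation of the standardized residuals."""
    e = e_tilde - e_tilde.mean()
    return float(np.sum(e[:-u] * e[u:]) / np.sum(e * e))

def T_stat(e_tilde, u):
    """Approximately N(0, 1) under the null hypothesis rho_u = 0."""
    return np.sqrt(len(e_tilde)) * rho_hat(e_tilde, u)
```

With N = 121 observations, |T| > 2.57 flags a lag as significant at the 1% level.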
Using ρ̂_u we can construct an estimator of R, denoted as R̂. A major weakness of the moment-type estimator R̂
is that R̂ is not positive definite in certain cases. Positive definiteness is a necessary property for doing a Choleski decomposition. Let us describe how to construct a positive definite estimator of R. A necessary and sufficient condition for R to be a positive definite matrix is that the smallest eigenvalue of R (denoted as λ_1) must be positive.
Therefore, we suggest that R̂ be equal to the matrix that minimizes

L(R) = log_e[det(R)] + ẽ′R^{−1}ẽ, (4.4)

subject to the constraint that λ_1 > 0. To find R̂ we need an iterative optimization program. We found that the
moment-type estimator for R is a good starting value for the iterations. Using this method we found that ρ̂_1 = −0.399. Finally, define the standardized and uncorrelated residuals

ê = R̂_ch^{−1}Ŝ^{−1}e, (4.5)

where e are the residuals from the last iteration. The residuals ê will be useful for constructing confidence intervals by the bootstrap method.
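A numpy-only sketch of minimizing Eq. (4.4) in the case l = 1: R is then tridiagonal with ρ_1 on the first off-diagonals, so a one-dimensional grid search (restricted to |ρ_1| < 0.5, which keeps R strictly diagonally dominant and hence positive definite) can stand in for the iterative optimizer described in the text.

```python
import numpy as np

def fit_rho1(e_tilde, grid=None):
    """Minimize L(R) = log det(R) + e'R^{-1}e over tridiagonal correlation matrices.

    With lag l = 1 the only free parameter is rho_1; restricting the grid to
    |rho_1| < 0.5 guarantees positive definiteness.
    """
    if grid is None:
        grid = np.linspace(-0.49, 0.49, 99)
    N = len(e_tilde)
    off = np.eye(N, k=1) + np.eye(N, k=-1)

    def loss(rho):
        R = np.eye(N) + rho * off
        sign, logdet = np.linalg.slogdet(R)
        return logdet + e_tilde @ np.linalg.solve(R, e_tilde)

    losses = [loss(r) for r in grid]
    return float(grid[int(np.argmin(losses))])
```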
5. Confidence intervals by bootstrapping
Up to now, our focus has been on the estimation of the spline model without touching on the inference issues. In this section, we will show how to do inferences, like constructing confidence intervals, with the bootstrap method. For a detailed discussion of the bootstrap, the reader can consult the classical work of Efron (1982).
Let ε̂ = R_ch^{−1}S^{−1}ε and let ε̂_k denote the kth element of ε̂. We will assume that ε̂_1, ε̂_2, ..., ε̂_N are independent
and identically distributed random variables with a common cumulative distribution function (CDF) of H(x). Usually, statisticians will assume that H(x) is a standard normal distribution. In our case, this assumption is not unreasonable. Using the standardized and uncorrelated residual errors ê_k from the spline regression, we grouped
the data and conducted a chi-squared goodness-of-fit test with 13 degrees of freedom. The test statistic was 14.62; therefore we cannot reject the null hypothesis that H(x) is a standard normal CDF at the 5% level of significance, because 14.62 < 22.36, the 95th percentile of a chi-squared distribution with 13 degrees of freedom.
Fig. 4 is a plot of a standard normal density and a kernel density estimate based on the residual errors ê_k.
This density estimate used a normal kernel with an optimal smoothing parameter. Note how close the kernel
density estimate is to the normal density. See Simonoff (1996) for more information about kernel density estimates.
Next, let us construct an empirical CDF of H(x) by using the standardized and uncorrelated residual errors ê_k
from the spline regression. The empirical CDF is

Ĥ(x) = (1/N) Σ_{k=1}^{N} 1(ê_k ≤ x). (5.1)
The traditional bootstrap method requires that we re-sample from Ĥ(x). However, an alternate method is to simply re-sample from a standard normal distribution. This alternate method is valid because we could not reject the null hypothesis of normality. Let us proceed in the traditional manner. Thus, ê^boot_{l,k} for k = 1, 2, ..., N and
l = 1, 2, ..., B are N × B independent and identically distributed random variables with a common CDF of
Ĥ(x). The value B is the number of bootstrap simulations and it should be made as large as possible. Let ê^boot_l = (ê^boot_{l,1}, ..., ê^boot_{l,N})′. Using this simulated data, we calculate the heteroscedastic and correlated errors

e^boot_l = ŜR̂_ch ê^boot_l. (5.2)
Next, we calculate

F^boot_l = Xβ̂ + e^boot_l, (5.3)

where β̂ is the estimate from the last iteration.
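One replication of the bootstrap data-generating step in Eqs. (5.2) and (5.3) can be sketched as follows; `bootstrap_sample` is a hypothetical helper name.

```python
import numpy as np

def bootstrap_sample(X, beta_hat, S_hat, R_ch_hat, e_hat, rng):
    """Generate one bootstrap sample F_l^boot.

    Resamples the standardized, uncorrelated residuals e_hat with replacement
    (i.e., draws from the empirical CDF H-hat; drawing from N(0, 1) instead is
    the alternate method mentioned in the text), then restores the
    heteroscedastic, correlated error structure.
    """
    N = len(e_hat)
    e_star = rng.choice(e_hat, size=N, replace=True)  # i.i.d. draws from H-hat
    e_boot = S_hat @ R_ch_hat @ e_star                # Eq. (5.2)
    return X @ beta_hat + e_boot                      # Eq. (5.3)
```

Repeating this B times and re-running the full estimation on each F_l^boot yields the bootstrap distribution used below.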
Using the simulated data, we apply the same estimation formulas as before and derive the bootstrap estimates
β̂^boot_l, Ŝ^boot_l and R̂^boot_l for l = 1, 2, ..., B. Specifically, we use estimates from the fourth iteration of the iterated WLS, while the estimates of S and R are based on the residuals from the last iteration and the estimate of R is found by minimizing Eq. (4.4). Next, let g(β, S, R) be any real-valued function of the parameters β, S and R. Also, let Ĝ(x) denote the empirical CDF of the bootstrap values g(β̂^boot_l, Ŝ^boot_l, R̂^boot_l). The approximate 97.5th percentile is a value ξ_{0.975} such that

ξ_{0.975} = inf{x : Ĝ(x) ≥ 0.975}. (5.4)

Applying the bootstrap method with B = 2000 yielded confidence intervals for all our parameters. First, a 95% confidence interval for the correlation ρ_1 was [−0.4501, −0.3588], which includes our estimate of ρ̂_1 = −0.399.
Next, a 95% confidence interval for σ_12 was [0.0182, 0.0762] and again our estimate of σ̂_12 = 0.04664 lies in this interval.
Fig. 5. Confidence intervals for the forward rates.
References

Anderson, N., Breedon, F., Deacon, M., Derry, A., Murphy, G., 1996. Estimating and Interpreting the Yield Curve. Wiley, New York.
Delbaen, F., Lorimier, S., 1992. Estimation of the yield curve and the forward rate curve starting from a finite number of observations. Insurance: Mathematics and Economics 11, 259–269.
Efron, B., 1982. The Jackknife, the Bootstrap and Other Resampling Plans. Society for Industrial and Applied Mathematics, Philadelphia, PA.
Seber, G.A.F., Wild, C.J., 1989. Nonlinear Regression. Wiley, New York.
Shea, G.S., 1985. Interest rate term structure estimation with exponential splines: a note. The Journal of Finance 40, 319–325.
Simonoff, J.S., 1996. Smoothing Methods in Statistics. Springer, New York.
Svensson, L., 1994. Estimating and interpreting forward interest rates: Sweden 1992–1994. International Monetary Fund Working Paper No. 114.
Vasicek, O.A., Fong, H.G., 1982. Term structure modeling using exponential splines. The Journal of Finance 37, 339–348.