6 Processes with Autoregressive Conditional Heteroskedasticity (ARCH)
6.3 ARCH Models
Proposition 6.1.

With daily observations (five trading days a week) one chooses, e.g., $B = 20$, which corresponds approximately to a time window of one month. A first improvement of the time-dependent volatility measurement is obtained by using a weighted average, where the weights $g_i \ge 0$ are nonnegative:
$$
s_t^2 = \sum_{i=1}^{B} g_i\, x_{t-i}^2 \quad \text{with} \quad \sum_{i=1}^{B} g_i = 1 . \tag{6.3}
$$
Example 6.1 (Exponential Smoothing) For the weighting function $g_i$ one often uses an exponential decay:

$$
g_i = \frac{\lambda^{i-1}}{1 + \lambda + \dots + \lambda^{B-1}} \quad \text{with} \quad 0 < \lambda < 1 .
$$
Note that the denominator is defined precisely such that $\sum_{i=1}^{B} g_i = 1$ holds. With growing $B$ one furthermore obtains

$$
1 + \lambda + \dots + \lambda^{B-1} = \frac{1 - \lambda^B}{1 - \lambda} \to \frac{1}{1 - \lambda} .
$$
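The normalization of the weights and the geometric-sum limit can be verified numerically; a minimal sketch, where the values of `lam` ($\lambda$) and `B` are illustrative:

```python
lam, B = 0.9, 200  # illustrative values for lambda and B

# Denominator of the weights: direct sum versus closed form (1 - lam**B) / (1 - lam)
denom = sum(lam ** (i - 1) for i in range(1, B + 1))
closed = (1 - lam ** B) / (1 - lam)

# The weights g_i = lam**(i-1) / denom sum to one by construction,
# and for large B the denominator approaches 1 / (1 - lam) = 10.
g = [lam ** (i - 1) / denom for i in range(1, B + 1)]
print(abs(sum(g) - 1.0) < 1e-12)  # True
```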
Inserting the exponentially decaying weights in (6.3), we get the following result for $B \to \infty$:

$$
s_t^2(\lambda) = (1 - \lambda) \sum_{i=1}^{\infty} \lambda^{i-1} x_{t-i}^2 .
$$
Now it is an easy exercise to verify the following recursive relation:

$$
s_t^2(\lambda) = (1 - \lambda)\, x_{t-1}^2 + \lambda\, s_{t-1}^2(\lambda) . \tag{6.4}
$$

We will call $s_t^2(\lambda)$ the exponentially smoothed volatility or variance. In order to be able to calculate it for $t = 2, \dots, n$, we need a starting value. Typically, $s_1^2(\lambda) = x_1^2$ is chosen, which leads to $s_2^2(\lambda) = x_1^2$.
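The recursion (6.4) translates directly into code; a minimal sketch, where the function name `ewma_variance` and the example data are ours, not from the text:

```python
import numpy as np

def ewma_variance(x, lam):
    """Exponentially smoothed variance s_t^2(lambda) via the recursion (6.4):
    s_t^2 = (1 - lam) * x_{t-1}^2 + lam * s_{t-1}^2, started at s_1^2 = x_1^2."""
    x = np.asarray(x, dtype=float)
    s2 = np.empty(len(x))
    s2[0] = x[0] ** 2  # typical starting value s_1^2 = x_1^2
    for t in range(1, len(x)):
        s2[t] = (1.0 - lam) * x[t - 1] ** 2 + lam * s2[t - 1]
    return s2

s2 = ewma_variance([2.0, 1.0, 3.0], lam=0.5)
print(s2[1])  # 4.0, i.e. s_2^2 = x_1^2, as noted in the text
```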
The ARCH and GARCH processes which are subsequently introduced are models leading to volatility specifications which generalize $s_t^2$ and $s_t^2(\lambda)$ from (6.3) and (6.4), respectively.
An ARCH($q$) process is given by (6.1), $x_t = \sigma_t \varepsilon_t$, with the conditional variance specified by (6.5),

$$
\sigma_t^2 = \alpha_0 + \alpha_1 x_{t-1}^2 + \dots + \alpha_q x_{t-q}^2 ,
$$

where it is assumed that

$$
\alpha_0 > 0 \quad \text{and} \quad \alpha_i \ge 0 , \quad i = 1, \dots, q , \tag{6.6}
$$

in order to guarantee $\sigma_t^2 > 0$. Note that this variance function corresponds to $s_t^2$ from (6.3). For $\alpha_1 = \dots = \alpha_q = 0$, however, the case of homoskedasticity is modeled.
Conditional Moments
Given $x_{t-1}, \dots, x_{t-q}$, one naturally obtains zero for the conditional expectation, as ARCH processes are martingale differences; see Proposition 6.1. For the conditional variance it holds that

$$
\operatorname{Var}(x_t \mid x_{t-1}, \dots, x_{t-q}) = E(x_t^2 \mid x_{t-1}, \dots, x_{t-q}) = \sigma_t^2\, E(\varepsilon_t^2 \mid x_{t-1}, \dots, x_{t-q}) = \sigma_t^2 ,
$$

as $\varepsilon_t$ is again independent of $x_{t-j}$ and has unit variance. Hence, for the variance it conditionally holds that

$$
\operatorname{Var}(x_t \mid x_{t-1}, \dots, x_{t-q}) = \alpha_0 + \alpha_1 x_{t-1}^2 + \dots + \alpha_q x_{t-q}^2 ,
$$
which explains the name of the models: The conditional variance is modeled autoregressively (where “autoregressive” means in this case: dependent on the past of the process). Thus, extreme amplitudes in the previous period are followed by high volatility in the present period resulting in so-called volatility clusters.
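The conditional-variance formula can be checked by Monte Carlo; a minimal sketch for $q = 1$, where the parameter values and the conditioning value are illustrative choices of ours:

```python
import numpy as np

# Check Var(x_t | x_{t-1}) = alpha0 + alpha1 * x_{t-1}^2 for ARCH(1);
# parameter values are illustrative.
alpha0, alpha1 = 1.0, 0.5
x_prev = 2.0                           # condition on a fixed past observation
sigma2 = alpha0 + alpha1 * x_prev**2   # conditional variance: 1 + 0.5 * 4 = 3

rng = np.random.default_rng(42)
eps = rng.standard_normal(1_000_000)
x_t = np.sqrt(sigma2) * eps            # draws of x_t given x_{t-1} = 2

print(x_t.mean())  # close to 0 (martingale difference)
print(x_t.var())   # close to 3
```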
If the assumption of normality of the innovations is added, then the conditional distribution of $x_t$ given the past is normal as well:

$$
x_t \mid x_{t-1}, \dots, x_{t-q} \sim N\!\left( 0 ,\ \alpha_0 + \alpha_1 x_{t-1}^2 + \dots + \alpha_q x_{t-q}^2 \right) .
$$
But although the original work by Engle (1982) assumed $\varepsilon_t \sim \text{iid } N(0, 1)$, the assumption of normality is not crucial for ARCH effects.
Stationarity
By Proposition 6.1 it holds for the variance that

$$
\operatorname{Var}(x_t) = E(\sigma_t^2) = \alpha_0 + \alpha_1 E(x_{t-1}^2) + \dots + \alpha_q E(x_{t-q}^2) .
$$

If the process is stationary, $\operatorname{Var}(x_t) = \operatorname{Var}(x_{t-j})$, $j = 1, \dots, q$, then its variance results as

$$
\operatorname{Var}(x_t) = \frac{\alpha_0}{1 - \alpha_1 - \dots - \alpha_q} .
$$
For a positive variance expression, this necessarily requires (due to $\alpha_0 > 0$)

$$
1 - \alpha_1 - \dots - \alpha_q > 0 .
$$

In fact, this condition is sufficient for stationarity as well. In Problem 6.1 we therefore show the following result.
Proposition 6.2 (Stationary ARCH)
Let $\{x_t\}$ be from (6.1) and $\{\sigma_t^2\}$ from (6.5) with (6.6). The process is weakly stationary if and only if it holds that

$$
\sum_{j=1}^{q} \alpha_j < 1 .
$$
Correlation of the Squares
We define $e_t = x_t^2 - \sigma_t^2$. Due to Proposition 6.1 the expected value is zero: $E(e_t) = 0$. Adding $e_t$ to both sides of (6.5), one immediately obtains

$$
x_t^2 = \alpha_0 + \alpha_1 x_{t-1}^2 + \dots + \alpha_q x_{t-q}^2 + e_t .
$$

From this we learn that an ARCH($q$) process implies an autoregressive structure for the squares $\{x_t^2\}$. The serial dependence of an ARCH process originates from the squares of the process. Because of $\alpha_i \ge 0$, $x_t^2$ and $x_{t-i}^2$ are positively correlated, which again allows one to capture volatility clusters.
In Figs. 6.1 and 6.2, ARCH(1) time series of length 500 are simulated. For this purpose pseudo-random numbers $\{\varepsilon_t\}$ are generated as normally distributed and $\alpha_0 = 1$ is chosen. The effect of $\alpha_1$ is now quite obvious: the larger the value of this parameter, the more pronounced are the volatility clusters. Long periods with little movement are followed by shorter periods of vehement, extreme amplitudes, which can be negative as well as positive. These volatility clusters become even more obvious in the respective lower panels of the figures, in which the squared observations $\{x_t^2\}$ are depicted. Because of the positive autocorrelation of the squares, small amplitudes tend to be followed by small ones, while extreme observations appear to follow each other.
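Such a simulation can be sketched as follows; a minimal sketch in Python, where the function name `simulate_arch1`, the seed, and the sample size are our illustrative choices (the figures use series of length 500; a longer sample is used here so that the sample moments settle down):

```python
import numpy as np

def simulate_arch1(n, alpha0=1.0, alpha1=0.5, seed=0):
    """Simulate an ARCH(1) process x_t = sigma_t * eps_t with
    sigma_t^2 = alpha0 + alpha1 * x_{t-1}^2 and eps_t iid N(0, 1)."""
    rng = np.random.default_rng(seed)
    eps = rng.standard_normal(n)
    x = np.zeros(n)
    sigma2 = alpha0 / (1.0 - alpha1)  # start from the stationary variance
    for t in range(n):
        x[t] = np.sqrt(sigma2) * eps[t]
        sigma2 = alpha0 + alpha1 * x[t] ** 2
    return x

x = simulate_arch1(100_000, alpha1=0.5)
x2 = x ** 2
# Sample variance close to alpha0 / (1 - alpha1) = 2, and positively
# autocorrelated squares, i.e. volatility clusters:
print(x.var())
print(np.corrcoef(x2[1:], x2[:-1])[0, 1])
```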
Skewness and Kurtosis
In the first section of the chapter on basic concepts from probability theory we have defined the kurtosis by means of the fourth moment of a random variable and denoted the corresponding coefficient by $\gamma_2$. For $\gamma_2 > 3$ the density function is more "peaked" than that of the normal distribution: on the one hand the values are more concentrated around the expected value, on the other hand extreme observations in the tails of the distribution occur with higher probability ("fat-tailed and highly peaked").

[Fig. 6.1: ARCH(1) with $\alpha_0 = 1$ and $\alpha_1 = 0.5$; upper panel: observations, lower panel: squared observations]

[Fig. 6.2: ARCH(1) with $\alpha_0 = 1$ and $\alpha_1 = 0.9$; upper panel: observations, lower panel: squared observations]

For stationary ARCH processes with Gaussian innovations ($\varepsilon_t \sim \text{iid } N(0, 1)$) it holds that the kurtosis exceeds 3 (provided it exists at all):

$$
\gamma_2 > 3 .
$$
The corresponding derivation can be found in Problem 6.2. Due to this excess kurtosis, ARCH is generally incompatible with the assumption of an unconditional Gaussian distribution.
We define the skewness coefficient $\gamma_1$, similarly to the kurtosis, by the third moment of a standardized random variable. The skewness coefficient of ARCH models depends on the symmetry of $\varepsilon_t$. If this innovation is symmetric, then it follows that $E(\varepsilon_t^3) = 0$. Hence, $\gamma_1 = 0$ follows for the corresponding ARCH process (due to independence of $\sigma_t$ and $\varepsilon_t$):

$$
E(x_t^3) = E(\sigma_t^3)\, E(\varepsilon_t^3) = E(\sigma_t^3) \cdot 0 = 0 .
$$

Thereby it was only used that $\varepsilon_t$ is symmetrically distributed.
Example 6.2 (ARCH(1)) In particular, for a stationary ARCH(1) process with $\alpha_1^2 < \frac{1}{3}$ and Gaussian innovations $\varepsilon_t$, the kurtosis is finite and results as (see Problem 6.3):

$$
\gamma_2 = 3\, \frac{1 - \alpha_1^2}{1 - 3 \alpha_1^2} > 3 .
$$
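This formula is easy to evaluate; a small sketch, where the function name `arch1_kurtosis` is ours:

```python
# Kurtosis of a Gaussian ARCH(1) process as a function of alpha1,
# finite only for alpha1**2 < 1/3 (see the text); function name is ours.
def arch1_kurtosis(alpha1):
    if 3 * alpha1 ** 2 >= 1:
        raise ValueError("fourth moment does not exist for alpha1**2 >= 1/3")
    return 3 * (1 - alpha1 ** 2) / (1 - 3 * alpha1 ** 2)

print(arch1_kurtosis(0.0))  # 3.0: the homoskedastic Gaussian case
print(arch1_kurtosis(0.5))  # 9.0: pronounced excess kurtosis
```

The kurtosis increases in $\alpha_1$ and diverges as $\alpha_1^2$ approaches $1/3$.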
For $\alpha_1^2 \ge \frac{1}{3}$, extreme observations occur with such high probability that the kurtosis is no longer constant. Consider a stationary ARCH(1) process ($\alpha_1 < 1$) with $\alpha_1^2 = 1/3$. Under this condition one has for $\mu_{4,t} = E(x_t^4)$, with $E(x_{t-1}^2) = \alpha_0 / (1 - \alpha_1)$ and assuming Gaussianity:

$$
\begin{aligned}
\mu_{4,t} &= E(\varepsilon_t^4)\, E(\sigma_t^4) = 3\, E\!\left[ (\alpha_0 + \alpha_1 x_{t-1}^2)^2 \right] \\
&= 3 \left( \alpha_0^2 + \frac{2\, \alpha_0^2 \alpha_1}{1 - \alpha_1} \right) + 3\, \alpha_1^2\, \mu_{4,t-1} \\
&= c + 3\, \alpha_1^2\, \mu_{4,t-1} = c + \mu_{4,t-1} ,
\end{aligned}
$$

where the constant $c$ is appropriately defined. Continued substitution thus yields

$$
\mu_{4,t} = c\, t + \mu_{4,0} .
$$

Hence, we observe that the kurtosis grows linearly over time if $3 \alpha_1^2 = 1$.
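The linear growth of the fourth moment at the boundary $3 \alpha_1^2 = 1$ can be illustrated by iterating the recursion numerically; a minimal sketch, where $\alpha_0$ and the starting value $\mu_{4,0}$ are illustrative:

```python
import math

# Iterate mu_{4,t} = c + 3 * alpha1**2 * mu_{4,t-1} at the boundary
# 3 * alpha1**2 = 1; alpha0 and the starting value are illustrative.
alpha0 = 1.0
alpha1 = math.sqrt(1.0 / 3.0)  # so that 3 * alpha1**2 = 1
c = 3 * (alpha0**2 + 2 * alpha0**2 * alpha1 / (1 - alpha1))

mu4 = [3.0]  # illustrative starting value mu_{4,0}
for t in range(1, 6):
    mu4.append(c + 3 * alpha1**2 * mu4[-1])

# The increments are constant, i.e. mu_{4,t} = c * t + mu_{4,0}:
print([round(m - mu4[0], 6) for m in mu4])
```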