M01437

(1)

Parameter Estimation in LNSV Models:

Griddy Gibbs versus Metropolis-Hastings

Abstract

This paper compares performances of two MCMC samplers, Griddy-Gibbs (GG) and Metropolis-Hastings (MH) samplers, to estimate parameters and latent variables in the four log-normal stochastic volatility (LNSV) models: (1) LNSVn (standard), (2) LNSVt (with fat-tails), (3) LNSVL (with correlated errors), (4) LNSVLt (with fat-tails and correlated errors). To illustrate the comparison of two samplers, we apply the mod-els and samplers to both simulated and real data sets. The stock indices we examine are TOPIX and three stocks of the TOPIX Core 30: Hitachi Ltd., Nissan Motor Co. Ltd. and Panasonic Corp.

Keywords: Log-normal stochastic volatility, Bayesian inference, single-move Markov Chain Monte Carlo, Griddy Gibbs, Metropolis-Hastings, TOPIX.

1 Introduction

Amongst modeling of financial time series, the stochastic volatility (SV) model is re-cognized as one of the most important class as it has the ability to capture the commonly observed change in variance of the observed stock index or exchange rate over time. A popular and most widely used SV model is the log-normal (LN) SV model, which was first introduced by Taylor (1982). In his discrete time model, volatility process is modeled as a first-order autoregression for the log-squared volatility. The LNSV model provides a more realistic and adequate than the ARCH-type models (see, for example, Ghysels et al. (1996)) and the GARCH-type models (see, for example, Kim et al. (1998), Yu (2002) and Carnero et al. (2001)).

Unfortunately, it is not possible to take random draws directly from the posterior for some unknown parameters in SV models. An approach has become very attractive is the Bayesian approach, which was first proposed by Shephard (1993) and Jacquier et al. (1994). Inference in this approach often requires advanced Bayesian computation, and here we focus on Markov Chain Monte Carlo (MCMC) procedure to perform posterior sampling and statistical inference. MCMC permits to obtain the conditional posterior distributions of the parameters by simulation rather than analytical methods. To summarize posterior

∗_The _email _addresses _of _the _authors _are, _{respectively,} _{bke76145@kwansei.ac.jp} _and

morimot@kwansei.ac.jp. The first author is a PhD candidate. The first author also wishes to thank Shuichi Nagata for some Matlab codes and helpful discussions.

2

Didit B. Nugroho1,2 and Takayuki Morimoto

Department of Mathematics, Satya Wacana Christian University Department of Mathematical Sciences, Kwansei Gakuin University

June 25, 2012

2

(2)

distributions, it is necessary to have algorithms that are well-suited to updating random samples (e.g. Gibbs sampling, Metropolis-Hastings and Griddy-Gibbs sampler).

Comparison of Metropolis-Hastings (MH) algorithm and Griddy-Gibbs (GG) sampler has already been discussed in Bauwens and Lubrano (1998) for an asymmetric StudentGARCH model. They show that GG sampler is feasible and competitive compared with MH sampler. In this paper we compare the performance between the use of the GG sampler and the MH sampler to estimate four LNSV models fitted to both simulated and real data sets. The real data we analysis includes the high frequency data sets for computing realized volatility (RV) and daily returns of TOPIX and three stocks of the TOPIX Core 30, which are Hitachi Ltd., Nissan Motor Co. Ltd. and Panasonic Corp., from January 5, 2004 until December 30, 2011. RV is used as a proxy to evaluate the daily estimated volatility performances. It was showed by Anderson et al. (2003) that the daily RV obtained from the intra-day high frequency data set is an consistent and robust estimator of the ”true” volatility.

The rest of this paper is organized as follows. The model specification is discussed in Section 2. Section 3 presents the description of the Bayesian MCMC method, conditional posteriors and sampling algorithms. In Section 4, we apply the model and the samplers to the daily returns on three stocks to obtain volatilities and values of model parameters. Finally, Section 5 gives some concluding remarks.

2 MCMC Methods

The Gibbs sampling algorithm (Geman and Geman. (1984)) uses a sequence of draws from conditional distributions to characterize the joint target distribution. Suppose that our parameter vectorθhas two components, making our target distributionf(θ1, θ2|y). To use the Gibbs sampler, one begins by choosing starting values θ₂(0) (starting values are usually chosen near the posterior mode or the maximum likelihood estimates). One then repeats, fori= 1, ..., N iterations (making sure to store the sequence of draws at each iteration):

Step 1: Sample θ₁(i+1) frompθ1|θ(2i), y

.

Step 2: Sample θ₂(i+1) frompθ2|θ(₁i+1), y.

Metropolis-Hastings algorithm is first introduced by Metropolis et al. (1953) and general-ized by Hastings (1970). The algorithm has many applications beyond Bayesian statistics; it is commonly used for all sorts of numerical integration and optimization. (It is also the case that the Gibbs sampling algorithm is a special case of the Metropolis-Hastings algorithm.) To simulate from our target distribution p(θ|y), we again start with sensible starting values: θ(0). For each iteration of the simulationi= 1, ..., N, we draw a proposalθ∗ from a known proposal distributionp θ∗_|θ(i)_{. Unlike the Gibbs sampling algorithm, when} each move is automatically accepted, one accepts the proposal distribution probabilistically, sometimes moving to a value with a higher density value, sometimes moving to one with a lower density value.

Unlike the MH sampler, in the GG sampler we do not need to find an efficient proposal distribution.

(3)

3 Accuracy

Without being able to plot the target distribution, which in most applications is of high dimension, it would be difficult, if not impossible, to assess whether the chain is sampling from the target distribution. An important part of any Bayesian analysis is assessing the convergence of the simulation results. Indeed, no Monte Carlo estimates of posterior density summaries can be trusted unless the chain has reached its steady state.

4 LNSV Model with Fat-tails and Correlated Errors

We consider the lognormal stochastic volatility (LNSV) model given by

Rt = e

1

2htω_t, ω_t=λ−

1 2

t ǫt, ǫtiid∼ N(0,1), t= 1, ..., T, (1)

ht+1 = α+φ(ht−α) +τ ηt+1, ηt+1iid∼ N(0,1), t= 1, ..., T−1, (2) λt ∼ p(λt|ν)

corr(ǫt, ηt+1) = ρ

where ht = lnσt2, for the unobservable volatility σt of asset on day t, Rt is the asset

return on day t from which the mean and autocorrelations are removed, 0 _≤ φ < 1, h1 ∼ N α, τ2/ 1−φ2, and N(·,·) represents the normal distribution. In this model, ν captures the fat-tailness and ρcaptures the leverage effect, which refers to the asymmet-ric behavior that return movements are negatively correlated with volatility. As pointed out in Yu (2005), specification ofcorr(ǫt, ηt+1) =ρ implies that the above model is a mar-tingale difference sequence and clear to interpret the leverage effect instead of assuming corr(ǫt, ηt) =ρ in Jacquier et al. (2004).

The above model has a parameter vector (ν, α, φ, τ2_{, ρ}_{) and two latent stochastic} pro-cesses denoted, respectively, by h = (h1, h2, ..., hT) and λ = (λ1, λ2, ..., λT), which are to

be estimated simultaneously using simulation methods which will be discussed in the next section. Here, the value of φ measures the autocorrelation present in the logged squared volatility. Thusφ can be interpreted as the persistence in the volatility, the constant scal-ing factor exp 1₂α as the modal instantaneous volatility, and τ is the volatility of the log-volatility (cf. Kim et al. (1998)). It is common to assume that 0_≤φ <1 because the volatility is positively autocorrelated in most financial time series. Next,λtis a scale factor,

p(λt|ν) is the mixing density. We assume that λt is distributed i.i.d gamma, or that νλt

follows a χ2 _{distribution with 3} _≤ _ν _≤ _{40 degrees of freedom, as discussed by Watanabe} and Asai (2001) and Jacquier et al. (2004). This assumption implies that the marginal distribution of λ−

1 2

t ǫt is Student-t withν degrees of freedom.

As in the previous literature (see, for example Zhang and King (2003, 2008), Jacquier et al. (2004) and Yu (2005)), it is convenient to write

ηt+1=ρǫt+ p

1₋ρ2_w

t+1, (3) fort= 1,2, ..., T−1, wherewt+1 is iidN(0,1) and corr(ǫt, wt+1) = 0. Transformation (3) shows that var(ηt+1) = 1 and corr(ηt+1, ǫt) = ρ which satisfies the above LNSV model

specification. Substituting (3) into (2), we obtain

ht+1=α+φ(ht−α) +ρτ e−

1 2htλ

1 2

tRt+ p

1₋ρ2_{τ w}

t+1.

It is evident that a unit increase in stock return at time tresults in aρτ σ_t−1λ

1 2

t unit change

(4)

a unit fall in stock return at this period will result in exp

ρτ σ_t−1λ

1 2

t

unit increase in the

variance at timet+ 1.1

Following Jacquier et al. (2004), and has been applied more recently by Yu (2005) and Zhang and King (2003, 2008), we re-parameterizedρandτ toϕ=ρτ andψ2 = 1−ρ2τ2, respectively, and obtain

Rt = e

1 2htλ−

1 2

t ǫt, ǫtiid∼ N(0,1), t= 1, ..., T, (4)

ht+1 = α+φ(ht−α) +ϕe−

1 2htλ

1 2

tRt+ψwt+1, wt+1iid∼ N(0,1), t= 1, ..., T−1,(5) where

h1∼ N α, ψ2/ 1−φ2 and ht|ht−1 ∼ N

α+φ(ht−1−α) +ϕe−

1 2ht−1_λ

1 2

t−1Rt−1, ψ2

(6) for t = 2, ..., T. We now have a new parameter vector denoted by θ _{= (}_{ν, α, φ, ϕ, ψ}2_).

Because ǫt and wt+1 are uncorrelated, we can now easily obtain the joint posterior ofh,λ and θ _{conditional on observations} _{R, where}_R_{= (}_R₁_{, R}₂_{, ..., R}_T_).

5 Prior Specification

Following standard practice, we assume that our model is completed by priors for the unknown structural parameters as follows:

λt∼ G(1₂ν,1₂ν), ν ∼ G(aν, bν), (7)

φ_{∼ B}(A, B), ψ2_{∼ IG}(a, b), α_|ψ2 _{∼ N} α0, ψ2/p0, ϕ|ψ2 ∼ N ϕ0, ψ2/q0, (8)

whereB(·,·),IG(·,·) andG(·,·) represent the beta, inverse gamma and gamma distributions, respectively. For parameter ν, the following alternative priors were used in the previous literature: exponential by Geweke (1993), Watanabe and Asai (2001), uniform discrete on the interval [3,40] by Jacquier et al. (2004), Gaussian by Zhang and King (2008) and gamma on the interval (2,40] by Abanto-Valle et al. (2010). These priors are chosen to prevent ν from getting too large during MCMC iterations, because the update of ν has a negligible effect when ν is already large.

6 Bayesian Approach

In order to implement Gibbs sampling, all of the fully conditional posterior distributions (one for each component ofθ) need to be derived from the joint posterior distribution. The conditional posterior distribution is derived from the joint posterior distribution by picking out the parts that involve the unknown parameter in question.

Denote θ₋_ν _{be the parameter vector after removing} _ν_{. By using Bayes’ theorem, the}

joint posterior distribution of parameters and latent unobservable variables conditional on the returns is

p(λ_,_h_,θ_|_{R) =}_p_(R_|_h_,λ₎_×_p₍λ_|_ν₎_×_p_(h_|θ₋_ν₎_×_p₍θ₎_, ₍₉₎

1

(5)

where

which are obtained from the likelihood function of R, the conditional and unconditional functions of h and the prior distributions in (7) and (8). A Gibbs sampler approach then estimates the parameters and latent variables by drawing random samples from the following conditional posterior distributions:

into the lower bound. In this case, the conditional posterior forλ_{is obtained from the joint}

(6)

The conditional posterior forλ _is

shape and inverse scale parameters being defined, respectively, by

Aλt =

The above conditional posterior is only standard for t=T, and hence we sample directly λ1 from Gamma distribution and sample λt, for t = 1, ..., T −1, by using Griddy Gibbs

and Metropolis-Hastings. Because the fG distribution does not depend on λt, using the

Independence Chain (IC) MH sampling method introduced by Tierney (1994), we sample a proposal λ∗t ∼ G(Aλt, Bλt) and accept it with probability min{1, g1(λ

∗

t)/g1(λt)}.

The probability density function (PDF) of the standardized student-t distribution with degrees of freedomν >2 is given by

f(ωt) =

So the conditional posterior forν is

p(ν|h,R) =p(ν)f(ωt)∝νaν−1e−bνν

A problem arises here in that the above posterior is not standard, and hence we sample ν using Griddy Gibbs sampler and Metropolis-Hastings algorithm.

To sampleνby Hastings algorithm, we use the Independence Chain Metropolis-Hastings (IC-MH) sampling method, in which the proposal forν isν∗ _{∼ N}_[3,40] µν, s2ν

and the acceptance probability is min_{1, p(ν∗)/p(ν)_}. The meanµν and variances2ν are

(7)

In this following, we write the above log posterior asℓ(ν) since after the data are observed,ν is only random quantity. We find the the posterior mode ˆν ofℓ(ν), meaning that ℓ′_(ˆ_ν_{) = 0,} by numerical optimization, and then we take µν = ˆν and sν2 = −1/ℓ′′(ˆν). A very simple

and robust method for finding the mode ˆν is provided by bisection method,

ˆ

To see this, we use the result of Abramowitz and Stegun (1972) (see Eq. 6.1.38), in which the ln Γ(ν/2) is approximated by

ln Γ(x) = ln(2π)

Thus, withaν >1, the first term is always negative, and the second term is always positive.

Therefore, following Watanabe and Asai (2001) we set s2

ν =−1/Dν(ˆν) where

Dν(ˆν) = min

ℓ′′(ˆν),₋0.0001 .

We should note that ℓ′′(ˆν) is always negative in our empirical analysis.

Given the conditional posterior for w, we propose the following Griddy Gibbs sampler procedure:

1. Denote the equally spaced grid of values for w, say,w1≤w2 ≤ · · · ≤wm.

2. Compute the value ofw’s posterior at each of the grid points and denote the resultant vector by p(w) = (p(w1), . . . , p(wm)).

3. Normalize p(w) and denote the resultant vector by p∗(w) = (p∗(w1), . . . , p∗(wm)).

4. Compute the empirical cumulative distribution function ofw, Φ(w).

5. Generateu_∼U(0,Φ(wm)) and invert Φ(w) by linear interpolation to obtain a random

sample ofw.

7.2 Sampling α

For sampling α based on the joint posterior (9), we found that the logarithm of the conditional posterior forα is

(8)

wherezt=ht−φht−1−ϕe−

Based on the joint posterior (9), we found that the logarithm of the conditional posterior forϕis

(9)

and

ψ2 _{are sampled, respectively, from their conditional posteriors, we can calculate} _τ2 _and _ρ through τ2 ₌_ϕ2₊_ψ2 _and _ρ₌ ϕ

τ.

7.5 Sampling φ

Based on the joint posterior (9), the log of the conditional posterior for φis given by

lnp(φ_|h,λ_,θ₋_ν_,_R) _∝ _ln_p₍_φ_{) +} ln 1−φ

Hence the conditional posterior forφcan be expressed as

p φ|h, α, ϕ, ψ2∝fN φ|µφ, s2φ mean µ and variance s2_{. The above conditional posterior is not standard, and hence we} draw φby Griddy Gibbs sampler and Metropolis-Hastings algorithm.

We note that a fN

φ_|µφ, s2φ

distribution does not depend on φ. Therefore, using the

IC-MH sampling method, we sample a proposal φ∗ _{∼ N}_[0,1)

Denoteh−tbe the latent stochastic vector after removinght. Based on the joint posterior

(10)

where wt = ht−Rtλ

1 2

t, for t = 1, ..., T −1. In order to sample h, we use the Griddy

Gibbs sampler and the random walk Metropolis-Hastings algorithm, in which the proposal density is the standar normal and the acceptance probability is computed through (12). In this case, the proposal value h∗t at (i+ 1)th iteration is generated according to the process

h∗_t =h(_ti)+ǫ(_ti+1), whereǫ(_ti+1)_{∼ N}0, ψ2(i+1).

7.7 Single Move MCMC

Since the conditional posteriors of ν,φ and h do not have closed form, we first sample the latent variableλ_{and parameters}_α_,_ϕ_and_ψ2_{. The Gibbs sampling scheme is described}

as follows. Choose arbitrary starting values θ(0)_,λ(0) _and_h(0)_{, and let}_i_{= 0.}

1. Sampleλ(i+1) _from_p λ_|_ν(i)_,_h(i)_,_R_.

2. Sampleν(i+1) _from _{p ν}_|_h(i)_,_R_.

3. Sampleα(i+1) _from _p_α_|_λ(i+1)_,_h(i)_{, φ}(i)_{, ϕ}(i)_{, ψ}2(i)_,_R_.

4. Sampleϕ(i+1) _from_p_ϕ_|_λ(i+1)_{, α}(i+1)_,_h(i)_{, φ}(i)_{, ψ}2(i)_,_R_.

5. Sample ψ2(i+1) _from _p_ψ2_|_λ(i+1)_{, α}(i+1)_{, ϕ}(i+1)_,_h(i)_{, φ}(i)_,_R_.

6. Sampleφ(i+1) frompφ|λ(i+1)_{, α}(i+1)_{, ϕ}(i+1)_{, ψ}2(i+1)_,_h(i)_,_R_.

7. Sampleh(i+1) from ph_|λ(i+1)_,θ(₋i+1)_ν _,_R_.

8. Seti=i+ 1 and go to step 1 until convergence is achieved.

8 Comparison of MCMC Methods

8.1 Simulated Data

8.2 Stock Market Data

8.2.1 Data Description

In this section we apply the methods and LNSV models used previously to the series of daily closing prices {St} of the TOPIX and three stocks of the TOPIX Core 30, which

are Hitachi Ltd. (HIT), Nissan Motor Co. Ltd. (NIS) and Panasonic Corp. (PAN), from January 5, 2004 until December 30, 2011, for a total of 1960 observations. The series of return are daily percentage mean-corrected returns,Rt, given by the transformation

Rt= 100× ln

St

St−1 − 1 T

T X

i=1 ln Si

Si−1 !

(11)

Figure 1: Time series plots for percentage daily mean-corrected returns of TOPIX, HIT (un-smooth), NIS and PAN from January 2004 until December 2011.

Table 1 reports summary statistics for the returns data, and Figure 1 displays the series of returns data. As usual, the kurtosis of the returns is significantly above three, indi-cating leptokurtic return distributions. The Ljung-Box (LB) test statistics indicate that the returns for TOPIX, NIS and PAN stock price indices are serially uncorrelated, while the returns for HIT index is serially correlated at the 5% level. One simple way to adjust these autocorrelated returns is to unsmooth the return series such that the adjusted returns display no serial correlation. The procedure is given as follows (see De Souza and Gokcan (2004)):

R∗_t = Rt−ρkRt−k 1₋ρk

where ρk is the kth-order autocorrelation of the autocorrelated return seriesRt, andRt−k

is the k-period lagged return.

Table 1: Descriptive statistics of the daily returns for TOPIX and TOPIX Core 30.

Statistics TOPIX HIT NIS PAN

Raw Data Raw data Unsmooth Raw data Raw data

Sample size 1961 1961 1959 1961 1961

Mean 0.000 0.000 −0.001 0.000 0.000 Standard deviation 1.479 2.195 2.065 2.360 2.113 Kurtosis 11.245 11.390 11.178 8.669 8.353 LB(8) 9.026 18.227 10.295 15.136 12.682 p-value LB(8) 34.00% 1.96% 24.49% 5.65% 12.33%

Autocorrelation no yes no no no

NOTE: The lag lengths= 8 for the LB(s) statistic is selected based on the choice ofs≈ln(1961) (see Tsay (2010)).

10

HIT

-15

-20..._ _ __. __ _._ _ ___. __ _._ _ __. __ ...._ _ _._ __ ..__, 2004 2005 2006 2007 2008 2009 2010 2011 2012 2004 2005 2006 2007 2008 2009 2010 2011 2012

NIS

15

10

-10

PAN

10

-15 -15..._--~--'---'--~----'---~---'---.L....I

(12)

8.2.2 Estimation Results

In our analysis, we compare two proposed MCMC methods for the following four candi-date LNSV models:

1. Model SVn: the LNSV model without both fat-tails and correlated errors. The error in return is assumed to follow normal distribution (λt = 1 for all t), and there is no

correlation between errors in return and volatility (ρ= 0).

2. Model SVt: the LNSV model with fat-tails. The error in return is assumed to Student-t distribution with unknown degrees of freedom, and there is no correlation between errors in return and volatility.

3. Model SVL: the LNSV model with correlated errors. The error in return is assumed to follow normal distribution, and there is a correlation between errors in return and volatility (ρ6= 0).

4. Model SVLt: the LNSV model with fat-tails and correlated errors. The error in return is assumed to Student-tdistribution with unknown degrees of freedom, and there is a correlation between errors in return and volatility.

The hyperparameters in (7) and (8) required in the joint prior distribution were set to aν = 12,bν = 0.8,A= 30,B = 1.5,a= 1,b= 0.005,α0 = 0,p0 = 0.2,ϕ0= 0 andq0= 0.5. For initial parameter values and log-volatilities, respectively, we setθ(0) _{= (20}_,₁_,₀_.₉_,₀_.₁_,₀_.₁₎ and h(0)_t = 2 lnRV. In particular, Griddy Gibbs samples of ν, φ and ht are, respectively,

drawn using 100, 200 and 400 equally spaced points. The possible range of ht for the ith

Gibbs iteration is [h(_ti−1) ₋1, h(_ti−1) + 1]. In all cases, we simulated the ht’s in a

single-move fashion. All the calculations were performed running stand alone code developed by the authors using MATLAB version 7.8.0(R2009a) on a AMD Phenom(tm) II X6 1035T Processor 2.60GHz.

For all the models and methods, we ran the MCMC simulation for 15000 iteration but discarded the first 5000 draws. With the resulting N = 10000 values, we calculated the posterior means, standard deviations (SD), the 95% credible interval, the Monte Carlo standard error (MCSE) and the simulation inefficiency factors (SIF). Table 2, 4, 6 and 8 summarize these results. According to both MCSE and SIF, the two proposed sampling algorithms have been mixing quite good. The 95% credible intervals are calculated using a highest posterior density (HPD) proposed by Chen and Shao (1999). The MCSE is estimated by σ_bf/

√

N (see Roberts (1996)), where _bσ_f2 id defined as the variance of the posterior mean from correlated draws. Here, the number of batches was 50 and there were 200 draws in each batch. The SIF can be interpreted as the number of successive iterations needed to obtain near independent samples and is calculated by _bσ_f2/_eσ_f2, where _eσf is the

standard deviation of the posterior mean.

(13)

of shock to volatility, HL = ln(0.5)/ln( ˆφ) (see Hall (2005)). Third, the posterior means for the conditional variance of volatility, ˆτ2_, _{are smaller by the MH sampler} _than those by the GG sampler for all cases, indicating that MH sampler exhibits a lower vari-ability of volatility. The lowest posterior mean is for SVLt model in MH sampler except for TOPIX, and the highest one is for SVL model in GG sampler. Fourth, the posterior means of the degrees of freedom, ν, in SV models with fat-tails are between 13 (for the Hitachi stock in the SVLt model by MH sampler) and 24 (for the TOPIX in the SVt by GG sampler), indicating that the normal conditional distribution would be strongly rejected by the data. In all cases, GG sampler exhibits the higher posterior means of the degrees of freedom of the Student-t distribution. Fifth, the posterior means of the leverage effect parameter, ρ, in all cases of SV models with correlated errors are in the negative interval, where posterior means in GG sampler suggest that some though weakerevidence of negative correlation between returns and volatility process.

To evaluate the daily estimated volatility performances, the standard realized volatility (RV) is used as a proxy. It was showed by Anderson et al. (2003) that the daily RV obtained from the intra-day high frequency data set is an consistent and robust estimator of the ”true” volatility as long as there are no jumps. The percentage RV is defined by

RVt= 100× r

XNt

k=2[p(t, k)−p(t, k−1)] 2_,

where Nt is the number of observations in day t and p(t, k) denotes the log-price at the

k’th observation in day t. To measure the closeness between the estimated volatilities and RV, six loss functions are used, namely, root mean-squared error (RMSE) for volatility and variance, mean absolute error (MAE) for volatility and variance, logarithmic loss (LL) and Gaussian quasi-maximum likelihood function (GMLE), as discussed by Bollerslev et al. (1994) and Lopez (2001).

Table 3, 5, 7 and 9 present the value and ranking (indicated by subscripts) of all four SV models and two samplers under the six statistical loss functions. An interesting result is thatfor the TOPIX series the GG sampler provides themost accurateestimates against the MH sampler in each model. In fact, of the eight cases the four best estimates are performed by the GG sampler, where the best performance is for SVLt model. For Hitachi, excluding GMLE loss function, the best estimate is provided by MH sampler, where five loss functions indicate that the SVL model provides the most accurate estimation. Excluding GMLE loss function, MH sampler for SVLt model is the best sampler to estimate volatility of Nissan indicated by five loss functons. The results for Panasonic series indicatesno clear pattern.

(14)

the posterior means of volatilities from the MH sampler exhibit smoother movements than those from the GG sampler. The figure is also nicely illustrated that RV and the estimated volatility closely mimic the movements of|Rt|plotted in the top panel supporting the idea

of using mean-corrected returns as proxies forht.

9 Conclusion

The results, based on daily observations from all four stocks using four LNSV models, reveal that volatility by MH sampler in all cases is more persistent and less variable than those by GG sampler. In the case of LNSV models with a fat-tails component, the degrees of freedom shows that the student-tdistribution by MH sampler appears flatters with heavier tails. In case the LNSV models with correlated errors, the posterior mean of leverage effect parameter involving the use of MH sampler show a stronger evidence of negative asymmetric returns-volatility relation. After comparing the estimation performance of all cases, where standard daily RV is used as a proxy, it was found that the GG sampler is superior according to all six loss functions for TOPIX in the LNSVLt model, while the MH sampler is superior according to five loss functions for Hitachi in the LNSVL model and for Nissan in the LNSVLt model and three loss functions for Panasonic in the LNSVt model. Furthermore, GMLE (Gaussian quasi-maximum likelihood function) loss function is only minimized on all stocks in the same model and sampler: LNSVLt model by GG sampler.

References

Abanto-Valle, C. A., Bandyopadhyay, D., Lachos, V. H., & Enriquez, I. (2010). Robust Bayesian analysis of heavy-tailed stochastic volatility models using scale mixtures of normal distributions. Computational Statistics and Data Analysis, 54(12), 2883– 2898.

Abramowitz, M., & Stegun, N. (Eds.). (1972). Handbook of mathematical functions with

formulas, graphs, and mathematical tables. Dover Publications Inc.

Albert, J. (2009). Bayesian computation with R (2nd ed.). Springer.

Anderson, T. G., Bollerslev, T., Diebold, F. X., & Labys, P. (2003). Modeling and fore-casting realized volatility. Econometrica,71, 529–626.

Bauwens, L., & Lubrano, M. (1998). Bayesian inference on GARCH models using the Gibbs sampler. The Econometrics Journal,1, 23–46.

Bollerslev, T., Engle, R. F., & Nelson, D. B. (1994). ARCH Models. In R. Engle & D. McFadden (Eds.), The Handbook of Econometrics (pp. 2959–3038). Amsterdam: North-Holland.

Carnero, M. A., Pea, D., & Ruiz, E. (2001). Is stochastic volatility more flexible than

GARCH? Working Paper 01-08, Universidad Carlos III de Madrid. Available from

e-archivo.uc3m.es/bitstream/10016/152/1/w

Chen, M. H., & Shao, Q. M. (1999). Monte Carlo estimation of Bayesian credible and HPD intervals. Journal of Computational and Graphical Statistics,8, 69–92.

De Souza, C., & Gokcan, S. (2004). Hedge fund volatility: It’s not what you think it is.

AIMA Journal.

Geman, S., & Geman., D. (1984). Stochastic relaxation, Gibbs distribution and Bayesian restoration of images. IEE Transactions on Pattern Analysis and Machine

Intelli-gence,6, 721–741.

Geweke, J. (1993). Bayesian treatment of the independent student-tlinear model. Journal

(15)

Ghysels, E., Harvey, A. C., & Renault, E. (1996). Stochastic volatility. In G. Maddala & C. Rao (Eds.), Handbook of Statistics: Statistical Methods in Finance (pp. 119–191). Amsterdam: Elsevier Science.

Hall, A. R. (2005). Generalized method of moments. Oxford University Press.

Hastings, W. K. (1970). Monte Carlo sampling methods using Markov chains and their applications. Biometrika,57(1), 97–109.

Jacquier, E., Polson, N. G., & Rossi, P. E. (1994). Bayesian analysis of stochastic volatility models. In N. Shephard (Ed.),Stochastic Volatility: Selected Readings (pp. 247–282). Oxford University Press, New York.

Jacquier, E., Polson, N. G., & Rossi, P. E. (2004). Bayesian analysis of stochastic volatility models with fat-tails and correlated errors. Journal of Econometrics, 122(1), 185– 212.

Kim, S., Shephard, N., & Chib, S. (1998). Stochastic volatility: likelihood inference and comparison with ARCH models. In N. Shephard (Ed.), Stochastic volatility: selected

readings. Oxford University Press.

Lopez, J. A. (2001). Evaluation of predictive accuracy of volatility models. Journal of

Forecasting,20(1), 87–109.

Metropolis, N., Rosenbluth, A. W., Marshall, N. R., Teller, A. H., & Teller, E. (1953). Equations of state calculations by fast computing machines. Journal of Chemical

Physics,21(6), 1087–1091.

Roberts, G. O. (1996). Markov chain concepts related to sampling algorithms. In R. S. Gilks W.R. & D. Spiegelhalter (Eds.),Markov Chain Monte Carlo in Practice (pp. 45–57). Chapman & Hall, London.

Sel¸cuk, F. (2008). Asymmetric stochastic volatility in emerging stock markets. Applied

Financial Economics,15(12), 867–874.

Shephard, N. (1993). Fitting non-linear time series models, with applications to stochastic variance models. Journal of Applied Econometrics,8, 135–152.

Taylor, S. J. (1982). Financial returns modelled by the product of two stochastic processes— a study of the daily sugar prices 1961–75. In N. Shephard (Ed.),Stochastic Volatility:

Selected Readings (pp. 60–82). Oxford University Press, New York.

Tierney, L. (1994). Markov chain for exploring posterior distributions. Annals of Statistics, 22(4), 1701–1762.

Watanabe, T., & Asai, M. (2001).Stochastic volatility models with heavy-tailed distributions:

a Bayesian analysis. In Bank of Japan, IMES Discussion Paper Series No. 2001-E-17.

Yu, J. (2002). Forecasting volatility in the New Zealand stock market. Applied Financial

Economics,12, 193–202.

Yu, J. (2005). On leverage in a stochastic volatility model. Journal of Econometrics, 127(2), 165–178.

Zhang, X., & King, M. L. (2003). Estimation of asymmetric Box-Cox stochastic volatility

models using MCMC simulation.Working Paper 10/03, Monash University. Available

from http://www.buseco.monash.edu.au/ebs/pubs/wpapers/2003/

(16)

Table 2: Posterior analysis for daily TOPIX.

Parameter Statistic SVn SVt

GG MH GG MH

exp(1 2αˆ)

Mean (SD) 1.015 (0.086) 1.139 (0.124) 0.963 (0.086) 1.099 (0.124) 95% HPD (0.850,1.185) (0.897,1.384) (0.796,1.141) (0.862,1.351) MCSE (SIF) 0.0020 (5.60) 0.0022 (3.13) 0.0024 (8.03) 0.0023 (3.50) ˆ

φ

Mean (SD) 0.969 (0.007) 0.978 (0.006) 0.972 (0.006) 0.981 (0.005) 95% HPD (0.963,0.991) (0.966,0.990) (0.958,0.985) (0.969,0.991) MCSE 0.0004 (35.16) 0.0003 (36.85) 0.0004 (33.96) 0.0003 (38.07) ˆ

τ2

ν

Mean (SD) - - 23.591 (4.335) 20.885 (2.732) 95% HPD - - (15.544,32.141) (15.525,26.160) MCSE (SIF) - - 0.1498 (11.94) 0.1044 (14.62)

Parameter Statistic SVL SVLt

GG MH GG MH

exp(1 2αˆ)

φ

τ2

ν

Mean (SD) - - 23.562 (4.222) 21.269 (2.798) 95% HPD - - (16.005,32.156) (15.636,26.632) MCSE (SIF) - - 0.1387 (10.79) 0.1574 (31.64) ˆ

ρ

Mean (SD) ₋0.539 (0.050) ₋0.606 (0.054) ₋0.552 (0.054) ₋0.639 (0.061) 95% HPD (₋0.636,₋0.442) (₋0.705,₋0.493) (₋0.654,₋0.440) (₋0.749,₋0.521) MCSE (SIF) 0.0036 (50.45) 0.0063 (135.78) 0.0045 (70.919) 0.0074 (148.755)

Table 3: Estimation performance of the estimated volatility for TOPIX under six loss func-tions.

Loss Function

SVn SVt SVL SVLt

GG MH GG MH GG MH GG MH

RMSE1 0.5284 0.6618 0.4632 0.6056 0.5113 0.6497 0.4371 0.5955

RMSE2 1.9884 2.4948 1.6512 2.1686 1.8353 2.3517 1.5311 2.0495

GMLE 2.0154 2.7758 1.7032 2.5256 1.9373 2.7217 1.5751 2.4425

LL 1.1344 1.6308 0.9332 1.4726 1.0873 1.6067 0.8521 1.4245

MAE1 0.4254 0.5578 0.3682 0.5106 0.4183 0.5567 0.3481 0.5065

MAE2 0.9694 1.3388 0.8072 1.1796 0.9443 1.3247 0.7561 1.1685

(17)

Table 4: Posterior analysis for daily Hitachi.

GG MH GG MH

exp(1 2αˆ)

φ

τ2

ν

GG MH GG MH

exp(1 2αˆ)

φ

τ2

ν

ρ

Table 5: Estimation performance of the estimated volatility for Hitachi under six loss func-tions.

Loss Function

SVn SVt SVL SVLt

RMSE1 1.0446 0.9452 1.1477 1.0073 1.0425 0.9381 1.1598 1.0134

RMSE2 6.8334 6.6142 7.3257 6.9435 6.7933 6.5461 7.3578 7.0416

GMLE 2.1514 2.2487 2.0602 2.1776 2.1493 2.2538 2.0461 2.1675

LL 0.9885 0.7172 1.2737 0.8493 0.9906 0.7061 1.3208 0.8714

MAE1 0.8265 0.7262 0.9347 0.7813 0.8276 0.7191 0.9538 0.7894

MAE2 3.8096 3.5042 4.1727 3.6803 3.8095 3.4631 4.2318 3.7094

(18)

Table 6: Posterior analysis for daily Nissan.

GG MH GG MH

exp(1 2αˆ)

φ

τ2

ν

GG MH GG MH

exp(1 2αˆ)

φ

τ2

ν

ρ

Table 7: Estimation performance of the estimated volatility for Nissan under six loss functions. Loss

Function

SVn SVt SVL SVLt

RMSE1 0.5434 0.5998 0.5475 0.5383 0.5342 0.5977 0.5576 0.4891

RMSE2 4.0426 4.7377 3.6433 4.0225 3.9814 4.9168 3.5242 3.4741

GMLE 2.0474 2.3038 1.8682 2.1856 2.0243 2.2817 1.8041 2.1535

LL 0.2606 0.2334 0.3395 0.2132 0.2607 0.2273 0.4018 0.1951

MAE1 0.3644 0.3786 0.3957 0.3462 0.3593 0.3755 0.4198 0.3171

MAE2 1.6975 1.9407 1.6894 1.6813 1.6602 1.9558 1.7206 1.4901

(19)

Table 8: Posterior analysis for daily Panasonic.

GG MH GG MH

exp(1 2αˆ)

φ

τ2

ν

GG MH GG MH

exp(1 2αˆ)

φ

τ2

ν

ρ

Table 9: Estimation performance of the estimated volatility for Panasonic under six loss func-tions.

Loss Function

SVn SVt SVL SVLt

RMSE1 0.6383 0.6546 0.6495 0.6181 0.6424 0.6668 0.6577 0.6232

RMSE2 3.8886 4.2587 3.6112 3.8214 3.8765 4.3348 3.5571 3.6243

GMLE 2.0844 2.2738 1.8962 2.1296 2.0823 2.2647 1.8651 2.0825

LL 0.4425 0.3982 0.5197 0.3901 0.4586 0.4154 0.5698 0.4313

MAE1 0.4643 0.4684 0.4736 0.4451 0.4705 0.4777 0.4858 0.4562

MAE2 1.9165 2.0457 1.8311 1.8584 1.9386 2.0968 1.8452 1.8593

(20)

Figure 2: Time series plots for the absolute returns (top panel) and the posterior mean of volatility, ˆσt= exp(

1

2ˆht), for TOPIX based on the use of GG and MH samplers for

sampling ht, φandν in the four LNSV models, together with realized volatility. 12

10

absolute returns

8

6

4

2

0

2004/1/6 2005/1/4 2006/1/4

6~~~~~~~~~~~~~~~~~--.-~~~~~~~~~~~~~~~~~

2007/1/4 2008/1/4 2009/1/5 2010/1/4 2011/1/4 2011/12/30

SV-normal

5

4

3

2

o ... ~~~~..__~~~-'-~~~-'-~~~_._~~~---'~~~~.__~~~-'-~~~ ... 2004/1/6 2005/1/4 2006/1/4

6~~~~~~~~~~~~~~~~~--.-~~~~~~~~~~~~~~~~~

2007/1/4 2008/1/4 2009/1/5 2010/1/4 2011/1/4 2011/12/30

RV SVtGG

sv~H

SV with fat-tails

5

4

3

2

o ... ~~~~..__~~~-'-~~~-'-~~~_._~~~---'~~~~.__~~~-'-~~~ ... 2004/1/6 2005/1/4 2006/1/4

6~~~~~~~~~~~~~~~~~--.-~~~~~~~~~~~~~~~~~

2007/1/4 2008/1/4 2009/1/5 2010/1/4 2011/1/4 2011/12/30

5

RV SVLGG

sv~H

SV with correlated errors

4

3

2

o ... ~~~~..__~~~-'-~~~-'-~~~-'-~~~---'~~~~'--~~~-'-~~~ ... 2004/1/6 2005/1/4 2006/1/4

6....--.~~~~..--~~~~~~~--.-~~~--.-~~~---.~~~~.--~~~~~~~...-.

2007/1/4 2008/1/4 2009/1/5 2010/1/4 2011/1/4 2011/12/30

SV with fat-tails and correlated errors RV

SVUGG

SVL~H

5

4

3

2