arXiv:1711.03744v1 [q-fin.CP] 10 Nov 2017
Efficient Simulation for Portfolio Credit Risk in Normal Mixture Copula Models
Cheng-Der Fuh† and Chuan-Ju Wang‡
National Central University and Academia Sinica
Abstract
This paper considers the problem of measuring the credit risk in portfolios of loans, bonds, and other instruments subject to possible default. One performance measure of interest is the probability that the portfolio incurs large losses over a fixed time horizon. To capture the extremal dependence among obligors, we study a multi-factor model with a normal mixture copula that allows the multivariate defaults to have an asymmetric distribution. Due to the size of the portfolio, the heterogeneous effect of obligors, and the phenomenon that default events are rare and mutually dependent, it is difficult to calculate portfolio credit risk either by direct analysis or by crude Monte Carlo simulation. This motivates our study of efficient simulation. To this end, we first propose a general account of an importance sampling algorithm based on a two-parameter exponential embedding. Note that this innovative tilting device is more suitable for the multivariate normal mixture model and is of independent interest. Next, by utilizing a fast computational method for how the rare event occurs and the proposed importance sampling method, we provide an efficient simulation algorithm to estimate the probability that the portfolio incurs large losses under the normal mixture copula. Here our proposed simulation device is based on importance sampling for a joint probability rather than the conditional probability used in previous studies. Theoretical investigations and simulation studies are given to illustrate the method.
Subject classifications: Monte Carlo simulation, variance reduction, importance sampling, portfolio credit risk, Fourier method, copula models, rare-event simulation.
† Research partially supported by grants MOST 105-2410-H-008-025-MY2 and MOST 106-2118-M-008-002-MY2.
‡ Research partially supported by grants MOST 102-2221-E-845-002-MY3 and MOST 105-2221-E-001-035.
1 Introduction
Credit risk management usually takes a portfolio approach to measure and manage this risk, in which the dependence among sources of credit risk (obligors) in the portfolio is modeled. With the potential default of obligors, the portfolio approach evaluates the impact of the dependence among them on the probability of multiple defaults and large losses. Moreover, the default event triggered by an obligor is generally captured via so-called threshold models, in which an obligor defaults when a latent variable associated with this obligor goes beyond (or falls below) a predetermined threshold. This important concept is shared among all models originating from Merton's credit risk model (Merton, 1974); the latent variable associated with each obligor is modeled using multiple factors obtained from factor analysis, and thus captures common macroeconomic or industry-specific effects.
To model the dependence structure that shapes the multivariate default distribution, copula functions have been widely adopted in the literature (e.g., Li, 2000; Glasserman and Li, 2005; Glasserman et al., 2007, 2008; Bassamboo et al., 2008; Chan and Kroese, 2010). One of the most widely used models is the normal copula model, which assumes that the latent variables follow a multivariate normal distribution and forms the basis of many risk-management systems such as J. P. Morgan's CreditMetrics (Gupton et al., 1997). In spite of its popularity, some empirical studies suggest that the strong dependence exhibited among financial variables is difficult to capture in the normal copula model (Mashal and Zeevi, 2002). Therefore, in view of this limitation, Bassamboo et al. (2008) present a t-copula model derived from the multivariate t-distribution to capture the relatively strong dependence structure of financial variables. Further studies can be found in Chan and Kroese (2010) and Scott and Metzler (2015).
Monte Carlo simulation is the most widely adopted computational technique when modeling the dependence among sources of credit risk; it however suffers from slow convergence, especially for low-probability events. To obtain rare-event probabilities more efficiently in simulation, one common approach is to shift the factor mean via importance sampling, a type of variance reduction technique (e.g., Asmussen and Glynn, 2007; Rubinstein and Kroese, 2011). Various importance sampling techniques for rare-event simulation in credit risk measurement have been studied in the literature (e.g., Glasserman and Li, 2005; Glasserman et al., 2007; Bassamboo et al., 2008; Botev et al., 2013; Scott and Metzler, 2015; Liu, 2015). For example, Glasserman and Li (2005) develop a two-step importance sampling approach for the normal copula model and derive the logarithmic limits for the tail of the loss distribution associated with single-factor homogeneous portfolios. Due to the difficulty of generalizing the approach to the general multi-factor model, Glasserman et al. (2007) derive the logarithmic asymptotics for the loss distribution under the multi-factor normal copula model, results which are later utilized in Glasserman et al. (2008) to develop importance sampling techniques to estimate the tail probabilities of large losses. Others focus on importance sampling techniques for the t-copula model; for instance, Bassamboo et al. (2008) present two importance sampling algorithms to estimate the probability of large losses under the assumption of the multivariate t-distribution, and Chan and Kroese (2010) use conditional Monte Carlo, which utilizes the asymptotic description of how the rare event occurs. However, most proposed importance sampling techniques for both the normal copula and t-copula models choose tilting parameters by minimizing an upper bound on the second moment of the importance sampling estimator based on large deviations theory, and these papers focus on the one-dimensional case.
An alternative for choosing a tilting parameter is based on the criterion of minimizing the variance of the importance sampling estimator directly. For example, Do and Hall (1991) and Fuh and Hu (2004) study efficient simulations for bootstrapping the sampling distribution of statistics of interest. Su and Fu (2000) and Fu and Su (2002) minimize the variance under the original probability measure. Fuh et al. (2011) apply the importance sampling method to value-at-risk (VaR) computation under a multivariate t-distribution.
Along the line of minimizing the variance of the importance sampling estimator, in this paper we study a problem in portfolio risk under the normal-mixture copula model, which includes the popular normal copula and t-copula models as special cases. Here, we consider grouped normal mixture copulas in our general model setting. An important idea of the normal-mixture model is the inclusion of randomness in the covariance matrix of a multivariate normal distribution via a set of positive mixing variables, which can be interpreted as "shock variables" (see Bassamboo et al., 2008; McNeil et al., 2015) in the context of modeling portfolio credit risk. There have been various empirical and theoretical studies related to normal-mixture models in the literature (e.g., Barndorff-Nielsen, 1978, 1997; Eberlein and Keller, 1995); these models are popular in financial applications because they appear to yield a good fit to financial return data and are consistent with continuous-time models (Eberlein and Keller, 1995).
There are two aspects to this study. To begin with, based on the criterion of minimizing the variance of the importance sampling estimator, we propose a general account of the importance sampling algorithm, which is based on a two-parameter exponential embedding. Theoretical investigations are given to support our method. Note that the innovative tilting formula is more suitable for the grouped normal mixture copula model and is of independent interest. It is worth mentioning that for the normal, multivariate normal, and Gamma distributions, our simulation study shows that the two-parameter exponential tilting performs 2 to 5 times better than the classical one-parameter exponential tilting for some simple rare-event simulations.
Numerical results of the proposed algorithm are given under various copula settings.
The remainder of this paper is organized as follows. Section 2 formulates the problem of estimating large portfolio losses and presents the normal-mixture copula model. Section 3 presents a general account of importance sampling. We then study the proposed optimal importance sampling for portfolio loss under the normal-mixture copula model in Section 4. The performance of our methods is demonstrated with an extensive simulation study in Section 5. Section 6 concludes.
2 Portfolio loss under the normal mixture copula model
Consider a portfolio of loans consisting of n obligors, each of whom has a small probability of default. We further assume that the loss resulting from the default of the kth obligor, denoted by c_k (monetary units), is known. In copula-based credit models, dependence among the default indicators of the obligors is introduced through a vector of latent variables X = (X_1, …, X_n), where the kth obligor defaults if X_k exceeds some chosen threshold χ_k. The total loss from defaults is then denoted by
L_n = c_1 1_{X_1>χ_1} + ··· + c_n 1_{X_n>χ_n},   (1)
where 1 denotes the indicator function. In particular, the problem of interest is to estimate the probability of large losses, P(L_n > τ), especially at large values of τ.
As mentioned earlier in the introduction, the widely used normal copula model might assign inadequate probability to the event of many simultaneous defaults in a portfolio. In view of this, Bassamboo et al. (2008) and Chan and Kroese (2010) set forth the t-copula model for modelling portfolio credit risk. In this paper, we consider the normal-mixture model (McNeil et al., 2015), which includes the normal copula and t-copula models as special cases, in the generalized d-factor form:
X_k = ρ_{k1}√W_1 Z_1 + ··· + ρ_{kd}√W_d Z_d + ρ̄_k √W_{d+1} ε_k,   k = 1, …, n,   (2)
in which
• Z = (Z_1, …, Z_d)^⊺ follows a d-dimensional multivariate normal distribution with zero mean and covariance matrix Σ, where ⊺ denotes the transpose of a vector;
• W = (W_1, …, W_d, W_{d+1}) are non-negative scalar-valued random variables independent of Z, and the shock variables W_j, j = 1, …, d+1, are independent of each other;
• ε_k ∼ N(0, σ_ε²) is an idiosyncratic risk associated with the kth obligor;
• ρ_{k1}, …, ρ_{kd} are the factor loadings for the kth obligor, with ρ_{k1}² + ··· + ρ_{kd}² ≤ 1;
• ρ̄_k = √(1 − (ρ_{k1}² + ··· + ρ_{kd}²)).
Model (2) is the so-called grouped normal mixture copula in McNeil et al. (2015). The above distributions are known as variance mixture models, which are constructed by drawing randomly from a set of component multivariate normals according to "weights" controlled by the distribution of W. Such distributions enable us to blend in multiplicative shocks via the variables W, which can be interpreted as shocks arising from new information. Note that with W_j^{−1} = Q_j/ν_j and Q_j ∼ Gamma(ν_j/2, 1/2) (for j = 1, …, d+1), the vector of latent variables X = (X_1, …, X_n) forms an n-dimensional multivariate t distribution. In addition, we consider the case in which W_j, j = 1, …, d+1, in Equation (2) has a generalized inverse Gaussian (GIG) distribution. The GIG mixing distribution, a special case of the symmetric generalized hyperbolic (GH) distribution, is very flexible for modelling financial returns. Moreover, GH distributions also include the symmetric normal inverse Gaussian (NIG) distribution and a symmetric multivariate distribution with hyperbolic one-dimensional marginals as interesting examples. This class of distributions has become popular in the financial modelling literature, an important reason being its link to Lévy processes (like Brownian motion or the compound Poisson process) that are used to model price processes in continuous time. For example, the generalized hyperbolic distribution has been used to model financial returns in Eberlein and Keller (1995) and
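To make the model concrete, the following sketch simulates the latent vector and the resulting loss L_n by crude Monte Carlo for the single-factor (d = 1) t-copula special case just described, with W_j^{−1} = Q_j/ν and Q_j ∼ Gamma(ν/2, 1/2). The portfolio parameters (loading ρ, exposures c_k, thresholds χ_k, degrees of freedom ν) are hypothetical, and σ_ε is set to 1.

```python
import numpy as np

rng = np.random.default_rng(0)

def crude_mc_loss_prob(n_paths, c, chi, rho, nu, tau):
    """Crude Monte Carlo estimate of P(L_n > tau) for the d = 1 case of
    model (2) with W_j^{-1} = Q_j / nu, Q_j ~ Gamma(nu/2, rate 1/2)."""
    n = len(c)
    rho_bar = np.sqrt(1.0 - rho**2)           # bar{rho}_k for a homogeneous loading
    hits = 0
    for _ in range(n_paths):
        z = rng.standard_normal()             # systematic factor Z_1
        q = rng.gamma(nu / 2.0, 2.0, size=2)  # rate 1/2  <=>  scale 2
        w = nu / q                            # shock variables W_1, W_2
        eps = rng.standard_normal(n)          # idiosyncratic risks, sigma_eps = 1
        x = rho * np.sqrt(w[0]) * z + rho_bar * np.sqrt(w[1]) * eps
        if c @ (x > chi) > tau:               # L_n = sum_k c_k 1{X_k > chi_k}
            hits += 1
    return hits / n_paths

# hypothetical homogeneous portfolio: 100 unit exposures, threshold 3, 20% loss level
n = 100
p_hat = crude_mc_loss_prob(20_000, np.ones(n), np.full(n, 3.0), rho=0.5, nu=4.0, tau=20)
```

The slow convergence of this estimator when τ is large is exactly the difficulty that the importance sampling developed in the following sections addresses.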
Eberlein et al. (1998). The reader is referred to McNeil et al. (2015) for more details.
Now, we define the tail probability of total portfolio losses conditional on Z and W. To be more specific, the tail probability of total portfolio losses conditional on the factors Z and W, denoted by h(Z, W), is defined as

h(Z, W) = P(L_n > τ | Z, W).   (3)

The desired probability of losses can then be represented as

P(L_n > τ) = E[h(Z, W)].   (4)
To achieve an efficient Monte Carlo simulation of the probability of total portfolio losses (4), we apply importance sampling to the distributions of the factors Z = (Z_1, …, Z_d)^⊺ and W = (W_1, …, W_{d+1})^⊺ (see Equation (2)). In other words, we attempt to choose importance sampling distributions for both Z and W that reduce the variance in estimating the integral E[h(Z, W)] against the original densities of Z and W.
As noted in Glasserman and Li (2005) for normal copula models, the simulation of (4) involves two rare events: the default event and the total portfolio loss event. For the normal mixture model (2), this makes the simulation of (4) even more challenging. To obtain a general simulation algorithm for this type of problem, we simulate P(L_n > τ) as the expected value of h(Z, W) in (4). Our device is based on simulation of a joint probability rather than the conditional-probability simulation considered in the previous literature. Moreover, we note that the simulated distributions, the multivariate normal distribution of Z and the commonly adopted Gamma distribution for W, are both two-parameter distributions. This motivates us to study a two-parameter exponential tilting in the next section.
3 A general account of importance sampling
Let (Ω, F, P) be a given probability space. Let X = (X_1, …, X_d)^⊺ be a d-dimensional random vector having f(x) = f(x_1, …, x_d) as a probability density function (pdf), with respect to the Lebesgue measure L, under the probability measure P. Let ℘(·) be a real-valued function from R^d to R. The problem of interest is to calculate the expectation of ℘(X),

m = E^P[℘(X)],   (5)

where E^P[·] is the expectation operator under the probability measure P.
To calculate the value of (5) using importance sampling, one needs to select a sampling probability measure Q under which X has a pdf q(x) = q(x_1, …, x_d) with respect to the Lebesgue measure L. The probability measure Q is assumed to be absolutely continuous with respect to the original probability measure P. Therefore, Equation (5) can be written as

m = E^Q[℘(X) f(X)/q(X)],   (6)

where E^Q[·] is the expectation operator under which X has the pdf q(x) with respect to the Lebesgue measure L. The ratio f(x)/q(x) is called the importance sampling weight, the likelihood ratio, or the Radon–Nikodym derivative.
Here, we focus on exponentially tilted probability measures of P. Instead of the commonly adopted one-parameter exponential tilting that appears in the literature (Asmussen and Glynn, 2007), we consider a two-parameter exponential tilting. Note that the distributions we are going to simulate, the multivariate normal distribution and the Gamma distribution, are both two-parameter distributions. To the best of our knowledge, the device of using a two-parameter exponential embedding seems to be new in the literature. As will be seen in the following examples, the tilting probabilities for existing two-parameter distributions can be obtained by solving simple formulas.
Let Q_{θ,η} be the tilting probability measure, where the subscripts θ = (θ_1, …, θ_p)^⊺ ∈ Θ ⊂ R^p and η = (η_1, …, η_q)^⊺ ∈ H ⊂ R^q are the tilting parameters; p and q denote the numbers of parameters in Θ and H, respectively. Assume that the moment generating function of (X, h(X)) exists and is denoted by Ψ(θ, η), with cumulant function ψ(θ, η) := log Ψ(θ, η). Let f_{θ,η}(x) be the pdf of X under the exponentially tilted probability measure Q_{θ,η}, defined by

f_{θ,η}(x) = exp{θ^⊺x + η^⊺h(x) − ψ(θ, η)} f(x).   (7)

Under the two-parameter exponential embedding (7), Equation (6) becomes

m = E^{Q_{θ,η}}[℘(X) exp{ψ(θ, η) − θ^⊺X − η^⊺h(X)}].   (8)

Because of the unbiasedness of the importance sampling estimator, its variance is

Var_{Q_{θ,η}}[℘(X) exp{ψ(θ, η) − θ^⊺X − η^⊺h(X)}] = G(θ, η) − m²,

so that minimizing the variance is equivalent to minimizing G(θ, η). Standard algebra gives the simpler form

G(θ, η) = E^P[℘²(X) exp{ψ(θ, η) − θ^⊺X − η^⊺h(X)}],   (9)

which is used to find the optimal tilting parameters. In the following theorem, we show that G(θ, η) in (9) is a convex function in θ and η. This property ensures that there is no multi-mode problem in the search for the optimal tilting parameters.
To minimize G(θ, η), the first-order condition requires the solution, denoted by (θ*, η*), to satisfy ∇_θ G(θ, η)|_{θ=θ*} = 0 and ∇_η G(θ, η)|_{η=η*} = 0, where ∇_θ and ∇_η denote the gradients with respect to θ and η. A direct calculation gives

∇_θ G(θ, η) = E^P[℘²(X) exp{ψ(θ, η) − θ^⊺X − η^⊺h(X)} (∇_θ ψ(θ, η) − X)],
∇_η G(θ, η) = E^P[℘²(X) exp{ψ(θ, η) − θ^⊺X − η^⊺h(X)} (∇_η ψ(θ, η) − h(X))],

and, therefore, (θ*, η*) is the root of the resulting system of nonlinear equations.
The following theorem states the existence, uniqueness, and characterization of the minimizer of (9). Before that, we need a condition for Ψ(θ, η) to be steep, which ensures the finiteness of the moment generating function Ψ(θ, η). To define steepness, let θ_{−i} = (θ_1, …, θ_i, …, θ_p) ∈ Θ denote θ with all components θ_k, k = 1, …, i−1, i+1, …, p, held fixed except the ith, and denote η_{−j} ∈ H in the same way for j = 1, …, q. Now, let θ_{i,max} := sup{θ_i : Ψ(θ_{−i}, η) < ∞} for i = 1, …, p, and η_{j,max} := sup{η_j : Ψ(θ, η_{−j}) < ∞} for j = 1, …, q (for light-tailed distributions, we have 0 < θ_{i,max} ≤ ∞ for i = 1, …, p, and 0 < η_{j,max} ≤ ∞ for j = 1, …, q). Then steepness means Ψ(θ, η) → ∞ as θ_i → θ_{i,max} for i = 1, …, p, or η_j → η_{j,max} for j = 1, …, q.
The following conditions will be used in Theorem 3.1:
i) Ψ̄(θ, η)Ψ(θ, η) → ∞ as θ_i → θ_{i,max} for i = 1, …, p, or η_j → η_{j,max} for j = 1, …, q;
ii) G(θ, η) is a continuously differentiable function on Θ × H, and

sup_{i=1,…,p; j=1,…,q} { lim_{θ_i→θ_{i,max}} ∂G(θ, η)/∂θ_i , lim_{η_j→η_{j,max}} ∂G(θ, η)/∂η_j } > 0.   (13)
Theorem 3.1 Suppose the moment generating function Ψ(θ, η) of (X, h(X)) exists for θ ∈ Θ ⊂ R^p and η ∈ H ⊂ R^q. Assume Ψ(θ, η) is steep. Furthermore, assume either i) or ii) holds. Then G(θ, η) defined in (9) is a convex function of (θ, η), and there exists a unique minimizer of (9), which satisfies

∇_θ ψ(θ, η) = E^{Q̄_{θ,η}}[X],   (14)
∇_η ψ(θ, η) = E^{Q̄_{θ,η}}[h(X)].   (15)
The proof of Theorem 3.1 will be given in the Appendix.
To illustrate our proposed two-parameter exponential tilting, we present two examples: the multivariate normal and Gamma distributions. The reason for choosing these two distributions is to indicate the location and scale properties of the exponential tilting, which will be used in our general framework. In these examples, by using a suitable re-parametrization, we obtain neat tilting formulas for each distribution. Our simulation studies also show that the two-parameter exponential tilting performs 2 to 5 times better than the classical one-parameter exponential tilting for some simple rare events.
We here check the validity of applying Theorem 3.1 to each example. First, we note that both the multivariate normal distribution and the Gamma distribution are steep. Next, it is easy to see that the sufficient condition Ψ̄(θ)Ψ(θ) → ∞ as θ_i → θ_{i,max} for i = 1, …, p, or η_j → η_{j,max} for j = 1, …, q, in Theorem 3.1 holds in each example. For example, when X ∼ N_d(0, Σ), Ψ(θ) = O(e^{‖θ‖²}) approaches ∞ sufficiently fast. Another simple example is ℘(X) = 1_{X∈A} with A := [a_1, ∞) × ··· × [a_d, ∞), where a_i > 0 for all i = 1, …, d, and X has a d-dimensional standard normal distribution; it is easy to check that the sufficient conditions of Theorem 3.1 hold.
Example 3.1 Normal distribution.
To illustrate the concept of the two-parameter exponential embedding, we first consider a one-dimensional normally distributed random variable. Let X be a random variable with the standard normal distribution, denoted by N(0, 1), with pdf dP/dL = e^{−x²/2}/√(2π). By using the two-parameter exponential embedding in (7) with h(x) := x², we have

dQ_{θ,η}/dP = exp{θx + ηx²}/E[exp{θX + ηX²}],   (16)

for −1/2 < η < 1/2 (the upper bound makes the tilted measure well defined, the lower bound the measure Q̄ below). For ease of presentation, we consider the standard parametrization µ := θ/(1 − 2η), σ² := 1/(1 − 2η), under which Q_{µ,σ} is the N(µ, σ²) distribution, and define Q̄_{µ,σ} as N(−(2 − σ^{−2})^{−1}σ^{−2}µ, (2 − σ^{−2})^{−1}).
In the case of a one-parameter exponential embedding with σ fixed, a standard calculation gives Ψ(θ) = e^{θ²/2}, ψ(θ) = θ²/2, and ψ′(θ) = θ. Using the fact that X|{X > a} is a truncated normal distribution with minimum value a under Q̄, θ* needs to satisfy θ = φ(a + θ)/(1 − Φ(a + θ)) − θ; cf. Fuh and Hu (2004).
Table 1 presents numerical results for the normal distribution. As demonstrated in the table, using the two-parameter exponential tilting for the simple event 1_{X>a} achieves 2 to 5 times better performance than the one-parameter exponential tilting in terms of variance reduction factors.

Table 1: Two-parameter importance sampling for the normal distribution.
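The comparison behind Table 1 can be sketched as follows: estimate P(X > a) for X ∼ N(0, 1) by sampling from the tilted law N(µ, σ²) and weighting by the likelihood ratio dP/dQ. The tilting values below are illustrative guesses, not the optimal (µ*, σ*) solving the first-order conditions; note that for this event the scale must satisfy σ² > 1/2, or the second moment of the estimator is infinite.

```python
import math
import numpy as np

rng = np.random.default_rng(0)

def norm_logpdf(x, mu, sigma):
    return -0.5 * ((x - mu) / sigma) ** 2 - math.log(sigma) - 0.5 * math.log(2 * math.pi)

def tilted_estimate(a, mu, sigma, n=200_000):
    """Importance-sampling estimate of P(X > a), X ~ N(0,1), drawing X
    from the tilted measure N(mu, sigma^2); returns (estimate, variance)."""
    x = rng.normal(mu, sigma, n)
    lr = np.exp(norm_logpdf(x, 0.0, 1.0) - norm_logpdf(x, mu, sigma))  # dP/dQ
    y = (x > a) * lr
    return y.mean(), y.var() / n

a = 4.0
exact = 0.5 * math.erfc(a / math.sqrt(2))        # 1 - Phi(4), about 3.17e-5
one_param = tilted_estimate(a, a, 1.0)           # tilt the mean only
two_param = tilted_estimate(a, a + 0.25, 0.8)    # tilt mean and scale together
```

Both tilted estimators hit the rare event on a constant fraction of draws, whereas a crude sample of the same size would contain only a handful of exceedances.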
Example 3.2 Multivariate normal distribution.
We now proceed to a d-dimensional multivariate normal distribution. Let X = (X_1, …, X_d)^⊺ be a random vector with the standard multivariate normal distribution, denoted by N(0, I). Analogous to the one-dimensional case, the embedding uses h(x) := x^⊺Mx with a symmetric tilting matrix M determined by η; the associated normalizer involves |(I − 2M)^{−1}|, where |·| denotes the determinant of a matrix. We consider the standard parametrization µ := (I − 2M)^{−1}θ, Σ := (I − 2M)^{−1}, and define Q̄_{µ,Σ} as N(−(2I − Σ^{−1})^{−1}Σ^{−1}µ, (2I − Σ^{−1})^{−1}).

Remark 1 The left-hand sides (LHS) of (19) and (20) are derivatives of the cumulant function ψ_{MN}(θ, η) := log(√|(I − 2M)^{−1}|) + ½ θ^⊺(I − 2M)^{−1}θ for the multivariate normal.
Table 2 presents numerical results for the standard bivariate normal distribution. As shown in the table, for the three types of events 1_{X_1+X_2>a}, 1_{X_1>a, X_2>a}, and 1_{X_1X_2>a, X_1>0, X_2>0}, tilting different parameters results in different variance reduction performance. Although tilting the variance parameter or the correlation parameter alone sometimes provides poor performance, combining them with the mean-parameter tilting can yield 2 to 3 times better performance than the one-parameter exponential tilting.
[X ∼ N_2(0, I)]                Variance reduction factors
a      Crude          µ*      σ*    ρ*    (µ*, σ*, ρ*)
P(X_1 + X_2 > a)
3      1.663×10⁻²     24      2     2     43
4      2.400×10⁻³     138     4     2     354
5      1.800×10⁻⁴     1,064   8     4     4,036
P(X_1 > a, X_2 > a)
1      2.532×10⁻²     9       1     2     16
1.5    4.600×10⁻³     34      2     7     68
2      5.800×10⁻⁴     227     5     18    504
P(X_1X_2 > a, X_1 > 0, X_2 > 0)
2      1.538×10⁻²     21      2     3     46
3      4.800×10⁻³     57      2     3     145
5      5.600×10⁻⁴     425     5     3     1,213

Table 2: Two-parameter importance sampling for the standard bivariate normal distribution.
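For instance, the joint event P(X_1 > a, X_2 > a) in Table 2 can be estimated with a mean-and-scale tilt applied to each coordinate (leaving the correlation untilted); the tilting values below are illustrative, not the optimal (µ*, σ*, ρ*):

```python
import math
import numpy as np

rng = np.random.default_rng(0)

def norm_logpdf(x, mu, sigma):
    return -0.5 * ((x - mu) / sigma) ** 2 - math.log(sigma) - 0.5 * math.log(2 * math.pi)

def joint_tail(a, mu, sigma, n=200_000):
    """Estimate P(X1 > a, X2 > a) for X ~ N2(0, I), sampling each
    coordinate independently from the tilted law N(mu, sigma^2)."""
    x = rng.normal(mu, sigma, size=(n, 2))
    log_lr = (norm_logpdf(x, 0.0, 1.0) - norm_logpdf(x, mu, sigma)).sum(axis=1)
    y = np.all(x > a, axis=1) * np.exp(log_lr)
    return y.mean()

a = 2.0
exact = (0.5 * math.erfc(a / math.sqrt(2))) ** 2  # independence: (1 - Phi(a))^2
est = joint_tail(a, mu=2.2, sigma=0.9)
```

Since the coordinates of X are independent under N_2(0, I), the exact value (1 − Φ(a))² is available as a check.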
Example 3.3 Gamma distribution.
Let X be a random variable with the Gamma distribution, denoted by Gamma(α, β), with pdf dP/dL = (β^α/Γ(α)) x^{α−1} e^{−βx}. By using the two-parameter exponential embedding in (7) with h(x) := −ln(βx), we have

dQ_{θ,η}/dP = exp{θx + ηh(x)}/E[exp{θX + ηh(X)}] = e^{θx − η ln(βx)} Γ(α) (β − θ)^{α−η} / (β^{α−η} Γ(α − η)).   (23)

In this case, the tilting probability measure Q_{θ,η} is Gamma(α − η, β − θ). For the event ℘(X) = 1_{X>a} with a > 0, define Q̄_{θ,η} as Gamma(α + η, β + θ). Applying Theorem 3.1, (θ*, η*) is the root of the following system of equations:

(α − η)/(β − θ) = E^{Q̄_{θ,η}}[X | X > a],   (24)
ln β − ln(β − θ) + Υ(α − η) = E^{Q̄_{θ,η}}[ln(βX) | X > a],   (25)

where Υ is the digamma function, Υ(α − η) = Γ′(α − η)/Γ(α − η).
Table 3 presents numerical results for the Gamma distribution. The commonly used one-parameter exponential tilting changes the parameter β only (i.e., changes β to β − θ*) in the case of the Gamma distribution. However, tilting the other parameter η can outperform the one-parameter exponential tilting in terms of variance reduction factors. This is due to the constraints η < α and θ < β. For instance, consider tilting only one parameter, either θ or η. For the simple event 1_{X>a}, we can either choose a parameter θ with 0 < θ < β or a parameter η with η < 0 to obtain a larger mean of the tilted Gamma distribution. In this case, it is easy to see that η-tilting yields a larger search space for the parameter and thus achieves better performance than θ-tilting. Note that the event 1_{1/X>a} shows the opposite effect.
[X ∼ Gamma(α, β)]
      Variance reduction factors                          Variance reduction factors
a     Crude for P(X > a)    θ*     η*      (θ*, η*)   |   a     Crude for P(1/X > a)    θ*      η*     (θ*, η*)
10    2.613×10⁻¹            2      3       3          |   0.2   2.438×10⁻¹              4       2      6
20    1.050×10⁻²            24     46      47         |   0.5   1.864×10⁻²              41      15     45
30    1.800×10⁻⁴            567    1,288   1,307      |   1.5   3.100×10⁻⁴              1,321   294    1,744
35    3.000×10⁻⁵            4,788  11,788  12,226     |   2.5   6.000×10⁻⁵              11,904  2,156  11,939

Table 3: Two-parameter importance sampling for the Gamma distribution.
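A sketch of the tilting from Example 3.3 for the event {X > a}: sample from Gamma(α − η, β − θ) and reweight by the likelihood ratio. The parameter values below are illustrative rather than the optimal roots of (24)–(25).

```python
import math
import numpy as np

rng = np.random.default_rng(0)

def gamma_logpdf(x, alpha, beta):
    # Gamma(alpha, beta) log-density in the rate parametrization
    return alpha * math.log(beta) + (alpha - 1) * np.log(x) - beta * x - math.lgamma(alpha)

def tilted_tail(a, alpha, beta, theta, eta, n=200_000):
    """Estimate P(X > a), X ~ Gamma(alpha, beta), sampling from the
    tilted measure Q_{theta,eta} = Gamma(alpha - eta, beta - theta)."""
    x = rng.gamma(alpha - eta, 1.0 / (beta - theta), n)  # numpy uses shape/scale
    log_lr = gamma_logpdf(x, alpha, beta) - gamma_logpdf(x, alpha - eta, beta - theta)
    y = (x > a) * np.exp(log_lr)
    return y.mean()

alpha, beta, a = 2.0, 1.0, 10.0
exact = (1.0 + a) * math.exp(-a)                 # closed-form tail for Gamma(2, 1)
theta_only = tilted_tail(a, alpha, beta, theta=0.8, eta=0.0)  # theta-tilting
eta_only = tilted_tail(a, alpha, beta, theta=0.0, eta=-8.0)   # eta-tilting
```

Both choices move the tilted mean (α − η)/(β − θ) to the boundary a = 10, but η can do so without the constraint θ < β that caps the θ-tilt.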
4 Importance sampling for portfolio loss
For ease of presentation, this section is divided into three parts. We first introduce the two-parameter exponential tilting for normal-mixture distributions in Section 4.1. Section 4.2 provides importance sampling of h(Z, W) defined in (3). Section 4.3 summarizes the importance sampling algorithm for calculating the probability of the portfolio loss P(L_n > τ).
4.1 Two-parameter tilting for normal-mixture distributions
Recall that in Equation (2) the latent random vector X follows a multivariate mixture distribution. In this section, for simplicity, we consider a one-dimensional normal-mixture distribution as an example to demonstrate the two-parameter exponential tilting. Let X be a one-dimensional normal-mixture random variable with only one factor (i.e., d = 1) such that

X = ξ√W Z,   (26)

where ξ ∈ R, Z ∼ N(0, 1), and W is a non-negative scalar-valued random variable independent of Z. Since the random variable W is independent of Z, by using the two-parameter exponential embedding with h(z) for Z and h(w) for W, we have

dQ_{θ_1,η_1,θ_2,η_2}/dP = [exp{θ_1 z + η_1 h(z)}/E[exp{θ_1 Z + η_1 h(Z)}]] × [exp{θ_2 w + η_2 h(w)}/E[exp{θ_2 W + η_2 h(W)}]].   (27)
Now, we consider the standard parametrization µ := θ_1/(1 − 2η_1), σ² := 1/(1 − 2η_1), θ := θ_2, and η := η_2, as adopted in the examples of Section 3. If W follows a Gamma distribution, Gamma(α, β), then from Equations (16) and (23), Equation (27) becomes

dQ_{µ,σ,θ,η}/dP = (1/σ) exp{µz − µ²/2 + ((σ² − 1)/(2σ²))(z − µ)²} × e^{θw − η ln(βw)} Γ(α) (β − θ)^{α−η} / (β^{α−η} Γ(α − η)).   (28)

The optimal tilting parameters µ*, σ* for Z can be obtained by solving Equation (17), and θ* and η* for W are the solutions of (24) and (25).
Table 4 tabulates the numerical results for this one-dimensional, one-factor normal-mixture distribution. Since for a normal-mixture random variable the variance is associated with the random variable W, tilting the standard deviation σ together with the mean µ of the normal random variable Z is relatively insignificant compared with tilting the parameters of W (i.e., θ or η) together with µ, as shown in Table 4. In addition, similar to the case demonstrated in Example 3.3, tilting η with µ also yields better performance than tilting θ with µ in the sense of variance reduction.
[Z ∼ N(0, 1), W ∼ Gamma(α, β)]        Variance reduction factors
a     Crude          µ*    σ*    (µ*, σ*)    θ*    η*    (µ*, θ*)    (µ*, η*)
P(√W Z > a)
2     1.344×10⁻¹     3     1     6           1     1     4           5
4     2.808×10⁻²     7     2     10          1     2     13          17
8     9.500×10⁻⁴     30    8     48          3     4     234         394
12    2.000×10⁻⁵     70    28    199         4     11    5,054       9,619

Table 4: Two-parameter importance sampling for the normal-mixture distribution.
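The (µ, η) tilting that Table 4 favors can be sketched as follows for P(√W Z > a) with Z ∼ N(0, 1) and W ∼ Gamma(α, β), taking ξ = 1; the tilting values are illustrative, not the optimal ones.

```python
import math
import numpy as np

rng = np.random.default_rng(0)

def mixture_tail(a, alpha, beta, mu, eta, n=400_000):
    """Estimate P(sqrt(W) Z > a), sampling Z from N(mu, 1) and W from
    Gamma(alpha - eta, beta); mu = eta = 0 recovers crude Monte Carlo."""
    z = rng.normal(mu, 1.0, n)
    w = rng.gamma(alpha - eta, 1.0 / beta, n)
    # the likelihood ratio dP/dQ factorizes over the independent Z and W
    log_lr_z = 0.5 * mu**2 - mu * z
    log_lr_w = eta * np.log(beta * w) + math.lgamma(alpha - eta) - math.lgamma(alpha)
    y = (np.sqrt(w) * z > a) * np.exp(log_lr_z + log_lr_w)
    return y.mean()

a, alpha, beta = 2.0, 1.0, 1.0
crude = mixture_tail(a, alpha, beta, mu=0.0, eta=0.0)
tilted = mixture_tail(a, alpha, beta, mu=1.5, eta=-1.5)
```

Tilting η (rather than σ) directly enlarges the shock variable W, which is what drives the tail of the mixture.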
Next, we summarize the tilting for the event that the kth obligor defaults when X_k exceeds a given threshold χ_k as the "XYZ-event tilting", which involves the calculation of the tail event

{(X + Y)Z > a},   (29)

where X denotes the normally distributed part of the systematic risk factors, Y denotes the idiosyncratic risk associated with each obligor, and Z denotes the non-negative scalar-valued random variables independent of X and Y. For example, for the normal-mixture copula model in (2), the d-dimensional multivariate normal random vector Z = (Z_1, …, Z_d) is associated with X, ε_k is associated with Y, and the non-negative scalar-valued random variables W are associated with Z. Table 5 summarizes the exponential tilting used in Glasserman and Li (2005), Bassamboo et al. (2008), Chan and Kroese (2010), and our paper. Note that Glasserman and Li (2005) consider the normal copula model, so there is no need for tilting Z.
     Glasserman and Li (2005)    Bassamboo et al. (2008)    Chan and Kroese (2010)    Our paper
     multivariate normal         t-distribution             t-distribution            normal-mixture
X    ✓                           ✗                          ✓                         ✓
Y    ✗                           ✗                          ✓                         ✗
Z    NA                          ✓                          ✗                         ✓

Table 5: XYZ-event tilting.
4.2 Exponential tilting for h(Z, W)
In this subsection, we follow the same notation as in Section 3. Let Z = (Z_1, …, Z_d)^⊺ be a d-dimensional multivariate normal random vector with zero mean and identity covariance matrix I, and denote by W = (W_1, …, W_{d+1})^⊺ non-negative scalar-valued random variables independent of Z. Under the probability measure P, let f_1(z) = f_1(z_1, …, z_d) and f_2(w) = f_2(w_1, …, w_{d+1}) be the probability density functions of Z and W, respectively, with respect to the Lebesgue measure L. As alluded to earlier, our aim is to calculate the expectation of h(Z, W),

m = E^P[h(Z, W)].   (30)
To evaluate (30) via importance sampling, we choose a sampling probability measure Q under which Z and W have the corresponding probability density functions q_1(z) = q_1(z_1, …, z_d) and q_2(w) = q_2(w_1, …, w_{d+1}). Assuming that Q is absolutely continuous with respect to P, Equation (30) can be written as

E^P[h(Z, W)] = E^Q[h(Z, W) (f_1(Z)/q_1(Z)) (f_2(W)/q_2(W))].   (31)
Let Q_{µ,Σ,θ,η} be the two-parameter exponentially tilted probability measure of P. Here the subscripts µ = (µ_1, …, µ_d)^⊺ and Σ, constructed via ρ and σ = (σ_1, …, σ_d)^⊺, are the tilting parameters for the random vector Z, and θ = (θ_1, …, θ_{d+1})^⊺ and η = (η_1, …, η_{d+1})^⊺ are the tilting parameters for W. Define the likelihood ratios

r_{1,µ,Σ}(z) = f_1(z)/q_{1,µ,Σ}(z)   and   r_{2,θ,η}(w) = f_2(w)/q_{2,θ,η}(w).   (32)

Then, combining with (32), Equation (31) becomes

E^Q[h(Z, W)(f_1(Z)/q_1(Z))(f_2(W)/q_2(W))] = E^{Q_{µ,Σ,θ,η}}[h(Z, W) r_{1,µ,Σ}(Z) r_{2,θ,η}(W)].   (33)

Denote

G(µ, Σ, θ, η) = E^P[h²(Z, W) r_{1,µ,Σ}(Z) r_{2,θ,η}(W)].   (34)
By using the same argument as that in Section 3, we minimize G(µ, Σ, θ, η) to obtain the optimal tilting parameters, which satisfy a system of first-order conditions analogous to (14) and (15) (Equations (35) and (36)). To solve for the optimal tilting parameters in (35) and (36), we need to calculate the conditional default probability P(L_n > τ | Z = z, W = w). For this purpose, we apply the fast inverse Fourier transform (FFT) method described below. Before that, we first note that, using the definition of the latent factor X_k for the kth obligor in Equation (2), the conditional default probability of the kth obligor given Z = (z_1, …, z_d)^⊺ and W = (w_1, …, w_{d+1})^⊺ is

p_k^{z,w} := P(X_k > χ_k | Z = z, W = w) = 1 − Φ((χ_k − Σ_{j=1}^d ρ_{kj}√w_j z_j)/(ρ̄_k √w_{d+1} σ_ε)),

where Φ denotes the standard normal distribution function.
The (fast) inverse Fourier transform for non-identical c_k.
With non-identical c_k, the distribution of the sum of n independent but non-identically distributed "weighted" Bernoulli random variables becomes hard to evaluate. Here we adopt the inverse Fourier transform to calculate h(z, w) (Oberhettinger, 2014). Recall that L_n | (Z = z, W = w), denoted by L_n^{z,w}, is a sum of independent weighted Bernoulli random variables with characteristic function φ_{L_n^{z,w}}(t). We can recover the probabilities q_k^{z,w} = P(L_n^{z,w} = k) by inverting the Fourier series:

q_k^{z,w} = (1/2π) ∫_0^{2π} φ_{L_n^{z,w}}(t) e^{−itk} dt.

φ_{L_n^{z,w}}(t) has a period of 2π, i.e., φ_{L_n^{z,w}}(t) = φ_{L_n^{z,w}}(t + 2π) for all t, which is due to the fact that e^{i(t+2π)k} = e^{itk}. With this periodic property, we now evaluate the characteristic function φ_{L_n^{z,w}} at N equally spaced values in the interval [0, 2π] as

b_m^{z,w} = φ_{L_n^{z,w}}(2πm/N),   m = 0, 1, …, N − 1,

which defines the DFT of the sequence of probabilities q_k^{z,w}. By using the corresponding sequence of characteristic function values above, we can recover the sequence of probabilities, that is, obtain the q_k^{z,w}'s from the b_m^{z,w}'s, by employing the inverse DFT operation

q̃_k^{z,w} = (1/N) Σ_{m=0}^{N−1} b_m^{z,w} e^{−i2πkm/N},   k = 0, 1, …, N − 1.   (41)

Finally, the approximation of h(z, w) can be calculated as

h̃(z, w) = 1 − P(L_n^{z,w} ≤ τ) = 1 − Σ_{ℓ=0}^{τ} q̃_ℓ^{z,w}.   (42)
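The inversion steps above can be sketched with numpy's FFT. Here p_k = P(X_k > χ_k | z, w) are the conditional default probabilities, c_k are integer-valued exposures, and the characteristic function of L_n^{z,w} is the product of the per-obligor factors (1 − p_k) + p_k e^{itc_k}; the function names are ours, not from the paper.

```python
import numpy as np

def loss_pmf(c, p, N):
    """Probabilities q_k = P(L = k) for L = sum_k c_k * Bernoulli(p_k),
    with c_k non-negative integers, recovered from N samples of the
    characteristic function on [0, 2*pi); N must exceed sum(c)."""
    t = 2.0 * np.pi * np.arange(N) / N
    b = np.ones(N, dtype=complex)
    for ck, pk in zip(c, p):
        b *= (1.0 - pk) + pk * np.exp(1j * t * ck)   # product of Bernoulli factors
    # q_k = (1/N) sum_m b_m e^{-i 2 pi k m / N}: a forward DFT scaled by 1/N
    q = np.fft.fft(b).real / N
    return np.clip(q, 0.0, 1.0)

def tail_prob(c, p, tau):
    """h(z, w) = P(L > tau), truncating the pmf as in Equation (42)."""
    q = loss_pmf(c, p, int(sum(c)) + 1)
    return 1.0 - q[: int(tau) + 1].sum()
```

For instance, with c = (1, 2) and p = (0.5, 0.5) the loss takes the values 0, 1, 2, 3 with probability 1/4 each, so tail_prob returns 0.25 at τ = 2.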
4.3 Algorithms

This subsection summarizes the steps of the proposed two-parameter importance sampling algorithm, which consists of two components: the probability calculation and the search for the tilting parameters. The aim of the first component is to calculate the probability of losses, using the optimal tilting parameters µ*, Σ*, θ*, and η*.
For demonstration purposes, we show the detailed procedure for the t-copula model, in which the latent vector X in Equation (2) follows a multivariate t distribution (recall that W_j^{−1} = Q_j/ν_j and Q_j ∼ Gamma(ν_j/2, 1/2) for j = 1, …, d + 1). The detailed procedures of the first component are summarized as follows:
• Calculate the probability of losses, P(Ln > τ):
(1) Generate independent samples z^{(i)} from N(µ∗, Σ∗) and q_j^{(i)} from Gamma(ν_j/2 − η_j∗, 1/2 − θ_j∗) for i = 1, . . . , M; calculate w_j^{(i)} = ν_j/q_j^{(i)} for j = 1, . . . , d + 1.
(2) Estimate m by m̂ = (1/M) Σ_{i=1}^{M} h̃(z^{(i)}, w^{(i)}) r_{1,µ∗,Σ∗}(z^{(i)}) r_{2,θ∗,η∗}(w^{(i)}) as in (32), where h̃(z, w) is calculated by the analytical form in (42).
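Steps (1)–(2) can be sketched as follows. Writing r_1 and r_2 as explicit density ratios between the original and tilted distributions is an assumed reading of (32), and all function and variable names below are illustrative:

```python
import math
import numpy as np

def mvn_logpdf(z, m, S):
    """Log density of N(m, S) at z."""
    diff = z - m
    d = len(m)
    return -0.5 * (diff @ np.linalg.solve(S, diff)
                   + np.log(np.linalg.det(S)) + d * np.log(2.0 * np.pi))

def gamma_logpdf(x, a, rate):
    """Log density of Gamma(shape a, rate) at x, elementwise."""
    return (a * np.log(rate) + (a - 1.0) * np.log(x)
            - rate * x - np.vectorize(math.lgamma)(a))

def is_estimate(h, mu_s, Sigma_s, theta_s, eta_s, nu, M, rng):
    """Sketch of steps (1)-(2): sample Z and Q from the tilted measures and
    reweight by the likelihood ratios; h(z, w) is the conditional tail
    probability, e.g. from the inverse-FFT step."""
    est = 0.0
    for _ in range(M):
        z = rng.multivariate_normal(mu_s, Sigma_s)          # Z ~ N(mu*, Sigma*)
        q = rng.gamma(shape=nu / 2 - eta_s, scale=1.0 / (0.5 - theta_s))
        w = nu / q                                          # W_j = nu_j / Q_j
        # r1: original N(0, I) density over tilted N(mu*, Sigma*) density at z
        r1 = np.exp(mvn_logpdf(z, np.zeros_like(mu_s), np.eye(len(mu_s)))
                    - mvn_logpdf(z, mu_s, Sigma_s))
        # r2: product over j of original Gamma(nu_j/2, rate 1/2) density over
        #     tilted Gamma(nu_j/2 - eta_j, rate 1/2 - theta_j) density at q_j
        r2 = np.exp(np.sum(gamma_logpdf(q, nu / 2, 0.5)
                           - gamma_logpdf(q, nu / 2 - eta_s, 0.5 - theta_s)))
        est += h(z, w) * r1 * r2
    return est / M
```

With θ∗ = η∗ = 0, µ∗ = 0, and Σ∗ = I, both ratios equal one and the estimator reduces to crude Monte Carlo, which provides a convenient correctness check.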
We proceed to describe the second component: the optimal tilting parameter search. To implement the search phase, we employ the automatic Newton's method (Teng et al., 2016). Based on the results in (19), (20), (24), and (25), we define the functions g_µ(µ), g_Σ(Σ), g_θ(θ), g_η(η) (see Equations (35) and (36)) as
g_µ(µ) = µ − E_{Q̄_{µ,Σ}}[X | X ∈ A], (43)

g_Σ(Σ) = K(µ, Σ) − E_{Q̄_{µ,Σ}}[X⊺(∇_{η_i}M)X | X ∈ A] for i = 1, 2, · · · , d + 1, (44)

g_θ(θ) = [(α_1 − η_1)/(β_1 − θ_1), · · · , (α_{d+1} − η_{d+1})/(β_{d+1} − θ_{d+1})]⊺ − E_{Q̄_{θ,η}}[X], (45)

g_η(η) = [ln β_1 − ln(β_1 − θ_1) + Υ(α_1 − η_1), · · · , ln β_{d+1} − ln(β_{d+1} − θ_{d+1}) + Υ(α_{d+1} − η_{d+1})]⊺ − E_{Q̄_{θ,η}}[ln(βX)], (46)
where ∇_{η_i}M in (44) is defined in (21). To find the optimal tilting parameters, we need to find the roots of the above four equations. By employing Newton's method, the roots of (43), (44), (45), and (46) can be found iteratively via

δ^{(k)} = δ^{(k−1)} − J_{δ^{(k−1)}}^{−1} g_δ(δ^{(k−1)}), (47)
where the Jacobian of g_δ(δ) is defined as

J_δ[i, j] := ∂g_{δ,i}(δ)/∂δ_j. (48)
In (47) and (48), δ can be replaced by µ, Σ, θ, or η, and J_δ^{−1} is the inverse of the matrix J_δ.
In order to measure how close a candidate root is to the solutions of (43), (44), (45), and (46), we define the squared error of g_δ(δ) as

‖g_δ(δ)‖ = g_δ(δ)⊺ g_δ(δ), (49)

and a δ^{(n)} is accepted when ‖g_δ(δ^{(n)})‖ is less than a predetermined precision level ε. The detailed procedures of the second component are described as follows:
• Search for the optimal tilting parameters:
(1) Generate independent samples z^{(i)} from N(0, I) and q_j^{(i)} from Gamma(ν_j/2, 1/2) for i = 1, . . . , M; calculate w_j^{(i)} = ν_j/q_j^{(i)} for j = 1, . . . , d + 1.
(2) Set µ^{(0)}, Σ^{(0)}, θ^{(0)}, and η^{(0)} properly and set k = 1.
(3) Calculate the g_δ^{(0)} functions via (43), (44), (45), and (46).
(4) Calculate J_δ and its inverse matrix for each g_δ function.
(5) Calculate δ^{(k)} = δ^{(k−1)} − J_{δ^{(k−1)}}^{−1} g_δ(δ^{(k−1)}) in (47) for δ = µ, Σ, θ, η.
(6) Calculate the g_δ^{(k)} functions via (43), (44), (45), and (46). If ‖g_δ(δ^{(k)})‖ < ε for all δ ∈ {µ, Σ, θ, η}, stop and output the current parameters; otherwise, set k = k + 1 and return to step (4).
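As a generic illustration of the iteration (47) with stopping rule (49), the following sketch approximates the Jacobian (48) by forward differences and uses a purely illustrative test function; the algorithm above instead evaluates g_δ and J_δ via the sampled expectations in (43)–(46):

```python
import numpy as np

def newton_root(g, x0, eps=1e-12, max_iter=50, h=1e-7):
    """Componentwise Newton iteration mirroring (47)-(49):
    x_k = x_{k-1} - J^{-1} g(x_{k-1}) until g(x)^T g(x) < eps."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        gx = g(x)
        if gx @ gx < eps:                   # squared-error criterion (49)
            return x
        J = np.empty((len(x), len(x)))
        for j in range(len(x)):             # J[i, j] = d g_i / d x_j, cf. (48)
            e = np.zeros_like(x)
            e[j] = h
            J[:, j] = (g(x + e) - gx) / h   # forward-difference column
        x = x - np.linalg.solve(J, gx)      # Newton update (47)
    return x
```

For instance, for the illustrative system g(x) = (x₁² − 4, x₁x₂ − 2) started from (1, 1), the iteration converges to the root (2, 1).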
Remark 2 As a side note, in the above algorithm for the proposed two-parameter exponential tilting, a componentwise Newton's method is adopted to search for the optimal tilting parameters µ, Σ, θ, η, which differs from Algorithm 2 in Teng et al. (2016), which involves only one-parameter tilting for µ. Given the convexity of G(θ, η) and the uniqueness of the optimal tilting parameters, the optimal tilting formula is robust and not sensitive to the initial value.
5 Numerical results
This section demonstrates the capability and performance of the proposed method via an extensive simulation study. The discussion is split into three categories. First, we illustrate the results for a special case of the normal mixture copula model, the t-copula model for single-factor homogeneous portfolios, whose setting is similar to those in Bassamboo et al. (2008) and Chan and Kroese (2010). Second, we compare the performance of the proposed method with naive simulation under 3-factor normal mixture models, in which a t distribution for X_k is considered. Third, to check robustness, we also compare the performance of the proposed method with naive simulation under 3-factor normal mixture models in which a GIG distribution for X_k is considered. The cases with different losses resulting from default of the obligors are also investigated. In all of the following experiments, for each set of parameters, we generate 5,000 samples for locating the optimal tilting parameters and 10,000 samples for calculating the probability of losses.
First, Table 6 tabulates the performance comparison among our method and the methods proposed in Bassamboo et al. (2008) and Chan and Kroese (2010). For comparison purposes, we adopt the same sets of parameter values as those in Table 1 of Bassamboo et al. (2008), where the latent variables X_k in Equation (2) follow a t distribution, i.e., W_1^{-1} = W_2^{-1} = Q_1/ν_1 and Q_1 ∼ Gamma(ν_1/2, 1/2). The model parameters are chosen to be n = 250, ρ_{11} = 0.25, the default threshold for each individual obligor χ_i = 0.5 × √n, each c_i = 1, τ = 250 × b, b = 0.25, and σ_ε = 3. In the table, the results of the exponential change of measure (ECM) proposed in Bassamboo et al. (2008) and of conditional Monte Carlo simulation without and with cross-entropy (CondMC and CondMC-CE, respectively) in Chan and Kroese (2010) are reported. As observed from the table, the proposed algorithm (the last four columns) offers substantial variance reduction compared with naive simulation, and in general it compares favorably with the ECM and CondMC estimators. Moreover, in order to compare fairly with the results of CondMC-CE, we first follow the CondMC method by integrating out the shock variable analytically; then, instead of using the cross-entropy approach, we apply the proposed importance sampling method for variance reduction. Under this setting, the proposed method obtains variance reduction performance comparable with that of CondMC-CE.
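The mixing construction used here (W_1^{-1} = Q_1/ν_1 with Q_1 ∼ Gamma(ν_1/2, 1/2)) can be sanity-checked numerically: √W · N(0, 1) should then be t-distributed with ν_1 degrees of freedom, whose variance is ν_1/(ν_1 − 2). A minimal check (values and seed are illustrative):

```python
import numpy as np

# With Q ~ Gamma(nu/2, rate 1/2) (i.e., scale 2) and W = nu / Q,
# sqrt(W) * N(0, 1) is t-distributed with nu degrees of freedom.
rng = np.random.default_rng(7)
nu, n = 8.0, 500_000
q = rng.gamma(shape=nu / 2.0, scale=2.0, size=n)   # Q ~ Gamma(nu/2, 1/2)
w = nu / q                                         # W^{-1} = Q / nu
x = np.sqrt(w) * rng.standard_normal(n)            # latent t_nu draws
assert abs(x.var() - nu / (nu - 2.0)) < 0.05       # Var(t_8) = 8/6 = 4/3
```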
ECM is from Bassamboo et al. (2008); CondMC and CondMC-CE are from Chan and Kroese (2010); IS and CondMC-IS use the proposed importance sampling.

| ν  | P(Ln > τ)  | ECM V.R. | CondMC V.R. | CondMC-CE V.R. | IS P(Ln > τ) | IS V.R.   | CondMC-IS P(Ln > τ) | CondMC-IS V.R. |
| 4  | 8.13×10^−3 | 65       | 271         | 2,440          | 8.11×10^−3   | 338       | 8.09×10^−3          | 1,600          |
| 8  | 2.42×10^−4 | 878      | 1,690       | 20,656         | 2.36×10^−4   | 6,212     | 2.47×10^−4          | 14,770         |
| 12 | 1.07×10^−5 | 7,331    | 12,980      | 2.08×10^5      | 1.04×10^−5   | 16,100    | 1.10×10^−5          | 1.57×10^5      |
| 16 | 6.16×10^−7 | 52,185   | 81,170      | 1.30×10^6      | 6.34×10^−7   | 2.78×10^5 | 6.20×10^−7          | 1.89×10^6      |
| 20 | 4.38×10^−8 | 301,000  | 4.19×10^5   | 1.27×10^7      | 4.12×10^−8   | 5.44×10^6 | 4.14×10^−8          | 1.61×10^7      |

Table 6: Performance of our algorithm with equal loss resulting from default of the obligors for a one-factor model (t distribution).
Equal loss (c_i = 1)

| b    | P(Ln > τ)  | V.R. factor | ν⃗         | P(Ln > τ)  | V.R. factor | n             | P(Ln > τ)  | V.R. factor |
| 0.3  | 3.08×10^−3 | 863         | (4,4,4,4)  | 3.09×10^−3 | 1,009       | 100           | 1.91×10^−2 | 416         |
| 0.4  | 2.39×10^−4 | 5,931       | (8,6,4,4)  | 3.08×10^−3 | 863         | 250           | 3.08×10^−3 | 863         |
| 0.5  | 2.13×10^−6 | 20,300      | (8,8,8,8)  | 2.97×10^−5 | 1,667       | 400           | 1.17×10^−3 | 563         |

| ρ_ki | P(Ln > τ)  | V.R. factor | ρ̂          | P(Ln > τ)  | V.R. factor | (σ1, σ2, σ3)  | P(Ln > τ)  | V.R. factor |
| 0.1  | 3.08×10^−3 | 863         | −0.5       | 3.06×10^−3 | 1,100       | (0.6,0.4,0.1) | 3.08×10^−3 | 991         |
| 0.3  | 1.89×10^−3 | 945         | 0          | 3.05×10^−3 | 1,156       | (0.8,0.6,0.3) | 3.07×10^−3 | 1,087       |
| 0.5  | 2.76×10^−4 | 1,174       | 0.5        | 3.08×10^−3 | 863         | (1,0.8,0.5)   | 3.08×10^−3 | 863         |

Table 7: Performance of our algorithm with equal loss resulting from default of the obligors for a 3-factor model (t distribution).
Second, Tables 7, 8, and 9 show the results for 3-factor normal mixture copula models, where the latent variables X_k follow a multivariate t distribution, i.e., W_j^{-1} = Q_j/ν_j and Q_j ∼ Gamma(ν_j/2, 1/2) for j = 1, . . . , 4. In the three tables, the results with equal losses (Table 7) and different losses (Tables 8 and 9) resulting from default of the obligors under various parameter settings are listed. Following the settings in Bassamboo et al. (2008) and Chan and Kroese (2010), the threshold for the i-th obligor χ_i is set to 0.5 × √n, the idiosyncratic risk ε_k follows N(0, 9) (i.e., σ_ε = 3), and the total loss threshold τ is set to n × b. The results with base-case model parameters are reported in the gray-background cells in Tables 7, 8, and 9. For the equal-loss scenario (c_i = 1), the model parameter b for the base case is set to 0.3, while for c_i = (⌈2i⌉/n)² and c_i = (⌈5i⌉/n)², b is set to 0.7 and 2, respectively. In addition, the other parameters for the base case are listed as follows: ν⃗ = (8, 6, 4, 4), n = 250, each ρ_ki = 0.1 (for k = 1, . . . , n and i = 1, . . . , d), and the covariance matrix Σ = (u_ij) ∈ R^{3×3} of the multivariate normal distribution Z is set to u_{i,i} = σ_i², u_{i,j} = u_{j,i} = ρ̂ σ_i σ_j, where σ_1 = 1, σ_2 = 0.8, σ_3 = 0.5, ρ̂ = 0.5.
Two different losses (c_i = (⌈2i⌉/n)²)

| b    | P(Ln > τ)  | V.R. factor | ν⃗         | P(Ln > τ)  | V.R. factor | n             | P(Ln > τ)  | V.R. factor |
| 0.7  | 4.79×10^−3 | 832         | (4,4,4,4)  | 4.78×10^−3 | 692         | 100           | 3.02×10^−2 | 325         |
| 1    | 2.91×10^−4 | 3,078       | (8,6,4,4)  | 4.79×10^−3 | 832         | 250           | 4.79×10^−3 | 832         |
| 1.2  | 1.20×10^−5 | 14,471      | (8,8,8,8)  | 8.64×10^−5 | 7,305       | 400           | 1.87×10^−3 | 739         |

| ρ_ki | P(Ln > τ)  | V.R. factor | ρ̂          | P(Ln > τ)  | V.R. factor | (σ1, σ2, σ3)  | P(Ln > τ)  | V.R. factor |
| 0.1  | 4.79×10^−3 | 832         | −0.5       | 4.81×10^−3 | 1,055       | (0.6,0.4,0.1) | 4.84×10^−3 | 700         |
| 0.3  | 2.96×10^−3 | 876         | 0          | 4.80×10^−3 | 745         | (0.8,0.6,0.3) | 4.79×10^−3 | 582         |
| 0.5  | 4.27×10^−4 | 739         | 0.5        | 4.79×10^−3 | 832         | (1,0.8,0.5)   | 4.79×10^−3 | 832         |

Table 8: Performance of our algorithm with 2 different losses resulting from default of the obligors for a 3-factor model with inverse FFT (t distribution).
Five different losses (c_i = (⌈5i⌉/n)²)

| b    | P(Ln > τ)  | V.R. factor | ν⃗         | P(Ln > τ)  | V.R. factor | n             | P(Ln > τ)  | V.R. factor |
| 2    | 2.38×10^−2 | 141         | (4,4,4,4)  | 2.39×10^−2 | 139         | 100           | 1.09×10^−1 | 59          |
| 4    | 8.59×10^−4 | 3,414       | (8,6,4,4)  | 2.38×10^−2 | 141         | 250           | 2.38×10^−2 | 141         |
| 6    | 4.15×10^−7 | 81,498      | (8,8,8,8)  | 1.84×10^−3 | 1,131       | 400           | 9.77×10^−3 | 264         |

| ρ_ki | P(Ln > τ)  | V.R. factor | ρ̂          | P(Ln > τ)  | V.R. factor | (σ1, σ2, σ3)  | P(Ln > τ)  | V.R. factor |
| 0.1  | 2.38×10^−2 | 141         | −0.5       | 2.39×10^−2 | 152         | (0.6,0.4,0.1) | 2.37×10^−2 | 149         |
| 0.3  | 1.51×10^−2 | 183         | 0          | 2.35×10^−2 | 125         | (0.8,0.6,0.3) | 2.40×10^−2 | 148         |
| 0.5  | 2.34×10^−3 | 143         | 0.5        | 2.38×10^−2 | 141         | (1,0.8,0.5)   | 2.38×10^−2 | 141         |

Table 9: Performance of our algorithm with 5 different losses resulting from default of the obligors for a 3-factor model with inverse FFT (t distribution).
Note that both Z and W contribute to the occurrence of the rare event; while CondMC simply ignores the contribution of Z, the proposed method twists the distributions of both Z and W.
Third, in addition to the t-copula model, Table 10 shows the results for another type of 3-factor normal mixture copula model, where W_j follows a special case of the generalized inverse Gaussian (GIG) distribution, i.e., W_j ∼ Gamma(ν_j/2, 1/2) for j = 1, . . . , 4 and ν⃗ = (8, 6, 4, 4); except for b, which is set to 0.28, 0.32, and 0.36, the model parameters are the same as those for the base case in Table 7. Observe from Table 10 that our approach obtains superior performance over naive simulation, which attests to the capability of the proposed algorithm for normal mixture copula models. Moreover, another feature worth noting is that in this setting, tilting the other parameter ν_j/2 of the Gamma distribution (i.e., ν_j/2 → ν_j/2 − η_j) yields 2 to 4 times better performance than the one-parameter exponential tilting (tilting θ) in terms of the variance reduction factors.
|      | Crude MC   |            | IS (tilting θ) |             |             | IS (tilting η) |             |             |
| b    | P(Ln > τ)  | Variance   | P(Ln > τ)      | Variance    | V.R. factor | P(Ln > τ)      | Variance    | V.R. factor |
| 0.28 | 2.04×10^−3 | 2.04×10^−3 | 1.98×10^−3     | 6.99×10^−6  | 291         | 1.98×10^−3     | 2.79×10^−6  | 731         |
| 0.32 | 2.00×10^−4 | 2.00×10^−4 | 1.74×10^−4     | 8.09×10^−8  | 2,473       | 1.75×10^−4     | 3.04×10^−8  | 6,575       |
| 0.36 | 8.00×10^−6 | 8.00×10^−6 | 7.99×10^−6     | 3.77×10^−10 | 21,194      | 7.55×10^−6     | 1.11×10^−10 | 72,352      |

Table 10: Performance of our algorithm with equal loss resulting from default of the obligors for a 3-factor model (symmetric generalized hyperbolic distribution).
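As a quick arithmetic check, the variance reduction factors in Table 10 are consistent with the ratio of the crude variance to the IS variance (assuming that is how the V.R. factor is defined); for the b = 0.28 row:

```python
# Consistency check for the b = 0.28 row of Table 10, assuming
# V.R. factor = (crude-MC variance) / (IS variance).
crude_var = 2.04e-3     # crude variance ~ p(1 - p) ~ p for a rare event
var_theta = 6.99e-6     # IS variance when tilting theta
var_eta = 2.79e-6       # IS variance when tilting eta
vr_theta = crude_var / var_theta   # close to the reported 291
vr_eta = crude_var / var_eta       # close to the reported 731
```

The small discrepancies come from the rounding of the tabulated variances.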
6 Conclusions
This paper studies a factor model with a normal mixture copula that allows multivariate defaults to have an asymmetric distribution. Due to the amount of the portfolio, the heterogeneous effect of obligors, and the phenomena that default events are rare and mutually dependent, it is difficult to calculate portfolio credit risk either by means of direct analysis or crude Monte Carlo simulation. To address this problem, we first propose a general account of a two-parameter importance sampling algorithm. Then, an efficient simulation algorithm is proposed to estimate the probability that the portfolio incurs large losses under the normal mixture copula. We also provide theoretical investigations of the proposed method and illustrate its effectiveness through numerical results.
There are several possible future directions based on this model. To name a few: first, we may consider simulating portfolio loss under meta-elliptical copula and/or Archimedean copula models. Second, although in this paper the default time is fixed, the default time could be any time before a pre-fixed time T. To capture this phenomenon, we could consider a first-passage time model based on the factor model (2); in this case, we would develop an importance sampling algorithm for stopping time events. Third, building on the first-passage time model, it would be more practical to consider the firm value process with credit rating. In such a case, an importance sampling scheme for Markov chains is needed.
Appendix: Proof of Theorem 3.1
To prove Theorem 3.1, we need the following three propositions. Proposition 1 is taken from Theorem VI.3.4 of Ellis (1985), Proposition 2 is a standard result from convex analysis, and Proposition 3 is taken from Theorem 1 of Soriano (1994). Note that although the domain of the function is the whole space in Propositions 2 and 3, the results for a subspace still hold with similar proofs.
Proposition 1 f(θ) is differentiable at θ ∈ int(Θ) if and only if the d partial derivatives ∂f(θ)/∂θ_i, for i = 1, · · · , d, exist at θ ∈ int(Θ) and are finite.
Proposition 2 Let f : R^d → R be continuous on all of R^d. If f is coercive (in the sense that f(x) → ∞ as ‖x‖ → ∞), then a global minimum point of f exists.
Proposition 3 Let f : R^d → R be a continuously differentiable convex function satisfying (13). Then a minimum point of f exists.
<Proof of Theorem 3.1>
In the following, we first show that G(θ, η) is a strictly convex function. For any given λ ∈ (0, 1), and (θ_1, η_1), (θ_2, η_2) ∈ Θ × H, … has a unique minimizer. It is easy to see that ii) implies that the conditions in Proposition 3 hold.
References
Asmussen, S. and P. Glynn (2007). Stochastic Simulation: Algorithms and Analysis. New York: Springer-Verlag.
Barndorff-Nielsen, O. E. (1978). Hyperbolic distributions and distributions on hyperbolae. Scandinavian Journal of Statistics 5(3), 151–157.
Barndorff-Nielsen, O. E. (1997). Normal inverse Gaussian distributions and stochastic volatility modelling. Scandinavian Journal of Statistics 24(1), 1–13.
Bassamboo, A., S. Juneja, and A. Zeevi (2008). Portfolio credit risk with extremal dependence: Asymptotic analysis and efficient simulation. Operations Research 56(3), 593–606.
Botev, Z. I., P. L’Ecuyer, and B. Tuffin (2013). Markov chain importance sampling with applications to rare event probability estimation. Statistics and Computing 23(2), 271–285.
Chan, J. C. and D. P. Kroese (2010). Efficient estimation of large portfolio loss probabilities in t-copula models. European Journal of Operational Research 205(2), 361–367.
Do, K.-A. and P. Hall (1991). On importance resampling for the bootstrap. Biometrika 78(1), 161–167.
Eberlein, E. and U. Keller (1995). Hyperbolic distributions in finance. Bernoulli 1(3), 281–299.
Eberlein, E., U. Keller, and K. Prause (1998). New insights into smile, mispricing, and value at risk: The hyperbolic model. The Journal of Business 71(3), 371–405.
Ellis, R. S. (1985). Entropy, Large Deviations, and Statistical Mechanics. New York: Springer.
Fu, M. and Y. Su (2002). Optimal importance sampling in securities pricing. Journal of Computational Finance 5, 27–50.
Fuh, C.-D. and I. Hu (2004). Efficient importance sampling for events of moderate deviations with applications. Biometrika 91(2), 471–490.
Fuh, C.-D., I. Hu, Y.-H. Hsu, and R.-H. Wang (2011). Efficient simulation of value at risk with heavy-tailed risk factors. Operations Research 59(6), 1395–1406.
Glasserman, P., W. Kang, and P. Shahabuddin (2007). Large deviations in multifactor portfolio credit risk. Mathematical Finance 17(3), 345–379.
Glasserman, P. and J. Li (2005). Importance sampling for portfolio credit risk. Management Science 51(11), 1643–1656.
Gupton, G. M., C. C. Finger, and M. Bhatia (1997). Creditmetrics: Technical Document. JP Morgan & Co.
Li, D. X. (2000). On default correlation: A copula function approach. The Journal of Fixed Income 9(4), 43–54.
Liu, G. (2015). Simulating risk contributions of credit portfolios. Operations Research 63(1), 104–121.
Mashal, R. and A. Zeevi (2002). Beyond correlation: Extreme co-movements between financial assets. Working paper, Columbia University.
McNeil, A. J., R. Frey, and P. Embrechts (2015). Quantitative Risk Management: Concepts, Techniques and Tools. Princeton University Press.
Merton, R. C. (1974). On the pricing of corporate debt: The risk structure of interest rates. The Journal of Finance 29(2), 449–470.
Oberhettinger, F. (2014). Fourier Transforms of Distributions and Their Inverses: A Collection of Tables. Academic Press, Inc.
Rubinstein, R. Y. and D. P. Kroese (2011). Simulation and the Monte Carlo Method, Volume 707. John Wiley & Sons.
Scott, A. and A. Metzler (2015). A general importance sampling algorithm for estimating portfolio loss probabilities in linear factor models. Insurance: Mathematics and Economics 64, 279–293.
Soriano, J. (1994). Extremum points of a convex function. Applied Mathematics and Com-putation 66(2-3), 261–266.
Su, Y. and M. C. Fu (2000). Importance sampling in derivative securities pricing. In Proceedings of the 2000 Winter Simulation Conference, pp. 587–596.