• Tidak ada hasil yang ditemukan

The Markov Chain and Marginal Probabilities of Infection

IMPROVED BOUNDS ON THE EPIDEMIC THRESHOLD OF THE EXACT MODELS

3.2 The Markov Chain and Marginal Probabilities of Infection

Let 𝐺 = (𝑉 , 𝐸) be an arbitrary connected undirected network with 𝑛 nodes, and with adjacency matrix 𝐴. Each node can be in a state of health, represented by β€œ0,”

or a state of infection, represented by β€œ1.” The state of the entire network can be represented by a binary n-tupleπœ‰(𝑑) = (πœ‰1(𝑑),Β· Β· Β· , πœ‰π‘›(𝑑)) ∈ {0,1}𝑛, where each of the entries represents the state of a node at time𝑑, i.e.,𝑖is infected ifπœ‰π‘–(𝑑) =1 and it is healthy ifπœ‰π‘–(𝑑) =0.

Given the current stateπœ‰(𝑑), the infection probability of each node in the next step

is determined independently, and therefore, the transition matrix𝑆of this Markov Chain has elements𝑆𝑋 ,π‘Œ =P(πœ‰(𝑑+1) =π‘Œ|πœ‰(𝑑) =𝑋)of the following form:

P(πœ‰(𝑑+1) =π‘Œ|πœ‰(𝑑) =𝑋) =

𝑛

Γ–

𝑖=1

P(πœ‰π‘–(𝑑+1) =π‘Œπ‘–|πœ‰(𝑑) = 𝑋), (3.1) for any two state vectors𝑋 , π‘Œ ∈ {0,1}𝑛.

As mentioned before, a healthy node can receive infection from any of its infected neighbors independently with probability 𝛽per infected link, and an infected node can recover from the disease with probability𝛿. That is

P(πœ‰π‘–(𝑑+1)=π‘Œπ‘–|πœ‰(𝑑) =𝑋) =









ο£²







ο£³

(1βˆ’π›½)π‘šπ‘– if(𝑋𝑖, π‘Œπ‘–) = (0,0),|π‘π‘–βˆ©S(𝑋) | =π‘šπ‘–, 1βˆ’ (1βˆ’π›½)π‘šπ‘– if(𝑋𝑖, π‘Œπ‘–) = (0,1),|π‘π‘–βˆ©S(𝑋) | =π‘šπ‘–, 𝛿 if(𝑋𝑖, π‘Œπ‘–) = (1,0),|π‘π‘–βˆ©S(𝑋) | =π‘šπ‘–, 1βˆ’π›Ώ if(𝑋𝑖, π‘Œπ‘–) = (1,1),|π‘π‘–βˆ©S(𝑋) | =π‘šπ‘–,

(3.2)

whereS(𝑋)is the support of𝑋 ∈ {0,1}𝑛, i.e.,S(𝑋) ={𝑖: 𝑋𝑖 =1}, and 𝑁𝑖is the set of neighbors of node𝑖.

Eqs. (3.1, 3.2) completely define the 2𝑛×2𝑛transition matrix of the Markov chain, which determines the evolution of the 2𝑛states over time. Of course with this, we have the joint probability of all the nodes, and we can compute the probability of any desired combination by marginalizing out the rest. In particular, one can compute the probability of each node𝑖being infected at time𝑑+1 (denoted by𝑝𝑖(𝑑+1)), which is a function of all joint probabilities of the states at time𝑑. Since there are only𝑛 such variables, the dimension would be significantly reduced if one could β€œbound” or

β€œapproximate” that function by something that includes marginals𝑝𝑖(𝑑)only. This way, we obtain a recursion which relates the marginals at time𝑑+1 to those of time𝑑, and indeed we have a system with only𝑛states rather that 2𝑛states. Approximations per se are not very interesting because they do not provide anyguarantee on the behavior of the exact Markov chain model. What is more important is whether one can obtain a bound on these true probabilities, which can guarantee, for example, fast extinction of the disease. The most common upper-bound, which has been shown to be the tightest linear upper-bound with marginals only (using a linear programming technique) [6, 16] is:

𝑝𝑖(𝑑+1) ≀ (1βˆ’π›Ώ)𝑝𝑖(𝑑) +𝛽 Γ•

π‘—βˆˆπ‘π‘–

𝑝𝑗(𝑑) (3.3)

for all𝑖 = 1, . . . , 𝑛. Defining 𝑝(𝑑) = (𝑝1(𝑑), . . . , 𝑝𝑛(𝑑))𝑇, this can be written in a matrix form as

𝑝(𝑑+1) 𝑀 𝑝(𝑑), (3.4)

where

𝑀 =(1βˆ’π›Ώ)𝐼𝑛+𝛽 𝐴. (3.5)

3.2.1 An Alternative Bounding Technique

The derivation of (3.3) in [6, 16] involves a linear programming technique. In this chapter, we provide an alternative technique to bound the infection probabilities using indicator variables and conditional expectation, which is more intuitive and direct. Importantly, as will be shown later, this technique can be used to obtain tighter bounds on the exact probabilities of infections using pairwise inflectional probabilities. Before that, it is instructive to derive (3.3) using this alternative approach.

Let𝑖 βˆˆπ‘‰. We start by conditioning on the state of the same node 𝑖 at time𝑑, as follows:

𝑝𝑖(𝑑+1) =P(𝑋𝑖(𝑑+1) =1|𝑋𝑖(𝑑) =1)P(𝑋𝑖(𝑑) =1) +P(𝑋𝑖(𝑑+1) =1|𝑋𝑖(𝑑) =0)P(𝑋𝑖(𝑑) =0).

The probability that an infected node remains infected is 1βˆ’π›Ώ, and the probability that a susceptible node does not receive infection from an infected neighbor is 1βˆ’π›½. We denote 𝑗 neighbor of𝑖 by 𝑗 βˆΌπ‘–. The expression above can be written as

𝑝𝑖(𝑑+1) = (1βˆ’π›Ώ)𝑝𝑖(𝑑) +Eπ‘‹βˆ’π‘–(𝑑) |𝑋𝑖(𝑑)=0

1βˆ’Γ–

π‘—βˆΌπ‘–

(1βˆ’ 𝛽1𝑋𝑗(𝑑))

P(𝑋𝑖(𝑑) =0). (3.6) The conditional expectation is on the joint probability of all nodes other than 𝑖 (denoted by π‘‹βˆ’π‘–), given node𝑖being healthy (𝑋𝑖 =0). Of note, this expression is still exact, and we have not done any approximation yet. It can be easily checked that

Γ–

π‘—βˆΌπ‘–

(1βˆ’π›½1𝑋𝑗(𝑑)) β‰₯1βˆ’ 𝛽 Γ•

π‘—βˆΌπ‘–

1𝑋𝑗(𝑑).

Combining this with (3.6) yields the desired upper bound 𝑝𝑖(𝑑+1) ≀ (1βˆ’π›Ώ)𝑝𝑖(𝑑) +𝛽

Γ•

π‘—βˆΌπ‘–

P(𝑋𝑖(𝑑)=0, 𝑋𝑗(𝑑)=1) (3.7)

≀ (1βˆ’π›Ώ)𝑝𝑖(𝑑) +𝛽 Γ•

π‘—βˆΌπ‘–

𝑝𝑗(𝑑).

3.2.2 Connection to Mixing Time of the Markov Chain

Up to this point, we just talked about bounding the marginal probabilities of infection, and it is not clear how a bound on the marginal probabilities relates to the mixing time of the Markov chain. To establish this connection, let us start from the definition of mixing time [126]:

π‘‘π‘šπ‘–π‘₯(πœ–) =min{𝑑 : sup

πœ‡

kπœ‡ π‘†π‘‘βˆ’πœ‹k𝑇 𝑉 ≀ πœ–}, (3.8) where πœ‡is any initial probability distribution defined on the state space, and πœ‹ is the stationary distribution; kπœ‡βˆ’ πœ‡0k𝑇 𝑉 is the total variation distance of any two probability measuresπœ‡andπœ‡0, and is defined by

kπœ‡βˆ’πœ‡0k𝑇 𝑉 = 1 2

Γ•

π‘₯

|πœ‡(π‘₯) βˆ’ πœ‡0(π‘₯) |,

whereπ‘₯is any possible state in the probability space. In factπ‘‘π‘šπ‘–π‘₯(πœ–)is the minimum time instant for which the distance between the stationary distribution and the probability distribution at time𝑑 from any initial distribution is smaller than or equal toπœ–. Roughly speaking, the mixing time measures how fast the initial distribution converges to the limit distribution, which, in our case, means how quickly the epidemic dies out.

Since, in the stationary distribution, the all-healthy state has probability 1, it can be shown [6] that

sup

πœ‡

kπœ‡ π‘†π‘‘βˆ’πœ‹k𝑇 𝑉 =P

some nodes are infected at time𝑑| all nodes were infected at time 0

(3.9) which highlights the fact that the worst initial distribution (i.e., theπœ‡that maximizes above quantity) is the all-infected state. Now, for any𝑑 < π‘‘π‘šπ‘–π‘₯(πœ–) we have

πœ– < P

some nodes are infected at time𝑑|

all nodes were infected at time 0

≀

𝑛

Γ•

𝑖=1

P

node𝑖is infected at time𝑑|

all nodes were infected at time 0

=1𝑇𝑛𝑝(𝑑) given that𝑝(0) =1𝑛, (3.10) where we have used the union bound, and 1𝑛denotes the all-ones vector of size𝑛. Back to the upper-bound on the marginals (3.4), we get 1𝑇𝑛𝑝(𝑑) ≀ 1𝑇𝑛𝑀 𝑝(π‘‘βˆ’ 1).

Furthermore, since 𝑀 has non-negative entries (we write this as𝑀 β‰₯ 0), we can

β€œpropagate” the bound to find that

1𝑇𝑛𝑝(𝑑) ≀ 1𝑇𝑛𝑀 𝑝(π‘‘βˆ’1) ≀1𝑇𝑛𝑀2𝑝(π‘‘βˆ’2) ≀ Β· Β· Β· ≀1𝑇𝑛𝑀𝑑𝑝(0).

As a result, for any𝑑 < π‘‘π‘šπ‘–π‘₯(πœ–)

πœ– < 1𝑇𝑛𝑀𝑑1𝑛 ≀ 𝑛(𝜌(𝑀))𝑑, (3.11) since𝑀 is non-negative and symmetric, andπœ†max(𝑀) = 𝜌(𝑀), where 𝜌(𝑀) is the spectral radius of𝑀.

When𝜌(𝑀) <1 (or equivalently 1βˆ’π›Ώ+π›½πœ†π‘š π‘Žπ‘₯(𝐴) < 1), it follows that𝑑 < log

𝑛 πœ–

βˆ’log𝜌(𝑀)

for all𝑑 < π‘‘π‘šπ‘–π‘₯(πœ–). This implies the well-known result that when 𝛽/𝛿 <1/πœ†max(𝐴) thenπ‘‘π‘šπ‘–π‘₯(πœ–) ≀ log

𝑛 πœ–

βˆ’log𝜌(𝑀) =𝑂(log𝑛).

We should note here that if𝑀was not symmetric (as we will encounter such instances in the next section), it can be shown by an appeal to the Lyapunov equation that if 𝜌(𝑀) <1, then for all𝑑 < π‘‘π‘šπ‘–π‘₯(πœ–), there exists 0< πœ‚ <1 such thatπœ– ≀ πœ‚π‘‘π‘‚(poly(𝑛)), from which it follows directly that the mixing time is logarithmic in𝑛. To see that, note that𝜌(𝑀) < 1 implies that there exists a positive definite matrix𝑃 0 such that𝑀𝑇𝑃 𝑀 βˆ’π‘ƒβ‰Ί 0. Letting𝑃1/2denote the unique positive square root of𝑃and 𝑁 := 𝑃1/2𝑀 π‘ƒβˆ’1/2, it follows easily that𝑁𝑇𝑁 β‰Ί 𝐼𝑑, or equivalentlyπœ‚ :=k𝑁k2 <1.

(Here, k𝑁k2denotes the spectral norm of𝑁.) Defining𝑦:=𝑃1/21𝑛andπ‘₯ :=π‘ƒβˆ’1/21𝑛

we get 1𝑇𝑛𝑀𝑑1𝑛 =π‘₯𝑇𝑁𝑑𝑦 ≀ kπ‘₯k2πœ‚π‘‘k𝑦k2 ≀ π‘›πœ‚π‘‘k𝑃1/2k2kπ‘ƒβˆ’1/2k2. 3.3 Pairwise Probabilities (𝑝𝑖 𝑗)

In Section 3.2, we showed how a bound on marginal probabilities of infection can be obtained, and how this bound translates to the threshold condition for fast mixing of the Markov chain. As mentioned before, the bound (3.4) has been proved to be the tightest linear bound one can get with marginals. A natural idea to improve this bound is to go to higher-order terms (i.e., pairs, triples, etc.). In principle, maintaining higher-order terms is advantageous because it means keeping more information from the original chain, but of course at the cost of increased complexity. We define the pairwise probability of infection of two nodes, in addition to the marginals, as follows. For(𝑖, 𝑗) ∈𝐸,

𝑝𝑖 𝑗(𝑑) :=P(𝑋𝑖(𝑑) =1, 𝑋𝑗(𝑑) =1). Note that out of the 𝑛2

possible pairs of nodes, we only consider the ones that correspond to edges in the graph. Based on this definition, P(𝑋𝑖(𝑑) = 0, 𝑋𝑗(𝑑) = 1)= 𝑝𝑗(𝑑) βˆ’ 𝑝𝑖 𝑗(𝑑), and it follows easily from (3.7) that

𝑝𝑖(𝑑+1) ≀ (1βˆ’π›Ώ)𝑝𝑖(𝑑) +𝛽 Γ•

π‘—βˆΌπ‘–

𝑝𝑗(𝑑) βˆ’π›½ Γ•

π‘—βˆΌπ‘–

𝑝𝑖 𝑗(𝑑). (3.12)

Of course, this bound is at least as tight as the one in (3.3). Now, in order to strictly improve upon the latter, we need a lower bound on the pairwise infection probabilities at time 𝑑+1 in terms of marginals and pairwise probabilities at time𝑑, which is derived next.

3.3.1 A Lower Bound on the 𝑝𝑖 𝑗’s

To construct a lower bound on the pairwise marginal probabilities 𝑝𝑖 𝑗(𝑑+1), we use the same approach as was introduced in Section 3.2.1, but this time applied to pairwise infection probabilities.

Let(𝑖, 𝑗) ∈ 𝐸 and𝑑 β‰₯ 0. We first expand 𝑝𝑖 𝑗(𝑑+1)as follows Γ•

π‘₯∈{0,1}

π‘¦βˆˆ{0,1}

P(𝑋𝑖(𝑑 +1)=1, 𝑋𝑗(𝑑+1)=1, 𝑋𝑖(𝑑) =π‘₯ , 𝑋𝑗(𝑑)= 𝑦).

For convenience, denote each one of the summands above by 𝑠π‘₯ 𝑦. Also, let 𝑐π‘₯ 𝑦 represent the corresponding conditional probabilityP(𝑋𝑖(𝑑+1) = 1, 𝑋𝑗(𝑑 +1) = 1|𝑋𝑖(𝑑) = π‘₯ , 𝑋𝑗(𝑑) = 𝑦). In what follows, we lower bound each one of 𝑠π‘₯ 𝑦’s. We writeEπ‘₯ 𝑦 for the conditional expectationEπ‘‹βˆ’π‘– ,βˆ’π‘—(𝑑) |{𝑋𝑖(𝑑)=π‘₯ , 𝑋𝑗(𝑑)=𝑦}.

β€’(x=0,y=0): Trivially,𝑠00 β‰₯ 0.

β€’(x=0,y=1): As before, the probability of not getting infected from each infected neighbor is (1βˆ’π›½), and the probability that an infected node remains infected is (1βˆ’π›Ώ). Therefore

𝑐01 =E01

(1βˆ’Γ–

π‘˜βˆΌπ‘–

(1βˆ’ 𝛽1π‘‹π‘˜(𝑑))) (1βˆ’π›Ώ) . Since 𝑗 βˆΌπ‘–and 0 ≀ 𝛽 ≀ 1, it follows thatÎ

π‘˜βˆΌπ‘–(1βˆ’π›½1π‘‹π‘˜(𝑑)) ≀ (1βˆ’π›½1𝑋𝑗(𝑑)). Hence 𝑐01 β‰₯ 𝛽(1βˆ’π›Ώ)E011𝑋𝑗(𝑑), which eventually gives

𝑠01 β‰₯ 𝛽(1βˆ’π›Ώ)P(𝑋𝑖(𝑑) =0, 𝑋𝑗(𝑑) =1)

β‰₯ 𝛽(1βˆ’π›Ώ)𝑝𝑗(𝑑) βˆ’π›½(1βˆ’π›Ώ)𝑝𝑖 𝑗(𝑑).

β€’(x=1,y=0): By symmetry, the exact same argument as above implies 𝑠10 β‰₯ 𝛽(1βˆ’π›Ώ)𝑝𝑖(𝑑) βˆ’π›½(1βˆ’π›Ώ)𝑝𝑖 𝑗(𝑑).

β€’(x=1,y=1): Clearly𝑐11 =(1βˆ’π›Ώ)2, which gives 𝑠11 β‰₯ (1βˆ’π›Ώ)2𝑝𝑖 𝑗(𝑑).

Summing all the above terms yields the following lower bound for all(𝑖, 𝑗) ∈ 𝐸and 𝑑 β‰₯ 0:

𝑝𝑖 𝑗(𝑑+1) β‰₯ (1βˆ’π›Ώ)𝛽(𝑝𝑖(𝑑) + 𝑝𝑗(𝑑)) + (1βˆ’π›Ώ) (1βˆ’π›Ώβˆ’2𝛽)𝑝𝑖 𝑗(𝑑). (3.13) 3.3.2 Back to the Mixing Time

In order to express Eqs. (3.12) and (3.13) for all𝑖and 𝑗’s together in a matrix form, recall the definition 𝑝(𝑑) =(𝑝1(𝑑), . . . , 𝑝𝑛(𝑑))𝑇. Further, let us define 𝑝𝐸(𝑑) ∈R|𝐸| as the vector of pairwise infection probabilities, i.e.,𝑝𝐸(𝑑)=vec(𝑝𝑖 𝑗(𝑑) : (𝑖, 𝑗) ∈𝐸).

Note that𝑝𝑖 𝑗(𝑑)= 𝑝𝑗 𝑖(𝑑), so for each edge we only keep track of one of the two terms.

Now we can write Eqs. (3.12), (3.13) as

"

𝑝(𝑑+1)

βˆ’π‘πΈ(𝑑+1)

# 𝑀0

"

𝑝(𝑑)

βˆ’π‘πΈ(𝑑)

#

, (3.14)

The matrix𝑀0, after a little bit of thought, can be expressed in the following way:

𝑀0=

"

(1βˆ’π›Ώ)𝐼𝑛+𝛽 𝐴 𝛽 𝐡

βˆ’(1βˆ’π›Ώ)𝛽 𝐡𝑇 (1βˆ’π›Ώ) (1βˆ’π›Ώβˆ’2𝛽)𝐼|𝐸|

#

, (3.15)

where 𝐡 ∈ R|𝑉|Γ—|𝐸| happens to be the incidence matrix of 𝐺, which is formally defined as

𝐡𝑖,𝑒 =





ο£²



ο£³

1 if𝑖is an endpoint of𝑒, 0 otherwise,

for all𝑖 βˆˆπ‘‰ and𝑒 ∈𝐸.

By accounting for pairwise infection probabilities, the bound derived in (3.14) is tighter when compared to the one in (3.4). In order to connect this to the the mixing time of the underlying Markov chain, observe that

1𝑇𝑛𝑝(𝑑) = h

1𝑇𝑛 0𝑇|𝐸| i

"

𝑝(𝑑)

βˆ’π‘πΈ(𝑑)

# . Applying (3.14) to this gives,

1𝑇𝑛𝑝(𝑑) ≀ h

1𝑇𝑛 0𝑇|𝐸| i

𝑀0

"

𝑝(π‘‘βˆ’1)

βˆ’π‘πΈ(π‘‘βˆ’1)

#

. (3.16)

This step is possible because the entries of the vector h

1𝑇𝑛 0𝑇|𝐸|

i

are all non-negative, which guarantees that the signs of all the𝑛+ |𝐸|inequalities in (3.14) are preserved.

With this note, it becomes clear that in order to be able to propagate the bounds for the remaining time instancesπ‘‘βˆ’2, π‘‘βˆ’3, . . . ,0, a sufficient condition would be

h

1𝑇𝑛 0𝑇|𝐸|i

(𝑀0)𝑑 β‰₯ 0, for all𝑑 β‰₯ 1. (3.17) Provided that (3.17) holds, we can continue with the sequence of bounds after (3.16), which results in

1𝑇𝑛𝑝(𝑑) ≀ h

1𝑇𝑛 0𝑇|𝐸|i (𝑀0)𝑑

"

𝑝(0)

βˆ’π‘πΈ(0)

#

. (3.18)

Subsequently, the same argument as in Section 3.2.2 concludes the following result.

Theorem 15. Assume that(3.17)holds. If 𝜌(𝑀0) < 1, then the mixing time of the Markov chain whose transition matrix 𝑆 is described by Eqs. (3.1) and (3.2) is 𝑂(log𝑛).

From the standard bound with only marginals, it was known that when𝜌(𝑀) <1, the Markov chain is fast-mixing. Now, in addition to that, the above theorem states that when 𝜌(𝑀0) < 1, the Markov chain mixes fast again. Of course, this is informative only when there is a case where 𝜌(𝑀) > 1 but𝜌(𝑀0) < 1. As it will be shown in Section 3.5, this is indeed the case.

Note that, in the proof of Theorem 15, we used the assumption that (3.17) holds.

As will be shown in the simulations section, in many cases this is a reasonable assumption. However, when the assumption does not hold we cannot appeal to this theorem. For this reason, we propose another bound using an alternative pairwise probability, which does not require such a condition.