IMPROVED BOUNDS ON THE EPIDEMIC THRESHOLD OF THE EXACT MODELS
3.2 The Markov Chain and Marginal Probabilities of Infection
Let πΊ = (π , πΈ) be an arbitrary connected undirected network with π nodes, and with adjacency matrix π΄. Each node can be in a state of health, represented by β0,β
or a state of infection, represented by β1.β The state of the entire network can be represented by a binary n-tupleπ(π‘) = (π1(π‘),Β· Β· Β· , ππ(π‘)) β {0,1}π, where each of the entries represents the state of a node at timeπ‘, i.e.,πis infected ifππ(π‘) =1 and it is healthy ifππ(π‘) =0.
Given the current stateπ(π‘), the infection probability of each node in the next step
is determined independently, and therefore, the transition matrixπof this Markov Chain has elementsππ ,π =P(π(π‘+1) =π|π(π‘) =π)of the following form:
P(π(π‘+1) =π|π(π‘) =π) =
π
Γ
π=1
P(ππ(π‘+1) =ππ|π(π‘) = π), (3.1) for any two state vectorsπ , π β {0,1}π.
As mentioned before, a healthy node can receive infection from any of its infected neighbors independently with probability π½per infected link, and an infected node can recover from the disease with probabilityπΏ. That is
P(ππ(π‘+1)=ππ|π(π‘) =π) =




ο£²



ο£³
(1βπ½)ππ if(ππ, ππ) = (0,0),|ππβ©S(π) | =ππ, 1β (1βπ½)ππ if(ππ, ππ) = (0,1),|ππβ©S(π) | =ππ, πΏ if(ππ, ππ) = (1,0),|ππβ©S(π) | =ππ, 1βπΏ if(ππ, ππ) = (1,1),|ππβ©S(π) | =ππ,
(3.2)
whereS(π)is the support ofπ β {0,1}π, i.e.,S(π) ={π: ππ =1}, and ππis the set of neighbors of nodeπ.
Eqs. (3.1, 3.2) completely define the 2πΓ2πtransition matrix of the Markov chain, which determines the evolution of the 2πstates over time. Of course with this, we have the joint probability of all the nodes, and we can compute the probability of any desired combination by marginalizing out the rest. In particular, one can compute the probability of each nodeπbeing infected at timeπ‘+1 (denoted byππ(π‘+1)), which is a function of all joint probabilities of the states at timeπ‘. Since there are onlyπ such variables, the dimension would be significantly reduced if one could βboundβ or
βapproximateβ that function by something that includes marginalsππ(π‘)only. This way, we obtain a recursion which relates the marginals at timeπ‘+1 to those of timeπ‘, and indeed we have a system with onlyπstates rather that 2πstates. Approximations per se are not very interesting because they do not provide anyguarantee on the behavior of the exact Markov chain model. What is more important is whether one can obtain a bound on these true probabilities, which can guarantee, for example, fast extinction of the disease. The most common upper-bound, which has been shown to be the tightest linear upper-bound with marginals only (using a linear programming technique) [6, 16] is:
ππ(π‘+1) β€ (1βπΏ)ππ(π‘) +π½ Γ
πβππ
ππ(π‘) (3.3)
for allπ = 1, . . . , π. Defining π(π‘) = (π1(π‘), . . . , ππ(π‘))π, this can be written in a matrix form as
π(π‘+1) π π(π‘), (3.4)
where
π =(1βπΏ)πΌπ+π½ π΄. (3.5)
3.2.1 An Alternative Bounding Technique
The derivation of (3.3) in [6, 16] involves a linear programming technique. In this chapter, we provide an alternative technique to bound the infection probabilities using indicator variables and conditional expectation, which is more intuitive and direct. Importantly, as will be shown later, this technique can be used to obtain tighter bounds on the exact probabilities of infections using pairwise inflectional probabilities. Before that, it is instructive to derive (3.3) using this alternative approach.
Letπ βπ. We start by conditioning on the state of the same node π at timeπ‘, as follows:
ππ(π‘+1) =P(ππ(π‘+1) =1|ππ(π‘) =1)P(ππ(π‘) =1) +P(ππ(π‘+1) =1|ππ(π‘) =0)P(ππ(π‘) =0).
The probability that an infected node remains infected is 1βπΏ, and the probability that a susceptible node does not receive infection from an infected neighbor is 1βπ½. We denote π neighbor ofπ by π βΌπ. The expression above can be written as
ππ(π‘+1) = (1βπΏ)ππ(π‘) +Eπβπ(π‘) |ππ(π‘)=0
1βΓ
πβΌπ
(1β π½1ππ(π‘))
P(ππ(π‘) =0). (3.6) The conditional expectation is on the joint probability of all nodes other than π (denoted by πβπ), given nodeπbeing healthy (ππ =0). Of note, this expression is still exact, and we have not done any approximation yet. It can be easily checked that
Γ
πβΌπ
(1βπ½1ππ(π‘)) β₯1β π½ Γ
πβΌπ
1ππ(π‘).
Combining this with (3.6) yields the desired upper bound ππ(π‘+1) β€ (1βπΏ)ππ(π‘) +π½
Γ
πβΌπ
P(ππ(π‘)=0, ππ(π‘)=1) (3.7)
β€ (1βπΏ)ππ(π‘) +π½ Γ
πβΌπ
ππ(π‘).
3.2.2 Connection to Mixing Time of the Markov Chain
Up to this point, we just talked about bounding the marginal probabilities of infection, and it is not clear how a bound on the marginal probabilities relates to the mixing time of the Markov chain. To establish this connection, let us start from the definition of mixing time [126]:
π‘πππ₯(π) =min{π‘ : sup
π
kπ ππ‘βπkπ π β€ π}, (3.8) where πis any initial probability distribution defined on the state space, and π is the stationary distribution; kπβ π0kπ π is the total variation distance of any two probability measuresπandπ0, and is defined by
kπβπ0kπ π = 1 2
Γ
π₯
|π(π₯) β π0(π₯) |,
whereπ₯is any possible state in the probability space. In factπ‘πππ₯(π)is the minimum time instant for which the distance between the stationary distribution and the probability distribution at timeπ‘ from any initial distribution is smaller than or equal toπ. Roughly speaking, the mixing time measures how fast the initial distribution converges to the limit distribution, which, in our case, means how quickly the epidemic dies out.
Since, in the stationary distribution, the all-healthy state has probability 1, it can be shown [6] that
sup
π
kπ ππ‘βπkπ π =P
some nodes are infected at timeπ‘| all nodes were infected at time 0
(3.9) which highlights the fact that the worst initial distribution (i.e., theπthat maximizes above quantity) is the all-infected state. Now, for anyπ‘ < π‘πππ₯(π) we have
π < P
some nodes are infected at timeπ‘|
all nodes were infected at time 0
β€
π
Γ
π=1
P
nodeπis infected at timeπ‘|
all nodes were infected at time 0
=1πππ(π‘) given thatπ(0) =1π, (3.10) where we have used the union bound, and 1πdenotes the all-ones vector of sizeπ. Back to the upper-bound on the marginals (3.4), we get 1πππ(π‘) β€ 1πππ π(π‘β 1).
Furthermore, since π has non-negative entries (we write this asπ β₯ 0), we can
βpropagateβ the bound to find that
1πππ(π‘) β€ 1πππ π(π‘β1) β€1πππ2π(π‘β2) β€ Β· Β· Β· β€1ππππ‘π(0).
As a result, for anyπ‘ < π‘πππ₯(π)
π < 1ππππ‘1π β€ π(π(π))π‘, (3.11) sinceπ is non-negative and symmetric, andπmax(π) = π(π), where π(π) is the spectral radius ofπ.
Whenπ(π) <1 (or equivalently 1βπΏ+π½ππ ππ₯(π΄) < 1), it follows thatπ‘ < log
π π
βlogπ(π)
for allπ‘ < π‘πππ₯(π). This implies the well-known result that when π½/πΏ <1/πmax(π΄) thenπ‘πππ₯(π) β€ log
π π
βlogπ(π) =π(logπ).
We should note here that ifπwas not symmetric (as we will encounter such instances in the next section), it can be shown by an appeal to the Lyapunov equation that if π(π) <1, then for allπ‘ < π‘πππ₯(π), there exists 0< π <1 such thatπ β€ ππ‘π(poly(π)), from which it follows directly that the mixing time is logarithmic inπ. To see that, note thatπ(π) < 1 implies that there exists a positive definite matrixπ 0 such thatπππ π βπβΊ 0. Lettingπ1/2denote the unique positive square root ofπand π := π1/2π πβ1/2, it follows easily thatπππ βΊ πΌπ, or equivalentlyπ :=kπk2 <1.
(Here, kπk2denotes the spectral norm ofπ.) Definingπ¦:=π1/21πandπ₯ :=πβ1/21π
we get 1ππππ‘1π =π₯πππ‘π¦ β€ kπ₯k2ππ‘kπ¦k2 β€ πππ‘kπ1/2k2kπβ1/2k2. 3.3 Pairwise Probabilities (ππ π)
In Section 3.2, we showed how a bound on marginal probabilities of infection can be obtained, and how this bound translates to the threshold condition for fast mixing of the Markov chain. As mentioned before, the bound (3.4) has been proved to be the tightest linear bound one can get with marginals. A natural idea to improve this bound is to go to higher-order terms (i.e., pairs, triples, etc.). In principle, maintaining higher-order terms is advantageous because it means keeping more information from the original chain, but of course at the cost of increased complexity. We define the pairwise probability of infection of two nodes, in addition to the marginals, as follows. For(π, π) βπΈ,
ππ π(π‘) :=P(ππ(π‘) =1, ππ(π‘) =1). Note that out of the π2
possible pairs of nodes, we only consider the ones that correspond to edges in the graph. Based on this definition, P(ππ(π‘) = 0, ππ(π‘) = 1)= ππ(π‘) β ππ π(π‘), and it follows easily from (3.7) that
ππ(π‘+1) β€ (1βπΏ)ππ(π‘) +π½ Γ
πβΌπ
ππ(π‘) βπ½ Γ
πβΌπ
ππ π(π‘). (3.12)
Of course, this bound is at least as tight as the one in (3.3). Now, in order to strictly improve upon the latter, we need a lower bound on the pairwise infection probabilities at time π‘+1 in terms of marginals and pairwise probabilities at timeπ‘, which is derived next.
3.3.1 A Lower Bound on the ππ πβs
To construct a lower bound on the pairwise marginal probabilities ππ π(π‘+1), we use the same approach as was introduced in Section 3.2.1, but this time applied to pairwise infection probabilities.
Let(π, π) β πΈ andπ‘ β₯ 0. We first expand ππ π(π‘+1)as follows Γ
π₯β{0,1}
π¦β{0,1}
P(ππ(π‘ +1)=1, ππ(π‘+1)=1, ππ(π‘) =π₯ , ππ(π‘)= π¦).
For convenience, denote each one of the summands above by π π₯ π¦. Also, let ππ₯ π¦ represent the corresponding conditional probabilityP(ππ(π‘+1) = 1, ππ(π‘ +1) = 1|ππ(π‘) = π₯ , ππ(π‘) = π¦). In what follows, we lower bound each one of π π₯ π¦βs. We writeEπ₯ π¦ for the conditional expectationEπβπ ,βπ(π‘) |{ππ(π‘)=π₯ , ππ(π‘)=π¦}.
β’(x=0,y=0): Trivially,π 00 β₯ 0.
β’(x=0,y=1): As before, the probability of not getting infected from each infected neighbor is (1βπ½), and the probability that an infected node remains infected is (1βπΏ). Therefore
π01 =E01
(1βΓ
πβΌπ
(1β π½1ππ(π‘))) (1βπΏ) . Since π βΌπand 0 β€ π½ β€ 1, it follows thatΓ
πβΌπ(1βπ½1ππ(π‘)) β€ (1βπ½1ππ(π‘)). Hence π01 β₯ π½(1βπΏ)E011ππ(π‘), which eventually gives
π 01 β₯ π½(1βπΏ)P(ππ(π‘) =0, ππ(π‘) =1)
β₯ π½(1βπΏ)ππ(π‘) βπ½(1βπΏ)ππ π(π‘).
β’(x=1,y=0): By symmetry, the exact same argument as above implies π 10 β₯ π½(1βπΏ)ππ(π‘) βπ½(1βπΏ)ππ π(π‘).
β’(x=1,y=1): Clearlyπ11 =(1βπΏ)2, which gives π 11 β₯ (1βπΏ)2ππ π(π‘).
Summing all the above terms yields the following lower bound for all(π, π) β πΈand π‘ β₯ 0:
ππ π(π‘+1) β₯ (1βπΏ)π½(ππ(π‘) + ππ(π‘)) + (1βπΏ) (1βπΏβ2π½)ππ π(π‘). (3.13) 3.3.2 Back to the Mixing Time
In order to express Eqs. (3.12) and (3.13) for allπand πβs together in a matrix form, recall the definition π(π‘) =(π1(π‘), . . . , ππ(π‘))π. Further, let us define ππΈ(π‘) βR|πΈ| as the vector of pairwise infection probabilities, i.e.,ππΈ(π‘)=vec(ππ π(π‘) : (π, π) βπΈ).
Note thatππ π(π‘)= ππ π(π‘), so for each edge we only keep track of one of the two terms.
Now we can write Eqs. (3.12), (3.13) as
"
π(π‘+1)
βππΈ(π‘+1)
# π0
"
π(π‘)
βππΈ(π‘)
#
, (3.14)
The matrixπ0, after a little bit of thought, can be expressed in the following way:
π0=
"
(1βπΏ)πΌπ+π½ π΄ π½ π΅
β(1βπΏ)π½ π΅π (1βπΏ) (1βπΏβ2π½)πΌ|πΈ|
#
, (3.15)
where π΅ β R|π|Γ|πΈ| happens to be the incidence matrix of πΊ, which is formally defined as
π΅π,π =


ο£²

ο£³
1 ifπis an endpoint ofπ, 0 otherwise,
for allπ βπ andπ βπΈ.
By accounting for pairwise infection probabilities, the bound derived in (3.14) is tighter when compared to the one in (3.4). In order to connect this to the the mixing time of the underlying Markov chain, observe that
1πππ(π‘) = h
1ππ 0π|πΈ| i
"
π(π‘)
βππΈ(π‘)
# . Applying (3.14) to this gives,
1πππ(π‘) β€ h
1ππ 0π|πΈ| i
π0
"
π(π‘β1)
βππΈ(π‘β1)
#
. (3.16)
This step is possible because the entries of the vector h
1ππ 0π|πΈ|
i
are all non-negative, which guarantees that the signs of all theπ+ |πΈ|inequalities in (3.14) are preserved.
With this note, it becomes clear that in order to be able to propagate the bounds for the remaining time instancesπ‘β2, π‘β3, . . . ,0, a sufficient condition would be
h
1ππ 0π|πΈ|i
(π0)π‘ β₯ 0, for allπ‘ β₯ 1. (3.17) Provided that (3.17) holds, we can continue with the sequence of bounds after (3.16), which results in
1πππ(π‘) β€ h
1ππ 0π|πΈ|i (π0)π‘
"
π(0)
βππΈ(0)
#
. (3.18)
Subsequently, the same argument as in Section 3.2.2 concludes the following result.
Theorem 15. Assume that(3.17)holds. If π(π0) < 1, then the mixing time of the Markov chain whose transition matrix π is described by Eqs. (3.1) and (3.2) is π(logπ).
From the standard bound with only marginals, it was known that whenπ(π) <1, the Markov chain is fast-mixing. Now, in addition to that, the above theorem states that when π(π0) < 1, the Markov chain mixes fast again. Of course, this is informative only when there is a case where π(π) > 1 butπ(π0) < 1. As it will be shown in Section 3.5, this is indeed the case.
Note that, in the proof of Theorem 15, we used the assumption that (3.17) holds.
As will be shown in the simulations section, in many cases this is a reasonable assumption. However, when the assumption does not hold we cannot appeal to this theorem. For this reason, we propose another bound using an alternative pairwise probability, which does not require such a condition.