Chapter VI: Epidemic Spread Mitigation in Population Networks
6.3 CHMM Module: Individual-Level Propagation
\[
\forall e \in \{c_1,\cdots,c_7,d_1,\cdots,d_9\}:\quad \frac{dI_k[e](t)}{dt} = \alpha_{\cdot,k}\, E_k[e](t) - \gamma^{(d)}_{\cdot,k}\, I_k[e](t), \tag{6.7c}
\]
\[
\frac{dR_k(t)}{dt} = \sum_{i=1}^{3} \gamma^{(r)}_{i,k}\, I_k(t), \tag{6.7d}
\]
\[
\frac{dD_k(t)}{dt} = \sum_{e\in\{c_1,\cdots,d_9\}} \gamma^{(d)}_{\cdot,k}\, I_k(t), \tag{6.7e}
\]
where the parameters are defined in (6.6) and the subscript "$\cdot$" in (6.7c) and (6.7e) is replaced with the appropriate index.
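As an illustrative numerical sketch (not part of the model specification), the right-hand sides of (6.7c)-(6.7e) can be evaluated as follows; the container layout, the function name, and the per-label resolution of the "$\cdot$" subscripts are assumptions made only for this example.
\begin{verbatim}
import numpy as np

# Labels of the infected sub-compartments appearing in (6.7c); ordering is illustrative.
LABELS = [f"c{i}" for i in range(1, 8)] + [f"d{i}" for i in range(1, 10)]

def infected_derivatives(E_k, I_k, I_total, alpha, gamma_r, gamma_d):
    """Sketch of the right-hand sides of (6.7c)-(6.7e) for one community k.

    E_k, I_k : dicts keyed by LABELS with the current E_k[e](t), I_k[e](t) values
    I_total  : aggregate I_k(t) used in (6.7d) and (6.7e)
    alpha, gamma_d : dicts keyed by LABELS (the "." subscript resolved per label)
    gamma_r  : length-3 array of recovery rates gamma^(r)_{i,k}
    """
    # (6.7c): dI_k[e]/dt = alpha_{.,k} E_k[e] - gamma^(d)_{.,k} I_k[e]
    dI = {e: alpha[e] * E_k[e] - gamma_d[e] * I_k[e] for e in LABELS}
    # (6.7d): dR_k/dt = sum_i gamma^(r)_{i,k} I_k
    dR = float(np.sum(gamma_r) * I_total)
    # (6.7e): dD_k/dt = sum_e gamma^(d)_{.,k} I_k
    dD = float(sum(gamma_d[e] for e in LABELS) * I_total)
    return dI, dR, dD
\end{verbatim}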
Figure 6.5: A sample coupled hidden Markov model relating the health statuses of three individuals in a community, along with the observed symptoms.
individual $n \in \mathcal{V}$, with entries $O_n(Y_{n,j}(t)\,|\,X_n(t))$ denoting the probability of observing the positive presence of symptom $j$ in individual $n$ at time $t$, i.e., $Y_{n,j}(t) = 1$. Note that each row of this matrix does not necessarily sum to 1, since the entries correspond only to the case $y_{n,j} = 1$. The boldfaced notation for $\mathbf{X}$ and $\mathbf{x}$ also extends to $\mathbf{Y}_n, \mathbf{y}_n$ in the same way, as $\{\mathbf{Y}_n(0{:}t) = \mathbf{y}_n(0{:}t)\} \equiv \{Y_{n,1}(0{:}t) = y_{n,1}(0{:}t),\cdots,Y_{n,B}(0{:}t) = y_{n,B}(0{:}t)\}$.
Assumption 12. For each $n \in \mathcal{V}$, the OPM $O_n$ is given and known.
A sample CHMM of a specific community with three individuals, not strongly connected, is visualized in Figure 6.5. Because contact-tracing data only provides information about the evolution of the observed symptoms over time for a subset of tracked individuals, the transition probabilities among the different phases in $\mathcal{X}$ are unknown. Each individual is assigned a vector of unknown parameters similar to $\theta_k$ for the compartmental model of each community $k \in \{1,\cdots,K\}$. For individual $n \in \mathcal{V}$, the full vector of transition probability parameters is given by $\eta_n(t) := [\beta_n(t), \alpha_n, \gamma_n^{(r)}, \gamma_n^{(d)}]$, and the sparsity pattern of the transition probability matrix (TPM) corresponding to the chain of $n$ is given by
\[
P_n(t) :=
\begin{bmatrix}
1-\beta_n(t) & \beta_n(t) & 0 & 0 & 0 \\
0 & 1-\alpha_n & \alpha_n & 0 & 0 \\
0 & 0 & 1-\gamma_n^{(r)}-\gamma_n^{(d)} & \gamma_n^{(r)} & \gamma_n^{(d)} \\
0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{bmatrix}. \tag{6.8}
\]
Figure 6.6: The underlying Markov chain for a single chain of the CHMM module, using transition probabilities as parameters.
Note that the probability of transitioning from $S$ to $E$ is time-varying because it depends on the time-varying health statuses of the individual's immediate neighbors.
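For concreteness, the sparsity pattern in (6.8) can be assembled numerically as follows. This is only a sketch: the state ordering $(S, E, I, R, D)$ follows (6.8), while the function name and the assumption that $\beta_n(t)$ has already been computed from the neighbors' statuses are illustrative.
\begin{verbatim}
import numpy as np

def single_chain_tpm(beta_t, alpha, gamma_r, gamma_d):
    """Build the TPM P_n(t) of (6.8) for states ordered as (S, E, I, R, D).

    beta_t is the (time-varying) infection probability beta_n(t); the remaining
    arguments are the scalar parameters alpha_n, gamma_n^(r), gamma_n^(d).
    """
    P = np.array([
        [1.0 - beta_t, beta_t,      0.0,                     0.0,     0.0],
        [0.0,          1.0 - alpha, alpha,                   0.0,     0.0],
        [0.0,          0.0,         1.0 - gamma_r - gamma_d, gamma_r, gamma_d],
        [0.0,          0.0,         0.0,                     1.0,     0.0],
        [0.0,          0.0,         0.0,                     0.0,     1.0],
    ])
    assert np.allclose(P.sum(axis=1), 1.0)  # each row is a probability distribution
    return P
\end{verbatim}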
Given a complete sequence $\{y_n(0{:}T_{\mathrm{sim}})\}$ of observed symptoms over some time duration $T_{\mathrm{sim}} > 0$, we address two questions for each individual $n \in \mathcal{V}$. Question 1: how can we estimate the values of the TPM $P_n(t)$ in the CHMM? Question 2: given the TPM estimates $\hat{P}_n(t)$ for all $t \in [0, T_{\mathrm{sim}}]$, how can we estimate the true health status $x_n(0{:}t)$? Both questions can be addressed by extending standard HMM techniques (see, e.g., [65]). For Question 1, the forward-backward algorithm (e.g., [126]) and the Baum-Welch (expectation-maximization) algorithm (e.g., [16]) are standard procedures in the HMM literature which can estimate the transition and observation probabilities in $P_n$ and $O_n$. For the purposes of this application, we make two simultaneous extensions: 1) multiple different time series of observations can be incorporated at once, and 2) the unknown parameters are assumed to be time-varying.
Define $f_{n,j}(t,x) := P(X_n(t)=x, Y_{n,j}^{(t)} = y_{n,j}^{(t)})$ to be the probability that the individual is in state $x \in \mathcal{X}$ at time $t$ and the past observed symptom sequence is given by $Y_{n,j}^{(t)} = y_{n,j}^{(t)}$. Define $b_{n,j}(t,x) := P(Y_{n,j}^{(t+1:T)} = y_{n,j}^{(t+1:T)} \,|\, X_n(t)=x)$ to be the probability of observing a future sequence of symptoms $Y_{n,j}^{(t+1:T)} = y_{n,j}^{(t+1:T)}$ given we know the individual is in state $x$. The recursive equations for $f_{n,j}$ and $b_{n,j}$ are then given by:
\[
f_{n,j}(t,x) = \sum_{z\in\mathcal{X}} f_{n,j}(t-1,z)\, O_n(y_{n,j}(t)\,|\,x)\, \hat{P}_{n,j}^{(t-1)}(z,x), \qquad f_{n,j}(0,x) := q_n(x)\, O_n(y_{n,j}(0)\,|\,x), \tag{6.9a}
\]
\[
b_{n,j}(t,x) = \sum_{z\in\mathcal{X}} b_{n,j}(t+1,z)\, \hat{P}_{n,j}^{(t)}(x,z)\, O_n(y_{n,j}(t+1)\,|\,z), \qquad b_{n,j}(T,x) = 1 \;\; \forall x\in\mathcal{X}. \tag{6.9b}
\]
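A minimal sketch of the recursions (6.9a)-(6.9b) for a single observation sequence is given below. The array layout, the callable observation model, and the list of TPM estimates are assumptions made for illustration only.
\begin{verbatim}
import numpy as np

def forward_backward(obs, O, P_hat, q):
    """Sketch of the forward/backward recursions (6.9a)-(6.9b) for one
    observation sequence j of a single individual n.

    obs   : list of observed symptom values y_{n,j}(0..T)
    O     : function O(y, x) -> observation probability O_n(y | x)
    P_hat : list of TPM estimates, P_hat[t][z, x] ~ estimate of P_{n,j} at time t
    q     : initial state distribution q_n over the |X| states
    """
    T = len(obs) - 1
    S = len(q)
    f = np.zeros((T + 1, S))
    b = np.ones((T + 1, S))           # (6.9b) boundary: b(T, x) = 1 for all x
    for x in range(S):                # (6.9a) boundary: f(0, x) = q(x) O(y(0) | x)
        f[0, x] = q[x] * O(obs[0], x)
    for t in range(1, T + 1):         # forward pass (6.9a)
        for x in range(S):
            f[t, x] = O(obs[t], x) * sum(f[t - 1, z] * P_hat[t - 1][z, x]
                                         for z in range(S))
    for t in range(T - 1, -1, -1):    # backward pass (6.9b)
        for x in range(S):
            b[t, x] = sum(b[t + 1, z] * P_hat[t][x, z] * O(obs[t + 1], z)
                          for z in range(S))
    return f, b
\end{verbatim}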
Given the observation sequence $Y_{n,j}(0{:}T_{\mathrm{sim}}) = y_{n,j}(0{:}T_{\mathrm{sim}})$, define $g_{n,j}(t,x)$ to be the probability that the state of individual $n$ at time $t$ is $x$ given observation sequence $j$, and $h_{n,j}(t,x,z)$ to be the probability that the state of individual $n$ makes a transition from $x$ to $z$ at time $t$:
\[
g_{n,j}(t,x) := P\big(X_n(t)=x \,\big|\, Y_{n,j}(0{:}T_{\mathrm{sim}}) = y_{n,j}(0{:}T_{\mathrm{sim}})\big), \tag{6.10a}
\]
\[
h_{n,j}(t,x,z) := P\big(X_n(t)=x, X_n(t+1)=z \,\big|\, Y_{n,j}(0{:}T_{\mathrm{sim}}) = y_{n,j}(0{:}T_{\mathrm{sim}})\big). \tag{6.10b}
\]
The variables defined in (6.9) allow us to simplify (6.10) beyond their definitions:
\[
g_{n,j}(t,x) = \frac{f_{n,j}(t,x)\, b_{n,j}(t,x)}{\sum_{z\in\mathcal{X}} f_{n,j}(t,z)\, b_{n,j}(t,z)}, \tag{6.11a}
\]
\[
h_{n,j}(t,x,z) = \frac{f_{n,j}(t,x)\, \hat{P}_{n,j}^{(t)}(x,z)\, O_n(y_{n,j}(t+1)\,|\,z)\, b_{n,j}(t+1,z)}{\sum_{u,w\in\mathcal{X}} f_{n,j}(t,u)\, \hat{P}_{n,j}^{(t)}(u,w)\, O_n(y_{n,j}(t+1)\,|\,w)\, b_{n,j}(t+1,w)}. \tag{6.11b}
\]
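Continuing the sketch above, the smoothed quantities (6.11) follow directly from the forward and backward arrays; the normalization assumes the denominators in (6.11) are nonzero, and all names remain illustrative.
\begin{verbatim}
import numpy as np

def smoothed_posteriors(f, b, obs, O, P_hat):
    """Sketch of (6.11a)-(6.11b) built from the forward/backward arrays."""
    T_plus_1, S = f.shape
    T = T_plus_1 - 1
    # (6.11a): g(t, x) is proportional to f(t, x) b(t, x), normalized over x
    g = f * b
    g /= g.sum(axis=1, keepdims=True)
    # (6.11b): h(t, x, z) for t = 0..T-1, normalized over (x, z)
    h = np.zeros((T, S, S))
    for t in range(T):
        for x in range(S):
            for z in range(S):
                h[t, x, z] = (f[t, x] * P_hat[t][x, z]
                              * O(obs[t + 1], z) * b[t + 1, z])
        h[t] /= h[t].sum()
    return g, h
\end{verbatim}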
Note that the expressions in (6.11) depend on the previous estimates of the TPM $\hat{P}_{n,j}^{(0:t-1)}$ for each time $t$; essentially, we recursively build the new estimate $\hat{P}_{n,j}^{(t)}$ from the previous time's estimates. For a single individual $n \in \mathcal{V}$, the TPM $P_n(t)$ can be estimated from a single observation sequence $j \in \{1,\cdots,B\}$ using the standard Baum-Welch algorithm [16]. Define $\hat{\eta}_{n,j}^{(t)}$ to be the estimate of the true parameter vector $\eta_n$ at time $t$ based on observation sequence $j$, and define a corresponding auxiliary function as:
\[
Q_{n,j}(t) := \mathbb{E}\left[\, \log p^{(c)}_{n,j}\big(X_n(0{:}t), Y_{n,j}(0{:}t) \,\big|\, \eta_n(0{:}t)\big) \;\Big|\; y_{n,j}(0{:}T),\, \hat{\eta}_{n,j}^{(0:t)} \right], \tag{6.12}
\]
where $p^{(c)}_{n,j}$ denotes the joint probability distribution of observing a complete set of data $\{x_n(0{:}t), y_{n,j}(0{:}t)\}$ for individual $n$:
\[
p^{(c)}_{n,j}\big(x_n(0{:}t), y_{n,j}(0{:}t) \,\big|\, \eta_n(0{:}t)\big) = q_n(x_n(0)) \prod_{s=0}^{t-1} P_n\big(s, x_n(s), x_n(s+1)\big) \prod_{s=0}^{t} O_n\big(y_{n,j}(s) \,\big|\, x_n(s)\big).
\]
We maximize (6.12) to determine the optimal initial probability distribution $\hat{q}_{n,j}^{(t)}$ and the optimal TPM $\hat{P}_{n,j}^{(t)}$. Note that the maximization must be done subject to the regularity conditions $\sum_{u\in\mathcal{X}} \hat{P}_{n,j}^{(t)}(x,u) = 1$ and $\sum_{x\in\mathcal{X}} \hat{q}_{n,j}^{(t)}(x) = 1$ for all $x \in \mathcal{X}$. The optimal point has the following closed-form expression:
\[
\hat{q}_{n,j}^{(t)}(x) = g_{n,j}(0,x), \tag{6.13a}
\]
\[
\hat{P}_{n,j}^{(t)}(x,z) = \left(\sum_{s=0}^{t-1} h_{n,j}(s,x,z)\right) \left(\sum_{s=0}^{t} g_{n,j}(s,x)\right)^{-1}, \tag{6.13b}
\]
where $g_{n,j}(t,x)$ and $h_{n,j}(t,x,z)$ are defined in (6.10). The procedure is repeated for each $t \in [0, T_{\mathrm{sim}}]$ so that we obtain estimates of $\hat{q}_{n,j}^{(t)}$ and $\hat{P}_{n,j}^{(t)}$ which evolve over time.
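A sketch of the closed-form update (6.13) for one observation sequence $j$ at a fixed time $t$, assuming the arrays $g$ and $h$ have been computed as in (6.11), might look as follows.
\begin{verbatim}
import numpy as np

def baum_welch_update(g, h, t):
    """Sketch of the closed-form maximizer (6.13) at time t for one sequence j.

    g : array g[s, x]     from (6.11a), s = 0..T
    h : array h[s, x, z]  from (6.11b), s = 0..T-1
    """
    q_hat = g[0]                              # (6.13a)
    num = h[:t].sum(axis=0)                   # sum_{s=0}^{t-1} h(s, x, z)
    den = g[:t + 1].sum(axis=0)               # sum_{s=0}^{t}   g(s, x)
    P_hat = num / den[:, None]                # (6.13b), rows indexed by x
    return q_hat, P_hat
\end{verbatim}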
In order to account for time-varying parameters, we apply a discounting factor $a \in (0,1]$ which weights past estimates less the further back in the past they were observed. To aggregate multiple observation sequences into a single estimate, define $w \in \mathbb{R}^B$ to be a vector of weights such that $\sum_{j=1}^{B} w_j = 1$:
\[
\hat{P}_n(t,x,z) = \left(\sum_{j=1}^{B} w_j \sum_{s=0}^{t-1} a^{t-s}\, h_{n,j}(s,x,z)\right) \left(\sum_{j=1}^{B} w_j \sum_{s=0}^{t} a^{t-s}\, g_{n,j}(s,x)\right)^{-1}. \tag{6.14}
\]
The weights are assigned according to two criteria: 1) whether the observation sequences are statistically correlated with each other, and 2) whether one observation sequence yields more information about a state than another, e.g., observing a fever in an individual may be more reflective of an ill state than a runny nose. For simplicity, we assume that these weights are known beforehand and that the observation processes are independent of each other, meaning that the weights are chosen only according to how well they represent the true state.
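A sketch of the discounted, weighted aggregation (6.14) is given below; the stacked array layout over the $B$ observation sequences is an assumption of this example.
\begin{verbatim}
import numpy as np

def aggregate_tpm(g_all, h_all, w, a, t):
    """Sketch of the aggregated TPM estimate (6.14) at time t.

    g_all : array of shape (B, T+1, |X|)       g_{n,j}(s, x)
    h_all : array of shape (B, T,   |X|, |X|)  h_{n,j}(s, x, z)
    w     : length-B weight vector summing to 1
    a     : discount factor in (0, 1]
    """
    B = len(w)
    S = g_all.shape[-1]
    num = np.zeros((S, S))
    den = np.zeros(S)
    for j in range(B):
        for s in range(t):                     # numerator sum: s = 0..t-1
            num += w[j] * a ** (t - s) * h_all[j, s]
        for s in range(t + 1):                 # denominator sum: s = 0..t
            den += w[j] * a ** (t - s) * g_all[j, s]
    return num / den[:, None]                  # \hat P_n(t, x, z)
\end{verbatim}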
Question 2 can be addressed by applying the standard Viterbi algorithm to each separate observation sequence, then aggregating them. Specifically, recall that $X_n(t) \in \mathcal{X}$ refers to the hidden state of individual $n \in \mathcal{V}$, and suppose we are given a time series of observations $Y_{n,j}(0{:}t)$ for symptom $j \in \{1,\cdots,B\}$. The estimated time-varying TPM underlying the HMM is given by $\hat{P}_n(t)$ at time $t$, and the known OPM is given by $O_n$. The initial state is known and given by $X_n(0) = x_n(0)$. Then the standard Viterbi algorithm (e.g., [58]) can be applied to observation sequence $j \in \{1,\cdots,B\}$ to estimate the sequence $\hat{x}_{n,j}(1{:}t)$ of likely hidden states over time based on symptom $j$. The probability of observing some specific sequence of health statuses $x_n(1{:}t)$ for some $t \le T_{\mathrm{sim}}$ is given by:
\[
P\big(\{X_n(t) = x_n(t),\; n\in\mathcal{V},\; t\in[0,T]\}\big) = \prod_{n\in\mathcal{V}} q_n(x_n(0)) \prod_{\substack{t\in[0,T-1] \\ n\in\mathcal{V}}} P\big(X_n(t+1) \,\big|\, X_n(t), \{X_m(t),\, m\in\mathcal{N}(n)\}\big),
\]
where $q_n(x)$ denotes the initial probability that individual $n$ starts in state $x$. Based on the observations of an individual's symptoms, we recursively compute:
\[
\delta_{n,j}(0,x) = q_n(x)\, O_n(y_{n,j}(0)\,|\,x), \qquad \delta_{n,j}(t,x) = \max_{z\in\mathcal{X}} \delta_{n,j}(t-1,z)\, \hat{P}_n(t-1)(z,x)\, O_n(y_{n,j}(t)\,|\,x), \quad t \ge 1.
\]
Then for the specific observation sequence $j$, the optimal sequence of states is given by $\hat{x}_{n,j}(t) := \arg\max_{z\in\mathcal{X}} \delta_{n,j}(t,z)$. Thus, $\hat{x}_{n,j}(t) \in \mathcal{X}$ is the most likely health status of individual $n \in \mathcal{V}$ at time $t \in [0, T_{\mathrm{sim}}]$ given observation process $j \in \{1,\cdots,B\}$. The health status $\hat{x}_n(t)$ determined by considering all observation processes simultaneously is then given by whichever phase in $\mathcal{X}$ occurs most often in the aggregate set $\{\hat{x}_{n,1}(t),\cdots,\hat{x}_{n,B}(t)\}$. Ties are broken according to the state which is more “harmful” to the network, e.g., if the most likely state is tied between susceptible ($S$) and exposed ($E$), then we take the individual to be exposed because (s)he is liable to infect more people in the network.
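The per-sequence recursion for $\delta_{n,j}$ and the majority-vote aggregation can be sketched as follows. The numeric state indexing and the full "harmfulness" ranking used for tie-breaking are assumptions of this example (the text above only fixes that exposed beats susceptible).
\begin{verbatim}
import numpy as np
from collections import Counter

# Assumed state indexing (S, E, I, R, D) = (0, 1, 2, 3, 4) and an assumed
# least-to-most harmful ordering used only for tie-breaking.
HARM_ORDER = [3, 4, 0, 1, 2]   # assumption: R, D, S, E, I (E beats S per the text)

def viterbi_path(obs, O, P_hat, q):
    """Sketch of the delta recursion above for one observation sequence j."""
    T = len(obs) - 1
    S = len(q)
    delta = np.zeros((T + 1, S))
    delta[0] = [q[x] * O(obs[0], x) for x in range(S)]
    for t in range(1, T + 1):
        for x in range(S):
            delta[t, x] = max(delta[t - 1, z] * P_hat[t - 1][z, x]
                              for z in range(S)) * O(obs[t], x)
    return delta.argmax(axis=1)               # \hat x_{n,j}(t) = argmax_z delta(t, z)

def aggregate_states(paths):
    """Majority vote across the B per-symptom paths, ties broken by harmfulness."""
    T = len(paths[0])
    agg = []
    for t in range(T):
        counts = Counter(path[t] for path in paths)
        best = max(counts, key=lambda x: (counts[x], HARM_ORDER.index(x)))
        agg.append(best)
    return agg
\end{verbatim}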
6.3.2 Including Multiple Variants
We can extend the CHMM module to account for variant viruses and mutations in a way similar to what was done for the compartmental ODE module (Section 6.2.3). The unknown probabilities for the CHMM module, expanded to consider multiple strains, are given by
\[
\eta_v(t) := \Big[\, \{\beta_{i,v}(t)\}_{\substack{i\in\{1,\cdots,|A(S)|\} \\ j\in\{1,\cdots,K\}}},\; \{\alpha_{i,v}(t)\}_{i\in\{1,\cdots,|A(S)|+|A(I)|\}},\; \{\gamma^{(r)}_{i,v}\}_{i\in\{1,\cdots,A\}},\; \{\gamma^{(d)}_{i,v}(t)\}_{i\in\{1,\cdots,|A(S)|+|A(I)|\}},\; \{\nu_{i,v}(t)\}_{i\in\{1,\cdots,|A(I)|\}} \,\Big]. \tag{6.15}
\]
Furthermore, the TPM $P_v(t)$ for each $v \in \mathcal{V}$ and time $t \in \mathbb{N}$ is updated similarly to (6.7):
\[
P_v(t) =
\begin{bmatrix}
P_{v,SS}(t) & P_{v,SE}(t) & 0 & 0 & 0 \\
0 & P_{v,EE}(t) & P_{v,EI}(t) & 0 & 0 \\
P_{v,IS}(t) & 0 & P_{v,II}(t) & P_{v,IR}(t) & P_{v,ID}(t) \\
0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 1
\end{bmatrix},
\]
where each of the submatrices $P_{v,\cdot\cdot}(t)$ is defined as follows, using the parameters defined in (6.15):
\[
P_{v,SE}(t) = \mathrm{diag}\Big(\mathrm{diag}\{\beta_{1,v}(t),\cdots,\beta_{A,v}(t)\},\; \mathrm{diag}\{\beta_{A+1,v}(t)+\beta_{A+2,v}(t),\cdots,\beta_{2A-1,v}(t)+\beta_{2A,v}(t)\},\; \sum_{i=2A+1}^{|A(S)|}\beta_{i,v}(t)\Big), \tag{6.16}
\]
\[
P_{v,SS}(t) = I - P_{v,SE}(t),
\]
\[
P_{v,EI}(t) = \mathrm{diag}\{\alpha_{1,v}(t),\cdots,\alpha_{|A(I)|,v}(t)\}, \qquad P_{v,EE}(t) = I - P_{v,EI}(t), \tag{6.17}
\]
\[
P_{v,IS}(t) = \begin{bmatrix} P^{(t)}_{v,IS} & 0_{4\times 12} \end{bmatrix}, \qquad P_{v,IR}(t) = \sum_{i=1}^{A} \gamma^{(r)}_{i,v}, \tag{6.18}
\]
\[
P_{v,ID}(t) = \mathrm{diag}\{\gamma^{(d)}_{1,v}(t),\cdots,\gamma^{(d)}_{|A(I)|,v}(t)\}, \qquad P_{v,II}(t) = I - \sum_{\chi\in\{S,R,D\}} P_{v,I\chi}(t),
\]
where
\[
P^{(t)}_{v,IS} =
\begin{bmatrix}
0_{6\times 3} &
\begin{matrix}
\nu_{1,v}(t) & 0 & \nu_{3,v}(t) & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & \nu_{2,v}(t) & 0 & 0 & \nu_{5,v}(t) & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & \nu_{4,v}(t) & 0 & \nu_{6,v}(t) & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & \nu_{7,v}(t) & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & \nu_{8,v}(t) & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & \nu_{9,v}(t)
\end{matrix}
\end{bmatrix},
\]
and $I - M$ for a rectangular matrix $M \in \mathbb{R}^{n\times m}$ is intended to mean an $n\times n$ matrix whose diagonal elements are 1 minus the row sums of $M$. The procedure for estimating parameters, which was detailed in Section 6.2 (and will be detailed further in Section 6.4), is then used with the multi-strain versions of the ODE and CHMM dynamics.
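A small sketch of the $I - M$ convention just described; treating the off-diagonal entries of the result as zero is an assumption of this example.
\begin{verbatim}
import numpy as np

def identity_minus(M):
    """Sketch of the 'I - M' convention for a rectangular M in R^{n x m}:
    an n x n matrix whose diagonal entries are 1 minus the row sums of M
    (off-diagonal entries taken to be zero, an assumption of this sketch)."""
    return np.diag(1.0 - M.sum(axis=1))
\end{verbatim}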