A Simple Example - The Chinese Generals Problem

5.2 The Chinese Generals Problem

5.2.1 A Simple Example

Before proceeding to the next subsection for a full analysis of the CGP, recall that the definition includes the number of generalsS, their valencyγ, the initial conditionK(0) of the number of generals deciding on A₁, and the two tunable parameters M andT denoting the number of messengers dispatched and the corresponding threshold value, respectively.

A B

C

FIGURE5.8 Three generals

In this section, we analyze a simple example of a CGP where we use fixed values of S=_3,_γ=_1,_M =_{2, and}_T =2. The problem involves three generals labeledA,B, and C, linked in a simple triangular network shown in Figure5.8above. First, att =_{0 each} general can be either inG¹(0) or not, depending on the value of its messagev_i(0). There- fore, we have a total of eight possible message configurations as shown in Figure5.9.

For each of these configurations, the general withv_i(0)=1 can dispatch two messengers each. With equal probability each messenger can then either go to the neighboring general in the clockwise direction, or stay withG_i.

FIGURE5.9 Eight possible message configurations

For v(0) ={^v^A⁽⁰⁾^v^B⁽⁰⁾^v^C⁽⁰⁾}, we have eight equally likely message configurations that can be written as binary strings: {000}, {001}, {010}, {100}, {011}, {101}, {110}, and {111}. Notice that we sort the eight configurations according to the number of 1s in their strings. The reason is quite simple. The triangle in Figure 5.8is symmetrical, which means configurations having the same number of 1s in their strings should be mathe- matically identical.

Let us consider the highlighted configuration where two generals A andB, initially decide onA₁and dispatch two messengers each. Although the messengers are identical, for illustration purpose, we label the messengers atA’s andB’s positions witha1,a2,b1, andb₂. Let us devise a notation with two dots in which the messengers currently atA’s, B’s, andC’s positions are written before the first dot, between the dots, and after the second dot, respectively. Anoat any position indicates that there is no messenger. For the highlighted configuration, the messenger positions att=0 are denoted by:

a1a2.b1b2.o

For the highlighted message configuration shown on Figure5.9, there are 16 possible ways with which the messengers can arrive at their destinations as shown in Figure

5.10below. We call each one of them a messenger configuration. Each dot represents a messenger at that particular position.

FIGURE5.10 Sixteen possible messenger configurations

With the exception of their labels, these messenger configurations are practically identical to the ones produced in the cases where the message configurations have generals B andC in G¹(0), or generalsC and A in G¹(0). Using the same notation, all sixteen messenger configurations are listed below.

a₁a₂.b₁b₂.o a₁a₂.b₁.b₂ a₁a₂.b₂.b₁ a₁a₂.o.b₁b₂ a₁.a₂b₁b₂.o a₁.a₂b₁.b₂ a₁.a₂b₂.b₁ a₁.a₂.b₁b₂ a₂.a₁b₁b₂.o a₂.a₁b₁.b₂ a₂.a₁b₂.b₁ a₂.a₁.b₁b₂ o.a1a2b1b2.o o.a1a2b1.b2 o.a1a2b2.b1 o.a1a2.b1b2

From the above information, we can compute the message configuration at timet= 1. To determinev(t) att=_{1 if}_v(0)={¹¹⁰}), first we have to compute the arrival func- tionsL(0)={^L^A^(0),^L^B^(0),^L^C⁽⁰⁾}that compute the number of messengers at each general at timet =0. Then we apply the threshold functionV on these messenger arrival values to obtain the value ofv(1).

First, we will remove the distinction between all messengers arriving at the same general. This is compatible with Figure5.10, where the messengers are unlabeled by their

source general. To do this, we denote the messengers currently at A’s,B’s, andC’s bya, b, andc. As a shorthand, we denoteaabya²(and likewise withb). In this notation, the dots ando’s are no longer needed and we have a more concise notation:

a²b² a²bc a²bc a²c² ab³ ab²c ab²c abc² ab³ ab²c ab²c abc² b⁴ b³c b³c b²c²

or, algebraically, as a multivariate polynomial where each term and its coefficient corre- sponds to a messenger configuration and its multiplicities,

F2a=_b⁴+_a²_b²+_b²_c²+_a²_c²

+_2a²_bc+_4ab²_c+_2abc²+_2b³_c+_2ab³ _. _(5.2)

Att =0, the configuration is alwaysa²b²becauseG¹⁽⁰⁾={^A,^B}. Suppose after the first round, the configuration becomesab²c.F_2acontains abundant information: there are four ways to achieve this configuration, and the value ofL(0) is{^deg^a^deg^b^deg^c}^, which is{¹²¹}^.

FIGURE5.11 Obtaining the new message configurations from the previous messenger

configurations by applying the threshold

With a little more work, we can get even more information. By applying the threshold functionV withT =_{2 on}L(0), we obtainv(1)={⁰¹⁰}. An element of this string is a zero if the degree of the corresponding variable is less thanT, and a one otherwise. In Figure5.11above, the generalsG_i that satisfy the conditionv_i(1)=1 are marked by the grey circles.

Finally, we can also computeK(0) andK(1) by counting the numbers of ones inv(0) andv(1), which are 2 and 1, respectively. CalculatingK(t) is of interest to us because it shows the evolution ofG¹(t). Of course, in reality we can only compute an average value ofK(1) because the four messengers could just as well choose configurations other than ab²c. The average is:

K(1)=

PiK_i(1)

κ (5.3)

where the indexiruns over all sixteen possible configurations att=1. In the numerator summand, the functionK_i(t) counts the number of ones inv_i(t) from a configurationi at timet. The denumeratorκis simply the total number of configurations, which in our present case is sixteen.

Having discussedG¹⁽⁰⁾={Â,^B}, we can now consider the other possible member- ships ofG¹^(0):{∅}^,{Â}^,{^B}^,{^C}^,{Â,^B}^,{Â,^C}^,{^B^,C}^,{Â,^B^,^C}. To do this, we need a polynomial that is more general thanF_2a.

Consider the following polynomialF(a,b,c;z) (orF(z), for short). The coefficientF_kof z^kcaptures all the possible configurations given that|G¹⁽⁰⁾|=_k_{, i.e.,}_kgenerals initially deciding onA₁. Obviously,F(z) is more general thanF_2a as all the configurations inF_2a

can also be found inF₂.

F(z)=₍₁+_(a+_b)²_{z) (1}+_(b+_c)²_{z) (1}+_(c+_a)²_z)

=₁+_F₁_z+_F₂_z²+_F₃_z³ F_k=_[z^k_]F_(z)= ¹

∂^k

∂z^kF(z)|z=₀

F₁=_2ab+_2bc+_2ca+_2a²+_2b²+_2c² F₂=_a⁴+_b⁴+_c⁴

+_2ab³+_2bc³+_2ca³+_2a³_b+_2b³_c+_2c³_a

+_3a²_b²+_3b²_c²+_3c²_a²+_8a²_bc+_8ab²_c+_8abc² F₃=_10a²_b²_c²+_2a³_b³+_2b³_c³+_2a³_c³

+_6a³_b²_c+_6a³_bc²+_6a²_b³_c+_6a²_bc³ +_6ab³_c²+_6ab²_c³+_2a⁴_bc+_2ab⁴_c+_2abc⁴

+_a⁴_b²+_a²_b⁴+_b⁴_c²+_b²_c⁴+_c⁴_a²+_c²_a⁴ _(X1)

Let us first defineF_kl as the number ofl-th power ofa,b, andc found in the monomials (i.e., terms, or configurations) ofF_k. The index k restricts our count only toF_k, which is described above, whilel restricts the count to only those generals having ex- actlyl messengers:Li(0)=_l. For example, inF2, the formsa²,b², orc²can be found in these terms:

3a²b²+_3b²_c²+_3c²_a²+_8a²_bc+_8ab²_c+_8abc² _.

We can expand the above expression to count the number of monomials of degree 2, which is given by 3×³×²+₃×⁸×¹=42. Therefore,F₂₂=_42.

Having definedF_kl, we can now computeK_k(1), which is the average value of|G¹⁽¹⁾| over all configurations inF_k.

K₁(1)=_F₁₂_/κ₁=_6/(3·²²⁾

=_6/12=_0.5000 K₂(1)=_(F₂₂+_F₂₃+_F₂₄_)/κ₂

=₍₄₂+₁₂+_3)/(3·²⁴⁾

=_57/48=_1.1875 K₃(1)=_(F₃₂+_F₃₃+_F₃₄_)/κ₃

=₍₇₂+₄₈+_12)/(1·²⁶⁾

=_132/64=_2.0625,

or more generally, we can use (x^′∈ G \^x):

K_k(1)= ¹ κ_k

XMk

l≥TF_kl (5.4)

F_kl =^X

x∈G 1 k!

∂^l

∂x^lF(a,b,c;z)|x=_0,x′=₁

κ_k=^¡^S

¢(γ+₁₎^Mk

Consider the case where all three generals are inG¹(0). Not knowing each other’s deci- sions, they send their messengers out to notify their neighbors. FromK_k(1), we predict that|G¹⁽¹⁾|=_2.0625≈2. Likewise, in the next time step|G¹⁽²⁾|=_1.1875 ≈^{1, and fi-} nally, att=3, the generals no longer agree onA₁as|G¹⁽³⁾|=_0.50<1. Therefore, with the givenM,T,γ, andS, a proper consensus reflecting the generals’ initial observations cannot be reached.

In Section5.2.2, we present the general results forγ-regular network for all possible parameter values and show that with an appropriate choice of parameters, a proper con-

sensus can be reached. In preparation for these general results, let us first generalize our triangular network into a ring networkG withSgenerals (shown below with eight generals).

D F

C B H

FIGURE5.12 The networkGwith eight generals

In our analysis ofG, the values ofM,S, andT are no longer fixed. As a result, we can no longer count the results manually and have to resort to the mathematical formalism of multivariate generating function (mgf).

Let us denote the mgf forG by F(x;z), withx ={^xⁱ}={^x⁰^,^x¹^{, . . . ,}^x^S⁻¹}^andⁱ ∈N modS(i.e.,i+_S≡ⁱ ^modS) indexing theSgenerals inG. Denote by [z^k] the operator that extracts the coefficient ofz^kand by [x_i^l] the operator that extracts the coefficient of x_i^l. As before, leti^′denote any member of the index set that is different fromi which is {0, . . . ,S−¹}\ⁱ^.

F(x;z)=^Q^S

i=₁

¡1+_(x_i+_x_i₊₁₎^M_z^¢ F_k(x)=_[z^k_]F_(x;_z)= ¹

∂^k

∂z^kF(z)|z=0

F_kl =^P^S

i=1[x_i^l]F_k(x) (5.5)

=_S¹

∂^k

∂z^k 1 l!

∂^l

∂x_i^lF(x;z)|x_i=_0,x

i′=₁

The formulas forK_k(t) andκ_kare the same as the ones found in (5.4). The summands in (5.5) are identical due to the symmetry ofF(x;z) with respect to x. To derive the explicit formula forF_kl, supposel ≥1. Although counterintuitive, it is easier to start

with [x^l_i] instead of the standard operator [z^k].

F_l(z)=_S_[x^l_i_]F_(x;_z)

=_S ¹

∂^l

∂x^l

F(x;z)|x_i=_0,x

i′=₁

=_S^Q_i_′_∈{_2,S_}^¡₁+_(x_i_′+_x_i_′₊₁₎^M_z^¢|x_i_′=₁×

1 l!

∂^l

∂x^l₁

¡1+_(x_S+_x₁₎^M_z^{¢ ¡}₁+_(x₁+_x₂₎^M_z^¢|x1=₀

Note that in the last equation above, we have chosen to extract thel-th coefficient ofx₁, effectively choosingi =1. Due to symmetry, the expressions for otheri’s are the same, which allows us to use the multiplicative factor ofS.

=₍₁+₂^M_z)^S⁻^{2 1}

∂^l

∂x^l₁

¡1+₍₁+_x₁₎^M_z^¢²|x1=0

=₍₁+₂^M_z)^S⁻²_[x₁^l_](1+₂₍₁+_x₁₎^M_z+₍₁+_x₁₎^2M_z²₎

=₍₁+₂^M_z)^S⁻² h¡2

¢¡M l

¢(1)^M⁻^lz+^¡²

¢¡2M l

¢(1)^2M⁻^lz²i

=^P^S⁻²

i=₀

¡S−2 i

¢2^Mizⁱh 2¡M

¢z+^¡^2M

¢z²i

The value ofx_i′=1 is then substituted, before a tedious algebraic coefficient extraction procedure is run on the remaining polynomial that containsx₁.

F_kl =_[z^k_]F_l_(z)= ¹

∂^k

∂z^kF_l(z)|z=0 (5.6)

=_2S^¡^M

¢¡_S₋₂

k−1

¢2^M^(k⁻¹⁾+_S^¡^2M

¢¡_S₋₂

k−2

¢2^M^(k⁻²⁾

At this point, recall that we haven’t considered the case wherel = 0, which requires a slightly different derivation because the term corresponding tol =0 is the constant term in the polynomial.

The derivation forF_k0, is provided below. Again, we use the symmetry property of F(x;z). Note that at the first glance, the result forF_k0below seems to be missing the¡i M

factor when compared to Equation (5.6). However, recall that¡i M 0

¢≡^{1 for all}^{i M.}

F_l=₀(z)=^P^S

i=₁F(x;z)|x_i=_0,x

i′=₁

=_{S F}_(x;_z)|x_i=0,x_i_′=1

=_S^Q_i_∈{_1,S_}^¡₁+_(x_i+_x_i₊₁₎^M_z^¢|x_i=_0,x

i′=₁

=_S₍₁+₂^M_z)^S⁻²₍₁+_z)²

F_k0=_S_[z^k_](1+₂^M_z)^S⁻²₍₁+_z)² _(5.7)

=_S ¹

∂^k

∂z^k

PS−2 i=₀

¡S−2 i

¢2^Mizⁱ(1+_2z+_z²₎|z=₀

=_S^P²_i₌₀^¡²

¢¡_S₋₂

k−i

¢2^M^(k⁻ⁱ⁾

This last observation suggests a formula forF_kl that is valid forγ=1 and all values of kandl, that can be used in Equation (5.4):

F_kl =

i=₀

S Ãi M

!Ã2 i

!ÃS−² k−ⁱ

2^M(k⁻ⁱ⁾

To conclude our analysis on this example, we consider the other extreme value forλ.

Now, instead of settingλ=1 and working with a ring network, we setλ=_S−^{1 and work} with a complete graph, which is somewhat simpler to analyze. The steps of derivingF_kl are identical:

F(x;z)=^Q^S

i=1(1+₍^P^S

j=1xj)^Mz)

=₍₁+₍^P^S

j=₁x_j)^Mz)^S F_l(z)=_S_[x^l_](1+_(S−¹+_x)^M_z)^S

=_S_[x^l_]^P^S

i=₀

¡_S

¢(S−¹+_x)^Mi_zⁱ

=_S_[x^l_]^P^S

i=₀

¡_S

¢ PMi j=₀

¡_Mi

¢(S−¹⁾^Mi⁻^j^x^j^zⁱ

=_S^P^S

i=₀

¡S i

¢¡Mi l

¢(S−¹⁾^Mi⁻^l^zⁱ F_kl =_S_[z^k_]^P^S_i₌₀^¡^S

¢¡_Mi

¢(S−¹⁾^Mi⁻^l^zⁱ ^(5.8)

=_S^¡^S

¢¡Mk l

¢(S−¹⁾^Mk⁻^l ^.

As we mentioned previously, calculatingK_k(t) is important because the function allows us to learn about the evolution ofG¹(t) and whether consensus is possible under the threshold functionV. In our analysis of the triangular network, we manually calcu- latedK_k(t) for different values ofkand learned that a proper consensus is not possible.

Obviously, for the ring and complete network with variable parameters, a manual and exhaustive analysis ofK_k(t) is not an option. One feasible way would be a numerical evaluation of the values ofK_k(t) as a function of its parameters.

However, all is not lost. LetMkbe the number of messengers dispatched bykgenerals inG¹^{, and}^λ= ^Mk

S . If we fixλandMk ≫1, (5.8) becomes simple, and the qualitative behaviors ofK_k(t) and the consensus become clear.

P_kl = ^F^kl Sκ_k = ^S

¡S k

¢¡Mk l

¢(S−¹⁾^Mk⁻^l S¡_S

¢S^Mk → ^λ

l!e^λ K_k(1)=_S^P^Mk

l≥TP_kl → ^S^P^Mkl≥T λ^l

l!e^λ=_S(1−(T^Γ(T,λ)−1)!)

Perhaps not surprisingly, forMk ≫^{1 and}^γ=_S−1, each term in the summation in Equation (5.4) is the mathematical expression for a Poisson densityP_kl with parameter λ=^Mk

S and argumentl, and thereforeK_k(1) can be expressed in terms of the incomplete

gamma functionΓ.

Figure5.13is a plot ofK_k(t+_{1) against}_K_k(t), which is actually the plot ofK_k(1) against k =|G¹⁽⁰⁾|^for ^M =2 (the lower curve) andM =4 (the upper curve) compared to the diagonal lineK_k(t+₁₎=_K_k(t). For both curves, we have set the parametersγ=_S−^1, T =_{2, and}_S =100. These curves can be used to describe the time evolution ofK_k(t), and the existence of a consensus.

0 20 40 60 80 100

20 40 60 80 100

FIGURE 5.13 The plot ofKk(t+_{1) versus}_K_k_(t₎

For example, supposeK_k(0)=72. For theM =_{2 curve,}_K_k₍₁₎≈^{42, and}^K^k⁽²⁾≈^20, before eventually reachingK_k(t)= 0 (the dashed path with an arrowhead). In other words, all generals would eventually settle onA₀no matter how many of them initially agreed on A₁. In contrast, theM =4 curve intersects the diagonal line atk ≈^{22 and} k≈85. These points define two possible steady-state behaviors ofK_k(t). IfK_k(0)<_22, then as t → ∞^K^k^(t) settles at the stable fixed point atk =0. Otherwise, it settles at k≈^85.

Denote byF(k) the function that mapsK_k(t) toK_k(t+1). The properties ofΓrequire thatF(k)→^{0 as}^k→^{0 and}^F^(k)→^S^as^t → ∞, and in addition, the slopes ^∂F_∂k^(k) → ¹_S and ^∂F_∂k^(k) → _S¹ ^as^t →^{0 and}^t → ∞, respectively. Due to space limitation, we do not analyze the effects of changing M and T on F(k), or proofs to the above claims. We do, however, provide in the next section a sketch of the analysis of errors caused by the generals themselves.

Dalam dokumen Generalized Network Routing Metrics and Algorithms (Halaman 135-147)