Suppose that, given đđĄâ1, our algorithm has generated the set đ of disjoint augmentations. Let đâ denote a near optimal set of augmentations satisfying đđ(đđĄâ1â đâ) âĽđâ1
đ+1đđâ, which consists of d disjoint augmentations {đ´1, đ´2, . . , đ´đ }. For each đ´đ, let đđ denote an (arbitrary) endpoint of đ´đ if đ´đ is a path, or an arbitrary node in đ´đ if đ´đ is a cycle. Consider an event đ under our algorithm such that all the following are true:
⍠the set of self-selected seeds equals {đ1, đ2. . . , đđ },
⍠all seed nodes đđ select its intended size as đ đđ§đ(đ´đ), and
⍠the generated set of augmentation đ equals đâ.
Since it is clear that Pr{đ = đâ | đđĄâ1} ⼠Pr{đ | đđĄâ1}, we focus on a lower bound of Pr{đ | đđĄâ1}.
Note that i) the seed selection of {đđ} occurs with probability đđ (1 â đ)đâđ , where đ = |đą|, and ii) the probability that all seeds đđ independently select its intended size of đ đđ§đ(đ´đ) is đâđ . iii) Finally, we compute the probability that the generated set of augmentation đ equals đâ, which
occurs when, at each iteration, each active node selects its corresponding neighbor that belongs to đâ. Each selection has probability đ´â1 , where ÎŁ is the maximum node degree, and total đ selections will be made under the algorithm. Thus, combining the three probabilities, we have that
Pr{đ |đđĄâ1} ⼠đđ (1 â đ)đâđ â đâđâ đ´âV⼠min {1, ( đ 1 â đ)
đ
} (1 â đ đÎŁ )
đ
. (11)
setting đż = min {1, ( đ
1âđ)đ} (1âđ
đÎŁ)đ completes the proof. âĄ
C. PROOF OF LEMMA 2
At time đĄ > 0, define đđĄ= â4|â|2(|â|+1) ln đĄ
Îminđź â. Suppose that a non-near-optimal matching đ â đŽĚ đđź such that đĚđ(đĄ) ⼠đđĄ is scheduled. It also implies đĚđ (đĄ) ⼠đđĄ for all links đ â đ. Also, consider an arbitrary near-optimal matching đâ˛â đŽđđź, and let đ = {đ1, đ2, . . . , đđ´} and đâ˛= {đ1 , đ2 , . . . , đđľ }, where đ´ = |đ| and đľ = |đâ˛|, respectively. We consider the event that the total index sum over links in đ is greater than that over links in đâ˛.
đ{đđĚ đĄ(đ) ⼠đđĚ đĄ(đâ˛)} = đ {â đ¤Ě đđ,đĄ
đ´
đ=1 ⼠â đ¤Ě đ
đâ˛,đĄ đľ
đ=1 }
⤠đ { max
đđĄâ¤đ1,âŚ,đđ´<đĄâ (đ¤Ěđđ,đđ+ â(|â| + 1) ln đĄ đđ
)
đ´ đ=1
⼠max
0<đ1â˛,âŚ,đđľâ˛<đĄâ (đ¤Ěđ
đâ˛,đđâ˛+ â(|â| + 1) ln đĄ đđⲠ)
đľ
đ=1 }
(12)
which is bounded by â ⯠â âđĄđ âŻ
1â˛=1 âđĄđ đ{â¤}
đľâ˛=1 đĄđđ´=1
đĄđ1=1 , where event ⤠is defined as
â (đ¤Ěđđ,đđ+ â(|â| + 1) ln đĄ
đđ )
đ´
đ=1 ⼠â (đ¤Ěđ
đâ˛,đđâ˛+ â(|â| + 1) ln đĄ đđⲠ)
đľ
đ=1 .
For the ease of explanation, we set đ¤Ěđđ,đđ to denote the sample mean of link đđ scheduled đđ times.
We further decompose ⤠into {đ¸đ}, {đšđ} and â as
đ¸đ: đ¤Ěđđ,đđâ â(|â| + 1) ln đĄ
đđ ⼠đ¤đđ (13)
đšđ: đ¤Ěđ
đâ˛đđâ˛+ â(|â| + 1) ln đĄ đđⲠ⤠đ¤đ
đâ˛
â: â đ¤đ
đâ˛
đľ
đ=1 â â đ¤đđ
đ´
đ=1 < 2 â (â(|â| + 1) ln đĄ
đđ )
đ´ đ=1
Note that if đ{⤠} = 1, we should have âđ´đ=1đ{đ¸đ}+ âđľđ=1đ{đšđ}+ đ{â} ⼠1, which can be easily shown by contradiction. The probability of each event of đ¸đ and đšđ can be bounded by the Chernoff-Hoeffding bound as
Pr {đ¤Ěđđ,đđâ â(|â| + 1) ln đĄ
đđ ⼠đ¤đđ } ⤠đĄâ2(|â|+1),
Pr { đ¤Ěđ
đâ˛đđâ˛+ â(|â| + 1) ln đĄ đđⲠ⤠đ¤đ
đⲠ} ⤠đĄâ2(|â|+1),
(14)
Further, we can show Pr{â} = 0 if đđ ⼠â4|â|2(|â|+1) ln đĄ
Îminđź â for all links đđ â đ as follows. Suppose that â occurs. Then we should 0 > â đ¤đ
đâ˛
đľđ=1 â âđ´đ=1đ¤đđâ 2 â (â(|â|+1) ln đĄ đđ )
đ´đ=1 . The right term is no smaller than đź ¡ đđâ â max
đâđŽĚ đđźđđ(đ) â Îđźmin , which equals 0 by definition of Îminđź , resulting in a contradiction.
Using these and taking expectation on (12), we have
Pr{đđĚ đĄ(đ) ⼠đđĚ đĄ(đâ˛)} ⤠â ⯠â â âŻ
đĄ
đ1â˛=1 â Pr{â¤}
đĄ đđľâ˛=1 đĄ
đđ´=1 đĄ
đ1=1
⤠â ⯠â â âŻ
đĄ
đ1â˛=1 â (â Pr{đ¸đ}
đ´
đ=1 + â Pr{đšđ}
đľ
đ=1 )
đĄ đđľâ˛=1 đĄ
đđ´=1 đĄ
đ1=1
⤠đĄđ´+đľâ 2|â|đĄâ2(|â|+1)⤠2|â|đĄâ2.
âĄ
D. PROOF OF PROPOSITION 2
Overall, we show that the number of explorations to non-near optimal matchings is bounded. To this end, we consider a sequence of time points where a non-near-optimal matching is sufficiently played at each point. They serve as a foothold to count the total number of plays of non-near- optimal matchings.
To begin with, for the horizon time T > 0, let đđ = â4|â|2(|â|+1) ln đĄ
Îminđź â, and let đⲠdenote the first time when all non-near-optimal matchings are sufficiently (i.e., more than đđ times) explored, i.e.,
đâ˛= min{đĄ | đĚđ,đĄ ⼠đđ for all đ â đŽĚ đđź}.
(đ) When đâ˛â¤ đ : Let đŽĚ đđź = {đ1, đ2, ⌠, đđ} with đ = |đŽĚ đđź|. Further we define đŽĚ (đĄ) = {đ â đŽĚ đđź | đĚđ,đĄ ⼠đđ}, which is the set of non-near-optimal matchings that are scheduled sufficiently many times by time đĄ , and đŽ(đĄ) = đŽĚ đđźâ đŽĚ (đĄ) denotes the set of not-yet-sufficiently-scheduled non-near- optimal matchings. Also, let đđ denote the time when matching đđ is sufficiently scheduled, i.e., đĚđđ(đđ) = đđ. Without loss of generality, we assume đ1 < đ2< ⯠< đđ= đâ˛.
To apply the decomposition inequality (9), we need to estimate the expected value of âđâđŽĚ đđźđĚđ,đâ˛, which can be written as
â đĚđ,đâ˛
đâđŽĚ đđź = â â đ{đđĄ = đ}
đⲠđĄ=1 đâđŽĚ đđź
= đđđ + â â â đ{đđĄ= đ}
đâđŽĚ (đđ) đđ+1
đĄ=đđ đâ1
đ=1
(15)
Hence, we need to estimate âđâđŽĚ đđźPr{đđĄ= đ} for đĄ â (đđ, đđ+1], which can be obtained as in the following lemma.
LEMMA 5. For each đĄ â (đđ, đđ+1], we have â Pr{đđĄ = đ}
đâđŽĚ đđź ⤠(1 â đż) â â Pr{đđĄâ1= đ}
đâđŽĚ đđź
(16) + Pr{đđĄâ1â đŽ(đđ)} + (|đŽĚ (đđ)| + đż) â 2|â|đĄâ2
Proof. We first divide the case into three exclusive sub-cases based on the previous schedule đđĄâ1: events đ¸ = {đđĄâ1â đŽđđź}, đš = {đđĄâ1â đŽ(đđ)} and â = {đđĄâ1 â đŽĚ (đđ)}, and. Then we have â Pr{đđĄ = đ}
đâđŽĚ (đđ)
= â Pr{đđĄ = đ | đ¸} â Pr{đ¸}
đâđŽĚ (đđ) (17)
+ â Pr{đđĄ= đ | đš}
đâđŽĚ (đđ) â Pr{đš} (18)
+ â Pr{đđĄ = đ | â} â Pr{â}
đâđŽĚ (đđ) (19)
Let đđĄ denote the set of augmentations chosen under our algorithm at time đĄ. We can obtain a bound on (17) as
â Pr{đđĄ= đ | đ¸} â Pr{đ¸}
đâđŽĚ (đđ)
.
⤠â Pr{đđĚ đĄ(đ) ⼠đđĚ đĄ(đđĄâ1) | đ¸} â Pr{đ¸}
đâđŽĚ (đđ)
⤠|đŽĚ (đđ)| â 2|â|đĄâ2 (20)
where the last inequality comes from Lemma 2. The result holds for all đĄ â (đđ, đđ+1]. For the second term (18), we have
â Pr{đđĄ = đ | đš}
đâđŽĚ (đđ) â Pr{đš} ⤠Pr{đš} (21)
Finally, the third term (19) denotes the probability to transit from a sufficiently-played non-near- optimal matching to a sufficiently-played non-near-optimal matching, and thus we have
â Pr{đđĄ = đ | â} â Pr{â}
đâđŽĚ (đđ)
⤠â Pr{đđĄâ đŽĚ (đđ) | đđĄâ1 = đ} â Pr{đđĄâ1= đ}
đâđŽĚ (đđ) .
Letting đâ˛= đ â đđĄ, the conditional probability can be derived as Pr{đđĄ â đŽĚ (đđ) | đđĄâ1= đ}
⤠Pr{đđĄ â đŽĚ (đđ) | đđĄâ1 = đ, đâ˛â đŽđđź} â Pr{đⲠâ đŽđđź} + Pr{đⲠâ đŽĚ đđź}
⤠Pr{đđĄ â đŽĚ (đđ) | đđĄâ1 = đ, đâ˛â đŽđđź} â đż + 1 â đż
⤠Pr{đđĚ đĄ(đ) ⼠đđĚ đĄ(đđĄâ1) | đ â đŽĚ đđź, đâ˛â đŽđđź} â đż + 1 â đż
⤠2|â|đĄâ2â đż + 1 â đż.
where the second inequality comes from Proposition 1, the equality holds since đđĄ should be đ (otherwise, đđĄ = (đ â đđĄ ) â đŽĚ (đđ)) and thus đ should have the larger weight sum to be chosen by the augmentation algorithm, and the last inequality comes from Lemma 2. Hence, the third term (19) can be upper bounded by
â Pr{đđĄâ1= đ} â (2đż|â|đĄâ2+ 1 â đż)
đâđŽĚ (đđ)
⤠2đż|â|đĄâ2+ (1 â đż) â Pr{đđĄâ1 = đ}
đâđŽĚ (đđ) ,
(22)
for all đĄ â (đđ, đđ+1].
The result can be obtained by combining (20), (21), and (22). âĄ
In order to apply Lemma 5 to (15), we rewrite it in a recursive form. Let đ = 1 â đż, đťđ= (|đŽĚ (đđ)| + đż) â 2|â| = (đ + đż) ¡ 2|â|, and đŠđ(đĄ) = âđâđŽĚ (đđ)Pr{đđĄ = đ}. We have a recursive form of (16) as
đŠđ(đĄ) ⤠đđĄâđđđŠđ(đđ) (23) +đťđâ đđĄâđđâ2
đĄ đ=đđ+1
(24)
+ â đđĄâđâ
đĄ
đ=đđ+1 Pr{đđĄâ1â đŽ(đđ)}. (25) By summing it over đĄ â (đđ, đđ+1] on the both sides, we obtain the following lemma.
LEMMA 6. The total number of times that sufficiently played non-near-optimal matchings are selected during (đđ, đđ+1] is bounded by
â đŠđ(đĄ)
đđ+1 đ=đđ+1
â¤1
đż(1 +đ2
6 đťđ+ đź [â đđ,đ+1
đâđŽ(đđ) ]), (26)
where đđ,đ+1 denote the number of time slots that đ is scheduled in (đđ, đđ+1].
The proof of Lemma 6 follows the same line of analysis as [17] and included for the completion.
Proof. By summing up (25) over đĄ â (đđ, đđ+1], we have
â â đđĄâđâ
đĄ
đ=đđ+1 Pr{đđĄâ1â đŽ(đđ)}
đđ+1 đĄ=đđ
= â Pr{đđĄâ1â đŽ(đđ)}
đđ+1
đĄ=đđ â (â đđ
đđ+1âđĄ
đ=0 )
⤠1
1 â đđź [â đ{đđĄâ1â đŽ(đđ)}
đđ+1
đĄ=đđ ]
=1
đżđź [â đđ,đ+1
đâđŽ(đđ) ], (27)
where the inequality holds because âđđ=0đ+1âđĄđđ ⤠1
1âđ, Pr{đđĄâ1 â đŽ(đđ)} = đź[đ{đđĄâ1â đŽ(đđ)}], and đ{đđđâ đŽ(đđ)} = 0 since the matching scheduled at đđ does not belong to đŽ(đđ) by definition.
Also, by summing up (24) over (đđ, đđ+1], we have
â đťđâ đđĄâđâ
đĄ đ=đđ+1
đâ2⤠đťđâ 1 đżâ đ2
6 .
đđ+1 đĄ=đđ
(28) Similarly, we take the sum of (23) as
â đđĄâđđđŠđ(đđ)
đđ+1 đĄ=đđ
⤠â đđĄâđđ
đđ+1 đĄ=đđ
â¤1
đż (29)
Combining (27), (28), and (29), we obtain the result. âĄ
Now, by taking expectation on (15), we can obtain the expected total number of times that non- near optimal matchings are selected up to time đâ˛(⤠đ) as
â đź[đĚđ(đâ˛)]
đâđŽĚ đđź = đđđ + â âđ đŠđ(đĄ)
đ+1
đĄ=đđ đâ1
đ=1
⤠đđđ +1
đżâ (1 +đťđđ2
6 + đź [â đđ,đ+1
đâđŽ(đđ) ])
đâ1 đ=1
⤠đđđ +đ
đż (1 +|â|đżđ2
6 +|â|(đ â 1)đ2
6 + đđ).
The last inequality holds since (i) âđâ1đ=1đťđ= âđâ1đ=1(đ + đż) â 2|â| ⤠đ â |â| â (2đż + (đ â 1)), and (ii) âđâđŽ(đđ)đđ,đ+1 is the total number that the matchings that have been chosen less than đđ up to đđ are chosen during (đđ, đđ+1] and thus results in âđâ1âđâđŽ(đđ)đđ,đ+1
đ=1 ⤠âđđ=2đđ ⤠đđđ.
From đ ⤠|đŽ| â 1, we have
â đź[đĚđ(đâ˛)]
đâđŽĚ đđź ⤠đˇ1â ln đ
(Îminđź )2+|đŽ| â 1
đż (1 +|â|đżđ2
6 +|â|(|đŽ| â 2)đ2
6 )
(30) where đˇ1= (1 +1
đż) â 4|â|2(|â| + 1) â (|đŽ| â 1).
This provides a bound on the number of times that non-near optimal matchings are selected up to đâ˛. For the rest time, đĄ â (đâ˛, đ], we need to compute âđđĄ=đâ˛Pr{đđĄ â đŽĚ đđź}. Let đⲠ= đđĄâ1â đđĄ. Since next schedule đđĄ is either đđĄâ1 and đⲠunder the algorithm, we divide the event {đđĄâ đŽĚ đđź} into three sub-cases based on đđĄâ1 and đâ˛, and compute the probability as
Pr{đđĄ â đŽĚ đđź} = Pr{đâ˛â đŽĚ đđź, đđĄâ1â đŽĚ đđź}
+ Pr{đⲠâ đŽĚ đđź, đđĄâ1 â đŽđđź, đđ(đâ˛) ⼠đđ(đđĄâ1)}
+ Pr{đⲠâ đŽđđź, đđĄâ1 â đŽĚ đđź, đđ(đâ˛) ⤠đđ(đđĄâ1)}
This leads to
Pr{đđĄ â đŽĚ đđź} ⤠Pr{đâ˛â đŽĚ đđź | đđĄâ1 â đŽĚ đđź} â Pr {đđĄâ1â đŽĚ đđź}
+ Pr{đđ(đâ˛) ⼠đđ(đđĄâ1) | đⲠâ đŽĚ đđź, đđĄâ1â đŽđđź} + Pr{đđ(đâ˛) ⤠đđ(đđĄâ1) | đâ˛â đŽđđź, đđĄâ1â đŽĚ đđź}.
From Proposition 1, we have Pr{đâ˛â đŽĚ đđź | đđĄâ1â đŽĚ đđź} = 1 â Pr{đâ˛â đŽđđź | đđĄâ1â đŽĚ đđź} ⤠1 â đż = đ. Since đĚđ(đĄ) ⼠đđ for all đ and đĄ â (đâ˛, đ], Lemma 2 provides an upper bound 2|â|đĄâ2 on each conditional probability in the second and the third terms. As a result, we can obtain
Pr{đđĄ â đŽĚ đđź} ⤠đ Pr{đđĄâ1â đŽĚ đđź} + 4|â|đĄâ2. By extending the inequality in a recursive manner down tođâ˛, we obtain that
Pr{đđĄâ đŽĚ đđź} ⤠đđĄâđâ˛Pr{đđⲠâ đŽĚ đđź} + 4|â| â đđĄâđđâ2
đĄ đĄ=đâ˛+1
= đđĄâđâ˛+4|â| â đđĄâđđâ2
đĄ đĄ=đâ˛+1
where the last equality holds since Pr{đđⲠâ đŽĚ đđź} = 1 from the definition of đâ˛. Summing over đĄ â (đâ˛, đ] on the both sides, and from đ = 1 â đż, we have
â Pr{đđĄ â đŽĚ đđź}
đ
đĄ=đâ˛+1 â¤1 â đż
đż +2|â|đ2
3đż . (31)