Critical Cycles - Critical Cycles

5.3 Slack Matching

5.3.2 Critical Cycles

5.3.2.2 Critical Cycles

Thecommon longest internal cycleis the longest cycle that remains within any single PCHB in the ring.

Ci[n] = σ0[n]σ1[n]σ2[n]σ3[n]σ4[n]σ5[n]

τ_i^† = max

n {δ(C_i[n])} (5.7)

Handshake cyclesoccur between two adjacent (communicating) PCHBs. Thecommon longest handshake cycleis

Ch[n] = ρ1[n]ρ2[n]λ0[n+ 1]ρ3[n]ρ4[n]λ1[n+ 1]

τ_h^† = max

n {δ(Ch[n])} (5.8)

system we have

{n=K} λj {n= (K−1) modN} {n=K} σ_j {n=K}

{n=K} [ρ0|ρ1ρ2|ρ3ρ4] {n= (K+ 1) modN}

Since there are a finite number of edges in any instance ofC, we can assign all possible paths fitting the patternC into one of the following subpatterns:

{n=K} C0 {n=K}

{n=K} C₊₁ {n= (K+ 1) modN} {n=K} C+2 {n= (K+ 2) modN} {n=K} C₋₁ {n= (K−1) modN} {n=K} C₋2 {n= (K−2) modN}

We can now write our general pattern specification for paths with sourcefu_n as

P = (F^∗[C0|C+1|C+2|C₋1|C₋2]^∗)^∗

Paths that fit subpatternCn traverse more than one edge, begin atfuN, and end atfuN+n without traversing any other nodesfum, wherem 6=N andm6=N+n. Paths in subpatternF traverse only one edgeρ0[n].

Given this pattern for traversing an FBI graph, we find the simple cycle in the graph with the largestcycle time, which is defined as the cycle delay divided by number of tokens on the cycle. This section presents an analysis for a general heterogeneous ring of L-R buffers; the following sections make assumptions on the PCHBs to reduce the possible cycles put forth here into the four common cycles described in Section 5.3.2.1.

Case 1: Stationary Cycles {n =K}C₀{n =K}

Any pattern P that includes C₀ must include a cycle since src(C₀) = snk(C₀) = fu_K. Any path in C0 must contain twice the number of edgesρj as it does edgesλj. There are six such paths (all simple cycles) for any PCHB[n]:

C₀⁰[n] = σ0[n]σ1[n]σ2[n]σ3[n]σ4[n]σ5[n]

C₀¹[n] = σ0[n]σ1[n]σ2[n]ρ3[n] ρ4[n]λ1[n+ 1]

C₀²[n] = ρ₁[n]ρ₂[n]λ₀[n]σ₃[n]σ₄[n]σ₅[n]

C₀³[n] = ρ1[n]ρ2[n]λ0[n]ρ3[n]ρ4[n]λ1[n+ 1]

C₀⁴[n] = ρ1[n]ρ2[n]σ1[n+ 1]σ2[n+ 1]σ3[n+ 1]λ1[n+ 1]

C₀⁵[n] = σ₀[n]λ₀[n] ρ₃[n−1]ρ₄[n−1]σ₄[n]σ₅[n]

Analysis of the reset sets in Section 5.3.1.2 shows that all possible internal cyclesC₀contain one token. Thus, the maximum cycle time is as follows:

τC₀ = maxn

δ(C₀^j[n]) : 0≤n<N, 0≤j<6o

(5.9)

Note that τC0 includes all common internal cycles Ci[n] and handshake cycles Ch[n] from Sec- tion 5.3.2.1.

Case 2: Consecutive C-iterations With Direction Change:

[C+1|C+2][C₋1|C₋2] or [C₋1|C₋2][C+1|C+2]

All of the scenarios in this case reduce to stationary cycles. In other words, all eight paths in [C+1|C+2][C₋1|C₋2] and [C₋1|C₋2][C+1|C+2] contain a simple cycle c, where either c ∈ C0

or rot(c)∈C0 (whererot(c) is any rotation of the cycle c). To demonstrate this fact, let us first note that given a pathc∈C, wheresrc(c) =fun, the following edges must be a part ofc:

c∈C₊₁[n] ⇒ ρ₁[n]ρ₂[n]∈c∩(σ₂[n+ 1]∈c∪ρ₄[n]∈c) ∪ ρ₃[n]ρ₄[n]∈c

c∈C₊₂[n] ⇒ ρ₁[n]ρ₂[n]∈c ∩ ρ₃[n+ 1]ρ₄[n+ 1]∈c c∈C₋1[n] ⇒ σ0[n]λ0[n]∈c ∪ σ3[n]λ1[n]∈c c∈C₋2[n] ⇒ σ0[n]λ0[n]σ3[n−1]λ1[n−1]∈c

Now we can consider each possible iteration. Using the statements above to determine whether an edge is in the iteration or not, and the FBI graph to determine the sources and sinks of these edges, we can check for simple cycles by looking for vertices that appear twice in the path:

1. P_C=C₊₁[n]C₋₁[n+ 1] orP_C =C₋₁[n+ 1]C₊₁[n]:

We see thatsnk(ρ₂[n]) =src(λ₀[n+1]),snk(σ₂[n+1]) =src(σ₃[n+1]),src(ρ₃[n]) =snk(λ₀[n+

1]), andsnk(ρ4[n]) =src(λ1[n+ 1]). Therefore a simple cyclec∈PC exists s.t.rot(c)∈C0. 2. PC=C+2[n]C₋2[n+ 2] orPC =C₋2[n+ 2]C+2[n]:

Sincesrc((ρ3[n+ 1]) =snk(λ0[n+ 2]), there exists a simple cyclec∈PC s.t. rot(c)∈C0. 3. PC=C+1[n]C₋2[n+ 1]:

Since snk((ρ₂[n]) = src(λ₀[n+ 1]) and src(ρ₃[n]) = src(σ₃[n]), there exists a simple cycle c∈P_C s.t. rot(c)∈C₀.

4. PC=C₋2[n+ 2]C+1[n]:

Sincesnk((σ2[n+ 1]) =snk(λ0[n+ 2]), andsnk(ρ4[n]) =src(λ1[n+ 1]), there exists a simple cyclec∈PC s.t. rot(c)∈C0.

5. PC=C₋1[n+ 1]C+2[n]:

Since src((λ0[n+ 1]) =snk(ρ2[n]), andsrc(σ3[n+ 1]) =src(ρ3[n+ 1]), there exists a simple cyclec∈P_C s.t. rot(c)∈C₀.

6. P_C=C₊₂[n]C₋₁[n+ 2]:

Sincesrc((ρ3[n+ 1]) =snk(λ0[n+ 2]) andsnk(ρ4[n+ 1]) =src(λ1[n+ 2]), there exists a simple cyclec∈PC s.t. rot(c)∈C0.

Therefore all possible iterations in this case include a simple cycle, and the maximum cycle time for all subpatterns ofC in this case isτ_C₀.

Case 3: Forward-Latency Paths: {n=K}[F|C+1|C+2]{n≥(K+ 1) modN}

All iterations of P in these subpatterns move forward. Cycles can exist only when the forward- latency path spans the entire ring of N PCHBs at least once. Given a source vertex fun, there is only one possible path c ∈ F[n] (ρ0[n]), four possible paths c ∈ C+1[n] (labelled C₊₁⁰ [n] – C₊₁³ [n]), and one possible path c ∈ C₊₂[n] (ρ₁[n]ρ₂[n]σ₁[n+ 1]σ₂[n+ 1]ρ₃[n+ 1]ρ₄[n+ 1]). The number of tokens on each possible path ranges from zero to two, and depends upon whether the PCHBs traversed are initialized to send a value, to receive a value, or neither.

Note that if our pattern consists solely of edges ρ0[n] and P = F^*, then F can be repeated onlyN times before a simple cycle is created. In fact, this is the common forward-latency cycleCf

from Section 5.3.2.1, and sincem messages in the ring result inm initial tokens on edges labelledρ0, (there are m PCHBs that initially send tokens and use Rsnd), the forward-latency cycle-time is therefore commensurate with the previous common cycle definition ofτ_f^†:

1 m

N−1

n=0

δ(f u_n)

Returning to paths including subpatterns other than F, since each possible subpattern in this case moves forward, we can from our FBI graphG=hV,E,Riconstruct a new acyclic graphAF(G) and use it to determine the maximum cycle time for this case. Since forward-latency simple-cycles can traverse the PCHB ring once or twice (depending on whetherN is even or odd), we unroll the ring inAF(G) so that each PCHB has three vertices in the new acyclic graph. This is demonstrated in Figure 5.10.

The PCHB vertices in A_F(G) are connected by edges representing the paths in F[n], C₊₁[n]

and C+2[n]. Each edge is annotated with the delay of and number of tokens on the paths in G.

Thus, givenG=hV, E, Ri, we createAF(G) =hVF, EF, DF, TFi, where

• VF =N× {1,2,3}

[1] C₊₂[2]

F [0] F [1] F [2] F [0] F [1] F [2] F [0] F [1]

C+1[0] C+1[0]

C+1[0] C+1[1] C+1[2] C+1[1] C+1[2] C+1[1]

C₊₂[0] C₊₂[1] C₊₂[2] C₊₂[0] C₊₂ C₊₂[0]

n=0

copy 1

PCHB PCHB

n=2 copy 1

n=2 PCHB

n=0

copy 2 copy 2

PCHB n=1 copy 2 copy 1

PCHB n=0 copy 3

PCHB PCHB

n=1 copy 3

PCHB PCHB

n=2 copy 3 n=1

Figure 5.10: Acyclic forward-latency graphA_F(G).

• E_F ={F, C₊₁⁰ , C₊₁¹ , C₊₁² , C₊₁³ , C₊₂} ×N represents a pathc, where src(c) =f u_n

• DF :EF →N is the delay of the edge’s path

• TF :EF →Nis the number of tokens on the edge’s path

It remains therefore to search A_F(G) to determine the path with source and sink vertices both labelled with the same PCHB index 0 ≤ n < N that has the maximum quotient of total path delay divided by the total number of reset tokens on the path. (Some combinations of successive edges belonging to the classC+1 contain internal cycles and can be ignored by the heuristic.) This quotient is the newly-defined forward-latency cycle-timeτf for the system.

Case 4: Composite Backward-Latency Paths:

{n=K}[C₋₁ | C₋₂ | C₋₂F C₋₂]{n≤(K−1) modN}

First we note that, to avoid simple cycles from the subpatternC₀, the stepF is allowed only if it is between two instances of a path fromC₋2. Any other subpatterns involvingF are left for Case 5.

We therefore create a new class

{n=K}C₋3=C₋2F C₋2{n= (K−3) modN}

We can now rewrite the pattern for this case as [C₋₁|C₋₂|C₋₃]. There are four possible paths c∈ C₋₁, one possible pathc∈C₋₂, and one possible pathC₋₃.

Similarly to what we did for the forward handshake cycles of Case 3, we create an acyclic graph AB(G) from the FBI model in which edges represent paths from either C₋1[n], C₋2[n] or C₋3[n]. Again, it remains to search the acyclic graph to determine the path with matching source

and sink and the largest quotient of path delay divided by number of tokens (i.e., the backward- latency cycle-timeτ_b).

Case 5: Direction Change IncludingF-iterations:

C₋1F or F C₋1 or [F|C+1|C+2]F C₋2orC₋2F[F|C+1|C+2]

Let us begin by considering the two subpatterns that useF andC₋1, as well as the two subpatterns that includeC₋2 and two instances ofF. All of these subpatterns begin and end at the same node, forming a simple cycle. There are five cycles whose rotations create all paths for these subpatterns:

C₋⁰_F[n] = ρ₀[n] σ₀[n+ 1]σ₁[n+ 1]σ₂[n+ 1]σ₃[n+ 1]λ₁[n+ 1]

C₋¹_F[n] = ρ0[n] σ0[n+ 1]λ0[n+ 1]σ3[n]σ4[n]σ5[n]

C₋²_F[n] = ρ0[n] σ0[n+ 1]λ0[n+ 1]ρ3[n]ρ4[n+ 1]λ1[n+ 1]

C₋³_F[n] = ρ₀[n] ρ₁[n+ 1]ρ₂[n+ 2]λ₀[n+ 2]σ₃[n+ 1]λ₁[n+ 1]

C₋⁴_F[n] = ρ0[n] ρ0[n+ 1]σ0[n+ 2]λ0[n+ 2]σ3[n+ 1]λ1[n+ 1]

LetC₋_F be the set containing all of these cycles, all of which contain exactly one token.

All other subpatterns in this case reduce to a rotation of one of the five cycles listed above.

1. P_C =C₊₁[n]F[n+ 1]C₋₂[n+ 2]: Sincesnk((σ₂[n+ 1]) =snk(λ₀[n+ 2]), and snk((ρ₄[n]) = snk(σ₃[n+ 1]), there exists a simple cyclec∈P_C s.t. rot(c)∈C₋_F.

2. P_C=C₋₂[n+ 2]F[n]C₊₁[n+ 1]: Sincesnk((ρ₂[n+ 1]) =snk(σ₀[n+ 2]), andsrc((ρ₃[n+ 1]) = snk(λ0[n+ 2]), there exists a simple cyclec∈PC s.t. rot(c)∈C₋F.

3. PC=C+2[n]F[n+ 2]C₋2[n+ 3]: Sincesnk((ρ4[n]) =snk(σ3[n+ 2]), there exists a simple cycle c∈PC s.t. rot(c)∈C₋F.

4. PC=C₋2[n]F[n−2]C+2[n−1]: Sincesnk((σ0[n]) =snk(ρ2[n−1]), there exists a simple cycle c∈PC s.t. rot(c)∈C₋F.

Thus,

τC_−F = maxn

δ(C₋^j_F[n]) : 0≤n<N, 0≤j<5o

(5.10)

is the maximum cycle time for all subpatterns in this case.

Dalam dokumen High-Level Synthesis and Rapid Prototyping of Asynchronous VLSI Systems (Halaman 130-137)