Chapter V: Data-driven Approach in the Noiseless Case
5.5 Distributed and Localized Algorithm for Data-driven MPC
amount of data needed to parametrize the behavior of the system does not scale with the size of the network, but rather with $d$, the size of the localized region, which is usually much smaller than the network size $N$. To the best of our knowledge, this is the first such result. As we show next, this observation proves key in extending data-driven SLS to the distributed setting.
Definition 12. Given the partition $\{r_1, \dots, r_p\}$, a functional/set is row-wise separable if:
• for a functional, $f(\Phi) = \sum_{j=1}^{p} f_j\big(\Phi(r_j,:)\big)$ for some functionals $f_j$ for $j = 1, \dots, p$.
• for a set, $\Phi \in \mathcal{P}$ if and only if $\Phi(r_j,:) \in \mathcal{P}_j \ \forall j$ for some sets $\mathcal{P}_j$ for $j = 1, \dots, p$.
An analogous definition exists for column-wise separable functionals and sets, where the partition $\{c_1, \dots, c_p\}$ entails the columns of $\Phi$, i.e., $\Phi(:, c_j)$.
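To make Definition 12 concrete, the short sketch below (illustrative code with an arbitrary matrix and partition, not part of the formulation) checks numerically that the squared Frobenius-norm functional is row-wise separable: evaluating it block-by-block over a row partition gives the same value as evaluating it on the full matrix.

```python
import numpy as np

# Illustrative check of row-wise separability (Definition 12): the functional
# f(Phi) = ||Phi||_F^2 satisfies f(Phi) = sum_j f_j(Phi(r_j, :)), with each
# f_j the squared Frobenius norm of the corresponding row block.
rng = np.random.default_rng(0)
Phi = rng.standard_normal((6, 4))

row_partition = [[0, 1], [2, 3], [4, 5]]  # a partition {r_1, r_2, r_3} of the rows

f_full = np.linalg.norm(Phi, "fro") ** 2
f_blockwise = sum(np.linalg.norm(Phi[r, :], "fro") ** 2 for r in row_partition)
print(np.isclose(f_full, f_blockwise))  # prints True
```

The same block-wise evaluation fails for functionals that couple rows, e.g. a norm of a product mixing all rows, which is why Assumption 1 restricts the cost structure.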
By Assumption 1, the objective function and the safety/saturation constraints in equation (5.11) are row-wise separable in terms of $\Phi$. At the same time, the achievability and locality constraints (5.9), (5.10) are column-wise separable in terms of $G$. Hence, the data-driven MPC subroutine becomes:
$$
\begin{aligned}
\underset{\Phi,\,\Psi,\,\{G_i\}_{i=1}^{N}}{\text{minimize}}\quad & \sum_{i=1}^{N} f_i\big([\Phi]_i [x_0]_{\mathrm{in}_i(d)}\big)\\
\text{s.t.}\quad & x_0 = x(t),\quad G_i \text{ satisfies (5.10)},\\
& [\Phi_x]_i [x_0]_{\mathrm{in}_i(d)} \in \mathcal{X}_i,\quad [\Phi_u]_i [x_0]_{\mathrm{in}_i(d)} \in \mathcal{U}_i,\\
& [\Psi_i]_{\mathrm{in}_i(d)} = H_L\big([\tilde{\mathbf{x}}]_{\mathrm{in}_i(d)}, [\tilde{\mathbf{u}}]_{\mathrm{in}_i(d+1)}\big) G_i\\
& \forall i = 1, \dots, N,\quad \Phi = \Psi.
\end{aligned}
\tag{5.12}
$$
Notice that the objective function and the constraints decompose across the $d$-localized neighborhoods of each subsystem $i$. Given this structure, we can make use of ADMM to decompose this problem into row-wise local subproblems in terms of $[\Phi]_i$ and column-wise local subproblems in terms of $G_i$ and $\Psi_i$, both of which can also be parallelized across the subsystems. The ADMM subroutine iteratively updates the variables as
$$
[\Phi]_i^{\{k+1\}} = \left\{
\begin{aligned}
\underset{[\Phi]_i}{\operatorname{argmin}}\quad & f_i\big([\Phi]_i [x_0]_{\mathrm{in}_i(d)}\big) + \frac{\rho}{2}\left\| g_i\big(\Phi, \Psi^{\{k\}}, \Lambda^{\{k\}}\big) \right\|_F^2\\
\text{s.t.}\quad & [\Phi_x]_i [x_0]_{\mathrm{in}_i(d)} \in \mathcal{X}_i,\ [\Phi_u]_i [x_0]_{\mathrm{in}_i(d)} \in \mathcal{U}_i,\ x_0 = x(t)
\end{aligned}
\right\}
\tag{5.13a}
$$
$$
[\Psi_i]_{\mathrm{in}_i(d)}^{\{k+1\}} = \left\{
\begin{aligned}
\underset{[\Psi_i]_{\mathrm{in}_i(d)},\, G_i}{\operatorname{argmin}}\quad & \left\| g_{\mathrm{in}_i(d)}\big([\Phi_i]^{\{k+1\}}, [\Psi_i], [\Lambda_i]^{\{k\}}\big) \right\|_F^2\\
\text{s.t.}\quad & [\Psi_i]_{\mathrm{in}_i(d)} = [H_L(\tilde{\mathbf{x}}, \tilde{\mathbf{u}})]_{\mathrm{in}_i(d)} G_i,\ G_i \text{ satisfies (5.10)}
\end{aligned}
\right\}
\tag{5.13b}
$$
$$
[\Lambda]_i^{\{k+1\}} = g_i\big(\Phi^{\{k+1\}}, \Psi^{\{k+1\}}, \Lambda^{\{k\}}\big)
\tag{5.13c}
$$
where we define
$$
g_\bullet(\Phi, \Psi, \Lambda) := [\Phi]_\bullet - [\Psi]_\bullet + [\Lambda]_\bullet
$$
with $\bullet$ denoting a subsystem or collection of subsystems, and
$$
[H_L(\tilde{\mathbf{x}}, \tilde{\mathbf{u}})]_{\mathrm{in}_i(d)} := H_L\big([\tilde{\mathbf{x}}]_{\mathrm{in}_i(d)}, [\tilde{\mathbf{u}}]_{\mathrm{in}_i(d+1)}\big).
$$
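As an illustration of the Hankel-matrix object $H_L(\tilde{\mathbf{x}}, \tilde{\mathbf{u}})$, the sketch below builds a depth-$L$ block-Hankel matrix from a recorded trajectory and stacks the state and input parts, in the style of data-driven parametrizations. The function names and the exact stacking convention are assumptions made for illustration, not the thesis's implementation.

```python
import numpy as np

def block_hankel(w: np.ndarray, L: int) -> np.ndarray:
    """Depth-L block-Hankel matrix built from a trajectory w of shape (n, T).

    Column j stacks the window w[:, j], ..., w[:, j+L-1]; the result has
    shape (n*L, T-L+1)."""
    n, T = w.shape
    cols = T - L + 1
    H = np.empty((n * L, cols))
    for j in range(cols):
        H[:, j] = w[:, j:j + L].reshape(-1, order="F")  # stack the window columns
    return H

def H_L(x_data: np.ndarray, u_data: np.ndarray, L: int) -> np.ndarray:
    # One plausible stacking of the state and input Hankel blocks (an
    # illustrative assumption; the partitioning in the thesis may differ).
    return np.vstack([block_hankel(x_data, L), block_hankel(u_data, L)])

# A scalar toy trajectory: one state, one input, T = 8 samples.
x = np.arange(8.0).reshape(1, -1)
u = -np.arange(8.0).reshape(1, -1)
print(H_L(x, u, L=3).shape)  # (6, 6): (n + m) * L rows, T - L + 1 columns
```

Note how the column count, and hence the required trajectory length $T$, depends only on the depth $L$ and the local signal dimensions, consistent with the locality discussion below.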
We note that to solve this subroutine, and in particular optimization (5.13b), each subsystem only needs to collect trajectories of states and control actions from subsystems that are at most $d+2$ hops away. This subroutine thus constitutes a distributed and localized solution. We also emphasize that the trajectory length only needs to scale with the $d$-localized system size instead of the global system size. The full algorithm is given in Algorithm 5.
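The $d$-hop information sets $\mathrm{in}_i(d)$ used throughout can be computed directly from the interconnection topology. The sketch below, with an illustrative adjacency-matrix convention and the assumption $\mathrm{in}_i(0) = \{i\}$, collects all subsystems within $d$ hops of subsystem $i$.

```python
import numpy as np

def in_i(adj: np.ndarray, i: int, d: int) -> set:
    """Subsystems within d hops of subsystem i (including i itself).

    adj is the symmetric 0/1 adjacency matrix of the interconnection graph;
    this mirrors the d-localized information set in_i(d), under the
    illustrative convention in_i(0) = {i}."""
    n = len(adj)
    reach = np.zeros(n, dtype=int)
    reach[i] = 1
    step = (adj != 0).astype(int) + np.eye(n, dtype=int)  # one-hop growth map
    for _ in range(d):
        reach = np.minimum(1, step @ reach)  # expand the reachable set by one hop
    return {j for j in range(n) if reach[j]}

# A 5-node path graph 0 - 1 - 2 - 3 - 4.
A = np.zeros((5, 5), dtype=int)
for a, b in [(0, 1), (1, 2), (2, 3), (3, 4)]:
    A[a, b] = A[b, a] = 1

print(in_i(A, 0, 2))  # prints {0, 1, 2}: nodes at most two hops from node 0
```

Under this convention, the data-collection requirement above corresponds to evaluating `in_i(A, i, d + 2)` for each subsystem.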
Theoretical guarantees
It is worth noting that this reformulation is equivalent to the closed-loop DLMPC introduced in Chapter 2, with the achievability constraint $Z_{AB}\Phi = I$ replaced by the data-driven parametrization in terms of $G$ and the Hankel matrix $H_L$. For this reason, guarantees derived for the DLMPC formulation (3.1) directly apply to problem (5.11) when the constraint sets $\mathcal{X}$ and $\mathcal{U}$ are polytopes.
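For intuition on the achievability constraint $Z_{AB}\Phi = I$ that the data-driven parametrization replaces, recall its finite-horizon recursive form: $\Phi_x[0] = I$ and $\Phi_x[k+1] = A\Phi_x[k] + B\Phi_u[k]$. The sketch below verifies this recursion numerically with arbitrary illustrative matrices, using a static feedback purely as a convenient way to generate valid closed-loop response maps.

```python
import numpy as np

# Numerical sanity check of the recursive form of Z_AB Phi = I. The response
# maps of any static feedback u = K x satisfy it, since Phi_x[k] = (A + B K)^k
# and Phi_u[k] = K Phi_x[k]. A, B, K here are arbitrary illustrative matrices.
rng = np.random.default_rng(2)
n, m, T = 3, 2, 5
A = rng.standard_normal((n, n))
B = rng.standard_normal((n, m))
K = rng.standard_normal((m, n))

Phi_x = [np.eye(n)]  # Phi_x[0] = I
Phi_u = []
for k in range(T):
    Phi_u.append(K @ Phi_x[k])                 # input response at step k
    Phi_x.append((A + B @ K) @ Phi_x[k])       # state response at step k + 1

ok = all(np.allclose(Phi_x[k + 1], A @ Phi_x[k] + B @ Phi_u[k]) for k in range(T))
print(ok)  # prints True
```

The data-driven formulation enforces the same property implicitly, through membership of $\Psi$ in the column space of the Hankel matrix rather than through the model matrices $A$ and $B$.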
Convergence
Algorithm 5 relies on ADMM to separate row- and column-wise computations, in terms of $\Phi$ and $G$, respectively. Each of these is then distributed among the subsystems in the network, and a communication protocol is established to ensure the ADMM steps are properly followed. Hence, one can guarantee the convergence
Algorithm 5 Subsystem $i$'s implementation of D3LMPC
Input: tolerance parameters $\epsilon_p, \epsilon_d > 0$, Hankel matrices $[H_L(\tilde{\mathbf{x}}, \tilde{\mathbf{u}})]_{\mathrm{in}_i(d+1)}$ constructed from arbitrarily generated PE trajectories $\tilde{\mathbf{x}}_{\mathrm{in}_i(d+1)}, \tilde{\mathbf{u}}_{\mathrm{in}_i(d+1)}$.
1: Measure local state $[x_0]_i$, $k \leftarrow 0$.
2: Share the measurement $[x_0]_i$ with $\mathrm{out}_i(d)$ and receive $[x_0]_j$ from $j \in \mathrm{in}_i(d)$.
3: Solve optimization problem (5.13a).
4: Share $[\Phi]_i^{\{k+1\}}$ with $\mathrm{out}_i(d)$. Receive the corresponding $[\Phi]_j^{\{k+1\}}$ from $\mathrm{in}_i(d)$ and construct $[\Phi_i]_{\mathrm{in}_i(d)}^{\{k+1\}}$.
5: Perform update (5.13b).
6: Share $[\Psi_i]_{\mathrm{in}_i(d)}^{\{k+1\}}$ with $\mathrm{out}_i(d)$. Receive the corresponding $[\Psi_j]_{\mathrm{in}_j(d)}^{\{k+1\}}$ from $j \in \mathrm{in}_i(d)$ and construct $[\Psi]_i^{\{k+1\}}$.
7: Perform update (5.13c).
8: if $\big\|[\Psi]_i^{\{k+1\}} - [\Phi]_i^{\{k+1\}}\big\|_F \le \epsilon_p$ and $\big\|[\Phi]_i^{\{k+1\}} - [\Phi]_i^{\{k\}}\big\|_F \le \epsilon_d$:
Apply the computed control action $[u_0]_i = H_L\big([\tilde{\mathbf{u}}]_{\mathrm{in}_i(d)}\big) G_i [x_0]_{\mathrm{in}_i(d)}$, and return to step 1.
else:
Set $k \leftarrow k+1$ and return to step 3.
of the data-driven version of DLMPC in the same way that convergence of model-based DLMPC is shown: by leveraging the convergence result of ADMM in [19]. For additional details, see Lemma 6.
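The ADMM convergence argument can be made tangible on a toy, centralized analogue of updates (5.13a)-(5.13c): minimize $\|\Phi - C\|_F^2$ subject to the consensus constraint $\Phi = \Psi$, with $\Psi$ restricted to the column space of a matrix $H$ standing in for the data-driven parametrization $\Psi = H_L G$. All matrices, the penalty $\rho$, and the tolerances below are illustrative choices, not values from the thesis.

```python
import numpy as np

rng = np.random.default_rng(1)
C = rng.standard_normal((4, 3))   # target in the quadratic cost ||Phi - C||_F^2
H = rng.standard_normal((4, 2))   # stand-in for the Hankel-based parametrization
P = H @ np.linalg.pinv(H)         # orthogonal projector onto range(H)

rho, eps_p, eps_d = 1.0, 1e-9, 1e-9
Phi = Psi = Lam = np.zeros_like(C)
for _ in range(500):
    Phi_prev = Phi
    # (5.13a) analogue: unconstrained quadratic minimization in Phi.
    Phi = (2 * C + rho * (Psi - Lam)) / (2 + rho)
    # (5.13b) analogue: project Phi + Lam onto the feasible set {H g}.
    Psi = P @ (Phi + Lam)
    # (5.13c): dual update with g = Phi - Psi + Lam.
    Lam = Lam + Phi - Psi
    # Stopping criterion mirroring step 8 of Algorithm 5.
    if (np.linalg.norm(Phi - Psi, "fro") <= eps_p
            and np.linalg.norm(Phi - Phi_prev, "fro") <= eps_d):
        break

# At convergence, Phi is the projection of C onto range(H), the true optimum.
print(np.allclose(Phi, P @ C, atol=1e-6))  # prints True
```

In the distributed algorithm, the same three-step structure runs per subsystem, with the consensus enforced through the local exchanges in steps 4 and 6 of Algorithm 5.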
Recursive feasibility
Recursive feasibility for formulation (2.12) is guaranteed by means of a localized maximally invariant terminal set $\mathcal{X}_T$. This set can be computed in a distributed manner and with local information only, as described in Chapter 3. In particular, a closed-loop map $\Phi$ for the unconstrained localized closed-loop system has to be computed. In the model-based SLS problem with quadratic cost, a solution exists for the infinite-horizon case [71], which can be computed in a distributed manner and with local information only. When no model is available, the same problem can be solved using the localized data-driven SLS approach introduced in §IV with a finite-horizon approximation, which also allows for a distributed synthesis with only local data. The length of the time horizon chosen to solve the localized data-driven SLS problem might slightly affect the conservativeness of the terminal set; however, since the conservatism of the FIR approach decays exponentially with the horizon length, the resulting performance loss is not expected to be substantial for typical horizon values. Once $\Phi$ for the unconstrained localized closed-loop system has been computed, Algorithm 2 can be used to synthesize this terminal set in a distributed and localized manner. Therefore, a terminal set that guarantees recursive feasibility for D3LMPC can be computed offline in a distributed manner, using only local information and without the need for a system model.
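For reference, the centralized textbook computation of a maximal constraint-admissible invariant set for stable autonomous dynamics $x^+ = A_{cl}x$ with polytopic constraints $\{x : Fx \le b\}$ iterates the Gilbert-Tan construction, adding the constraints $FA_{cl}^{t}x \le b$ until they become redundant. The sketch below is that generic procedure only, a centralized stand-in and not the distributed synthesis of Algorithm 2; the matrices and tolerances are illustrative, and redundancy is checked via linear programs.

```python
import numpy as np
from scipy.optimize import linprog

def max_invariant_set(A, F, b, max_iter=50, tol=1e-9):
    """Gilbert-Tan iteration: returns (F_inf, b_inf) describing the maximal
    invariant set inside {x : F x <= b} for the stable dynamics x+ = A x."""
    F_cur, b_cur = F.copy(), b.copy()
    F_new = F @ A
    for _ in range(max_iter):
        # The candidate constraints F_new x <= b are redundant iff each row's
        # maximum over the current set does not exceed its bound.
        redundant = True
        for f, beta in zip(F_new, b):
            res = linprog(-f, A_ub=F_cur, b_ub=b_cur,
                          bounds=[(None, None)] * len(f))
            if -res.fun > beta + tol:
                redundant = False
                break
        if redundant:
            return F_cur, b_cur      # finitely determined: set is invariant
        F_cur = np.vstack([F_cur, F_new])
        b_cur = np.concatenate([b_cur, b])
        F_new = F_new @ A
    raise RuntimeError("did not terminate within max_iter")

A_cl = np.array([[0.9, 0.5], [0.0, 0.9]])                 # stable closed-loop map
F = np.array([[1.0, 0], [-1.0, 0], [0, 1.0], [0, -1.0]])  # |x_1| <= 1, |x_2| <= 1
b = np.ones(4)
F_inf, b_inf = max_invariant_set(A_cl, F, b)
```

The distributed construction referenced above achieves the same goal while exchanging only $d$-local information, which is what makes the terminal set compatible with the localized MPC formulation.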
Asymptotic stability
Similar to recursive feasibility, asymptotic stability for the D3LMPC problem (5.11) is directly inherited from the asymptotic stability guarantee for model-based DLMPC. In particular, adding a terminal cost based on the terminal set previously described is a sufficient condition to guarantee asymptotic stability of the DLMPC problem (3.1). Moreover, such a cost can be incorporated into the D3LMPC formulation in the same way as in the model-based DLMPC problem, and the structure of the resulting problem is analogous. This terminal cost introduces coupling among subsystems, but the coupling can be dealt with by solving step 3 of Algorithm 5 via local consensus. Notice that since step 3 of Algorithm 5 is written in terms of $\Phi$, Algorithm 3 can be directly used to handle this coupling.