Chapter 3: Robust Nonlinear Control and Estimation via Contraction Theory . 50
3.2 LMI Conditions for Contraction Metrics
We design a nonlinear feedback tracking control law parameterized by a matrix- valued functionπ(π₯ , π₯π, π’π, π‘) (orπ(π₯ , π‘), see Theorem 3.2) as follows:
π’ =π’πβπΎ(π₯ , π₯π, π’π, π‘) (π₯βπ₯π) (3.10)
=π’πβπ (π₯ , π₯π, π’π, π‘)β1π΅(π₯ , π‘)β€π(π₯ , π₯π, π’π, π‘) (π₯βπ₯π)
where π (π₯ , π₯π, π’π, π‘) β» 0 is a weight matrix on the inputπ’ andπ(π₯ , π₯π, π’π, π‘) β» 0 is a positive definite matrix (which satisfies the matrix inequality constraints for a contraction metric, to be given in Theorem 3.1). As discussed in Sec. 3.1.III, the extended linear form of the tracking control (3.10) enables LTV systems-type ap- proaches to Lyapunov function construction, while being general enough to capture the nonlinearity of the underlying dynamics due to Lemma 3.2 [36].
Lemma 3.2. Consider a general feedback controllerπ’defined asπ’= π(π₯ , π₯π, π’π, π‘) withπ(π₯π, π₯π, π’π, π‘) =π’π, whereπ :RπΓRπΓRπ ΓRβ₯0β¦β Rπ. Ifπ is piecewise continuously differentiable, then βπΎ : Rπ Γ Rπ Γ Rπ Γ Rβ₯0 β¦β RπΓπ s.t. π’ = π(π₯ , π₯π, π’π, π‘) =π’πβπΎ(π₯ , π₯π, π’π, π‘) (π₯βπ₯π).
Proof. Usingπ(π₯π, π₯π, π’π, π‘) =π’π,π’can be decomposed asπ’=π’π+(π(π₯ , π₯π, π’π, π‘)β
π(π₯π, π₯π, π’π, π‘)). Since we haveπ(π₯ , π₯π, π’π, π‘) βπ(π₯π, π₯π, π’π, π‘) =β«1
0 (π π(ππ₯+ (1β π)π₯π, π₯π, π’π, π‘)/ππ)ππ, selectingπΎas
πΎ =β
β« 1 0
π π
π π₯
(ππ₯+ (1βπ)π₯π, π₯π, π’π, π‘)ππ gives the desired relation [36].
Remark 3.4. Lemma 3.2 implies that designing optimal π ofπ’ = π(π₯ , π₯π, π’π, π‘) re- duces to designing the optimal gainπΎ(π₯ , π₯π, π’π, π‘)ofπ’=π’πβπΎ(π₯ , π₯π, π’π, π‘) (π₯βπ₯π). We could also generalize this idea further using the CCM-based differential feedback controllerπΏπ’= π(π₯ , πΏπ₯ , π’, π‘)[3], [14], [15], [19], [20], [33] (see Theorem 4.6).
Substituting (3.10) into (3.1) and (3.2) yields the following virtual system of a smooth pathπ(π, π‘), parameterized by π β [0,1] to haveπ(0, π) =π₯π andπ(1, π‘) =π₯, for partial contraction in Theorem 2.2:
Β€
π(π, π‘) =π(π(π, π‘), π₯ , π₯π, π’π, π‘) +π(π, π₯ , π‘) (3.11) ππ(π, π‘) =π(π(π, π‘), π₯ , π₯π, π’π, π‘)π π‘+πΊ(π, π₯ , π‘)ππ²(π‘) (3.12) whereπ(π, π₯ , π‘) =π ππ(π₯ , π‘),πΊ(π, π₯ , π‘) =ππΊπ(π₯ , π‘), andπ(π, π₯ , π₯π, π’π, π‘)is defined as
π =(π΄(π, π₯ , π₯π, π’π, π‘) βπ΅(π₯ , π‘)πΎ(π₯ , π₯π, π’π, π‘)) (πβπ₯π) + π(π₯π, π‘) +π΅(π₯π, π‘)π’π (3.13) whereπ΄is the SDC matrix of Lemma 3.1 with(π ,π ,Β― π’Β―) =(π₯ , π₯π, π’π). Settingπ=1 in (3.11) and (3.12) results in (3.1) and (3.2), respectively, and settingπ=0 simply results in (3.3). Consequently, both π = π₯ and π = π₯π are particular solutions of (3.11) and (3.12). If there is no disturbance acting on the dynamics (3.1) and (3.2), the differential dynamics of (3.11) and (3.12) forπππ=π π/π πis given as
πππΒ€ =(π΄(π, π₯ , π₯π, π’π, π‘) βπ΅(π₯ , π‘)πΎ(π₯ , π₯π, π’π, π‘))πππ . (3.14) In [12], [13], [16], [17], it is proposed that the contraction conditions of Theorems 2.1 and 2.5 for the closed-loop dynamics (3.11) and (3.12) can be expressed as convex constraints as summarized in Theorem 3.1.
Theorem 3.1. Let π½be defined as π½=0for deterministic systems(3.1)and π½=πΌπ = πΏππΒ―2
π(πΌπΊ +1/2)
for stochastic systems (3.2), respectively, where πΒ―π is given in (3.2), πΏπ is the Lipschitz constant of π π/π π₯π for π of (3.10), and πΌπΊ β R>0 is an arbitrary constant as in Theorem 2.5. Also, letπ = π(π₯ , π₯π, π’π, π‘)β1(orπ =π(π₯ , π‘)β1, see
Theorem 3.2),πΒ― = ππ, and π = π. Then the following three matrix inequalities are equivalent:
πΒ€ +π
π π
π π + π π
π π
β€
π βͺ― β2πΌ πβπ½I, βπβ [0,1] (3.15) πΒ€ +2 sym(π π΄) β2π π΅ π β1π΅β€π βͺ― β2πΌ πβ π½I (3.16)
β Β€πΒ― +2 sym(π΄πΒ―) β2π π΅ π β1π΅β€ βͺ― β2πΌπΒ― β π½ π
Β―
π2 (3.17)
where π is as defined in (3.13). For stochastic systems with π½ = πΌπ > 0, these inequalities are also equivalent to
"
β Β€πΒ― +2 sym(π΄πΒ―) β2π π΅ π β1π΅β€+2πΌπΒ― πΒ―
πΒ― βπ
π½I
#
βͺ―0. (3.18)
Note thatπandπΒ― are required for(3.17)and(3.18)and the arguments(π₯ , π₯π, π’π, π‘) for each matrix are suppressed for notational simplicity.
Furthermore, under these equivalent contraction conditions, Theorems 2.4 and 2.5 hold for the virtual systems (3.11) and (3.12), respectively. In particular, if πIβͺ― π βͺ― πIof(2.26)holds, or equivalently
Iβͺ― πΒ―(π₯ , π₯π, π’π, π‘) βͺ― πI (3.19)
holds for π=π/π, then we have the following bounds:
β₯π₯(π‘) βπ₯π(π‘) β₯ β€ πβ(0)
β π
πβπΌπ‘+ πΒ―π πΌ
β
π(1βπβπΌπ‘) (3.20)
E
β₯π₯(π‘) βπ₯π(π‘) β₯2
β€ E[ππ β(0)]
π
πβ2πΌπ‘+ πΆπΆ 2πΌ
π (3.21)
whereππ β =β«π₯
π₯π
πΏπβ€π πΏπandπβ =β«π₯
π₯π
β₯ΞπΏπβ₯are as given in Theorem 2.3 withπ = Ξβ€Ξ, the disturbance bounds πΒ―π andπΒ―π are given in (3.1) and(3.2), respectively, and πΆπΆ = πΒ―2
π(2πΌπΊβ1 +1). Note that for stochastic systems, the probability that
β₯π₯βπ₯πβ₯is greater than or equal toπ βR>0is given as P[β₯π₯(π‘) βπ₯π(π‘) β₯ β₯π] β€ 1
π2
E[ππ β(0)]
π
πβ2πΌπ‘+ πΆπΆ 2πΌ
π
. (3.22)
Proof. Substituting (3.13) into (3.15) gives (3.16). Since π > 0 and π β» 0, multiplying (3.16) byπand then byπfrom both sides preserves matrix definiteness.
Also, the resultant inequalities are equivalent to the original ones [37, p. 114]. These operations performed on (3.16) yield (3.17). If π½ = πΌπ > 0 for stochastic systems,
applying Schurβs complement lemma [37, p. 7] to (3.17) results in the Linear Matrix Inequality (LMI) constraint (3.18) in terms of Β―π andπ. Therefore, (3.15) β (3.18) are indeed equivalent.
Also, since we have β₯πππ(π, π₯ , π‘) β₯ β€ πΒ―π forπ in (3.11) andβ₯πππΊ(π, π₯ , π‘) β₯2
πΉ β€ πΒ―2
π
forπΊin (3.12), the virtual systems in (3.11) and (3.12) clearly satisfy the conditions of Theorems 2.4 and 2.5 if it is equipped with (3.15), which is equivalent to (3.16) β (3.18). This implies the exponential bounds (3.20) β (3.22) rewritten usingπ=π/π, following the proofs of Theorems 2.4 and 2.5.
Because of the control and estimation duality in differential dynamics similar to that of the Kalman filter and Linear Quadratic Regulator (LQR) in LTV systems, we have an analogous robustness result for the contraction theory-based state estimator as to be derived in Sec. 4.2.
Although the conditions (3.15) β (3.18) depend on (π₯π, π’π), we could also use the SDC formulation with respect to a fixed point [12], [13] in Lemma 3.1 to make them independent of the target trajectory as in the following theorem.
Theorem 3.2. Let (π₯ ,Β― π’Β―) be a fixed point selected arbitrarily in Rπ Γ Rπ, e.g., (π₯ ,Β― π’Β―)= (0,0), and letπ΄(π₯ , π‘)be an SDC matrix constructed with(π ,π ,Β― π’Β―)= (π₯ ,π₯ ,Β― π’Β―) in Lemma 3.1, i.e.,
π΄(π, π₯ , π‘) (π₯βπ₯Β―) = π(π₯ , π‘) +π΅(π₯ , π‘)π’Β―β π(π₯ , π‘Β― ) βπ΅(π₯ , π‘Β― )π’ .Β― (3.23) Suppose that the contraction metric of Theorem 3.1 is designed by π(π₯ , π‘) with π΄ of(3.23), independently of the target trajectory (π₯π, π’π), and that the systems(3.1) and(3.2)are controlled by
π’ =π’πβπ (π₯ , π‘)β1π΅(π₯ , π‘)β€π(π₯ , π‘) (π₯βπ₯π) (3.24) with such π(π₯ , π‘), where π (π₯ , π‘) β» 0 is a weight matrix on π’. If the function π(π₯ , π₯π, π’π, π‘) = π΄(π, π₯ , π‘) (π₯πβπ₯Β―) +π΅(π₯ , π‘) (π’πβπ’Β―) is Lipschitz inπ₯ with its Lips- chitz constantπΏΒ―, then Theorem 3.1 still holds withπΌof the conditions(3.15)β(3.18) replaced byπΌ+πΏΒ―βοΈ
π/π. The same argument holds for state estimation of Theo- rem 4.3 to be discussed in Sec. 4.2.
Proof. The unperturbed virtual system of (3.1), (3.2), and (3.3) with π΄ of (3.23) andπ’of (3.24) is given as follows:
Β€
π =(π΄(π, π₯ , π‘) βπ΅(π₯ , π‘)πΎ(π₯ , π‘)) (πβπ₯π) +π΄(π, π, π‘) (π₯πβπ₯Β―) +π΅(π, π‘) (π’πβπ’Β―)
+ π(π₯ , π‘Β― ) +π΅(π₯ , π‘Β― )π’Β― (3.25)
whereπΎ(π₯ , π‘) = π (π₯ , π‘)β1π΅(π₯ , π‘)β€π(π₯ , π‘). Following the proof of Theorem 3.1, the computation ofπΒ€, whereπ =πΏπβ€π(π₯ , π‘)πΏπ, yields an extra term
2πΏπβ€π
π π
π π
(π, π₯π, π’π, π‘)πΏπ β€ 2 Β―πΏ
βοΈ
π π
πΏπβ€π πΏπ (3.26)
due to the Lipschitz condition on π, where π(π, π₯π, π’π, π‘) = π΄(π, π‘) (π₯π β π₯Β―) + π΅(π, π‘) (π’π βπ’Β―). This indeed implies that the system (3.25) is contracting as long as the conditions (3.15) β (3.18) hold with πΌ replaced by πΌ+ πΏΒ―βοΈ
π/π. The last statement on state estimation follows from the nonlinear control and estimation duality to be discussed in Sec. 4.2.
Remark 3.5. As in [12], we could directly use the extra term2πΏπβ€π(π π/π π)πΏπof (3.26)in(3.15)β(3.18)without upper-bounding it, although now the constraints of Theorem 3.2 depend on(π₯ , π, π‘)instead of(π₯ , π‘). Also, the following two inequalities given in [12] withπΎΒ― =π πΎ, πΎ βRβ₯0:
β Β€πΒ― + π΄πΒ― +π π΄Β― β€+πΎΒ―Iβπ π΅ π β1π΅β€ βͺ―0
"
Β―
πΎI+π π΅ π β1π΅β€βπ πΒ― β€βππΒ― β2πΌπΒ― πΒ―
Β―
π π
2πΌπ I
#
βͺ° 0. are combined as one LMI(3.18)in Theorems 3.1 and 3.2.
Example 3.4. The inequalities in Theorem 3.1 can be interpreted as in the Riccati inequality inHβcontrol. Consider the following system:
Β€
π₯ = π΄π₯+π΅π’π’+π΅π€π€ , π§=πΆπ§π₯ (3.27)
where π΄ β RπΓπ, π΅π’ β RπΓπ, π΅π€ β RπΓπ€, and πΆπ§ β RπΓπ are constant matrices, π€ β Rπ€ is an exogenous input, and π§ β Rπ is a system output. As shown in [4]
and [37, p. 109], there exists a state feedback gainπΎ = π β1π΅β€
π’π such that theL2 gain of the closed-loop system(3.27),supβ₯π€β₯β 0β₯π§β₯/β₯π€β₯, is less than or equal toπΎ if
2 sym(π π΄) β2π π΅π’π β1π΅β€
π’π+ π π΅π€π΅β€
π€π πΎ2
+πΆβ€
π§πΆπ§ βͺ― 0 (3.28)
has a solutionπ β» 0, whereπ β» 0is a constant weight matrix on the inputπ’. If we select π΅π€ andπΆπ§ to have π΅π€π΅β€
π€ βͺ° (πβ1)2 andπΆβ€
π§ πΆπ§ βͺ° 2πΌ π for someπΌ > 0, the contraction condition(3.16)in Theorem 3.1 can be satisfied with π = π, π΅ = π΅π’, andπ½ =1/πΎ2due to(3.28).
In Sec. 3.3 and Sec. 3.4, we will discuss the relationship to input-output stability theory as in Example 3.4, using the results of Theorem 3.1.