Chapter 4: Convex Optimality in Robust Nonlinear Control and Estimation
4.2 CV-STEM Estimation
We could also design an optimal state estimator analogously to the CV-STEM control of Theorem 4.2, due to the differential nature of contraction theory that enables LTV systems-type approaches to stability analysis. In particular, we exploit the estimation and control duality in differential dynamics similar to that of the Kalman filter and LQR in LTV systems.
Let us consider the following smooth nonlinear systems with a measurementπ¦(π‘), perturbed by deterministic disturbancesππ0(π₯ , π‘)andππ1(π₯ , π‘)with supπ₯ ,π‘β₯ππ0(π₯ , π‘) β₯=
Β―
ππ0 β Rβ₯0and supπ₯ ,π‘β₯ππ1(π₯ , π‘) β₯ = πΒ―π1 β Rβ₯0, or by Gaussian white noise, driven
by Wiener processes π²0(π‘) and π²1(π‘) with supπ₯ ,π‘β₯πΊπ0(π₯ , π‘) β₯πΉ = πΒ―π0 β Rβ₯0 and supπ₯ ,π‘β₯πΊπ1(π₯ , π‘) β₯πΉ =πΒ―π1 βRβ₯0:
Β€
π₯ = π(π₯ , π‘) +ππ0(π₯ , π‘), π¦ =β(π₯ , π‘) +ππ1(π₯ , π‘) (4.3) ππ₯ = π(π₯ , π‘)π π‘+πΊπ0ππ²0, π¦ π π‘ =β(π₯ , π‘)π π‘+πΊπ1ππ²1 (4.4) whereπ‘ βRβ₯0is time,π₯ :Rβ₯0β¦β Rπis the system state,π¦ :Rβ₯0β¦βRπis the system measurement, π : RπΓRβ₯0 β¦β Rπ and β : Rπ ΓRβ₯0 β¦β Rπ are known smooth functions,ππ0:RπΓRβ₯0β¦β Rπ,ππ1 :RπΓRβ₯0 β¦βR0,πΊπ0:RπΓRβ₯0β¦βRπΓπ€0, and πΊπ1:RπΓRβ₯0β¦β RπΓπ€1 are unknown bounded functions for external disturbances, π²0 :Rβ₯0β¦βRπ€0 andπ²1:Rβ₯0β¦β Rπ€1 are two independent Wiener processes, and the arguments ofπΊπ0(π₯ , π‘)andπΊπ1(π₯ , π‘)are suppressed for notational convenience.
Let π΄(ππ, π₯ ,π₯ , π‘Λ )andπΆ(ππ, π₯ ,π₯ , π‘Λ ) be the SDC matrices given by Lemma 3.1 with (π , π ,π ,Β― π’Β―) replaced by(π ,π₯ , π₯ ,Λ 0)and (β,π₯ , π₯ ,Λ 0), respectively, i.e.
π΄(ππ, π₯ ,π₯ , π‘Λ ) (π₯Λβπ₯) = π(π₯ , π‘Λ ) β π(π₯ , π‘) (4.5) πΆ(ππ, π₯ ,π₯ , π‘Λ ) (π₯Λβπ₯) =β(π₯ , π‘Λ ) ββ(π₯ , π‘). (4.6) We design a nonlinear state estimation law parameterized by a matrix-valued func- tionπ(π₯ , π‘Λ )as follows:
Β€Λ
π₯ = π(π₯ , π‘Λ ) +πΏ(π₯ , π‘Λ ) (π¦ββ(π₯ , π‘Λ )) (4.7)
= π(π₯ , π‘Λ ) +π(π₯ , π‘Λ )πΆΒ―(ππ,π₯ , π‘Λ )β€π (π₯ , π‘Λ )β1(π¦ββ(π₯ , π‘Λ ))
where Β―πΆ(ππ,π₯ , π‘Λ ) = πΆ(ππ,π₯ ,Λ π₯ , π‘Β― ) for a fixed trajectory Β―π₯ (e.g., Β―π₯ = 0, see Theo- rem 3.2), π (π₯ , π‘Λ ) β» 0 is a weight matrix on the measurement π¦, and π(π₯ , π‘Λ ) β» 0 is a positive definite matrix (which satisfies the matrix inequality constraint for a contraction metric, to be given in (4.12) of Theorem 4.5). Note that we could use other forms of estimation laws such as the EKF [2], [6], [7], analytical SLAM [8], or SDC with respect to a fixed point [1], [4], [9], depending on the application of interest, which result in a similar stability analysis as in Theorem 3.2.
4.2.I Nonlinear Stability Analysis of SDC-based State Estimation using Con- traction Theory
Substituting (4.7) into (4.3) and (4.4) yields the following virtual system of a smooth pathπ(π, π‘), parameterized byπβ [0,1]to haveπ(π=0, π‘) =π₯andπ(π=1, π‘) =π₯Λ:
Β€
π(π, π‘) =π(π(π, π‘), π₯ ,π₯ , π‘Λ ) +π(π, π₯ ,π₯ , π‘Λ ) (4.8) ππ(π, π‘) =π(π(π, π‘), π₯ ,π₯ , π‘Λ )π π‘+πΊ(π, π₯ ,π₯ , π‘Λ )ππ²(π‘) (4.9)
where π(π, π₯ ,π₯ , π‘Λ ) = (1 β π)ππ0(π₯ , π‘) + π πΏ(π₯ , π‘Λ )ππ1(π₯ , π‘), πΊ(π, π₯ ,π₯ , π‘Λ ) = [(1 β π)πΊπ0(π₯ , π‘), π πΏ(π₯ , π‘Λ )πΊπ1(π₯ , π‘)],π² = [π²β€
0 ,π²1β€]β€, and π(π, π₯ , π₯π, π’π, π‘)is defined as
π(π, π₯ ,π₯ , π‘Λ ) =(π΄(ππ, π₯ ,π₯ , π‘Λ ) βπΏ(π₯ , π‘Λ )πΆ(ππ, π₯ ,π₯ , π‘Λ )) (πβπ₯) + π(π₯ , π‘). (4.10) Note that (4.10) is constructed to containπ =π₯Λandπ =π₯as its particular solutions of (4.8) and (4.9). Ifπ =0 andπ² =0, the differential dynamics of (4.8) and (4.9) forπππ=π π/π πis given as
πππΒ€ =(π΄(ππ, π₯ ,π₯ , π‘Λ ) βπΏ(π₯ , π‘Λ )πΆ(ππ, π₯ ,π₯ , π‘Λ ))πππ . (4.11) The similarity between (3.14) (πππΒ€= (π΄βπ΅πΎ)πππ) and (4.11) leads to the following theorem [1]β[4]. Again, note that we could also use the SDC formulation with respect to a fixed point as delineated in Theorem 3.2 and as demonstrated in [1], [4], [9].
Theorem 4.3. Suppose βπ,Β― πΒ― β Rβ₯0 s.t. β₯π β1(π₯ , π‘Λ ) β₯ β€ πΒ―, β₯πΆ(ππ, π₯ ,π₯ , π‘Λ ) β₯ β€
Β―
π, βπ₯ ,π₯ , π‘Λ . Suppose also that πI βͺ― π βͺ― π πΌ of (2.26) holds, or equivalently, I βͺ― πΒ― βͺ― π πΌ of(3.19)holds withπ = π(π₯ , π‘Λ )β1,πΒ― = ππ, π =π, and π = π/π. As in Theorem 3.1, letπ½be defined asπ½ =0for deterministic systems(4.3)and π½=πΌπ =πΌπ0+π2πΌπ1 =πΏππΒ―2
π0(πΌπΊ +1/2)/2+π2πΏππΒ―2πΒ―2πΒ―2
π1(πΌπΊ +1/2)/2 for stochastic systems(4.4), where2πΌπ0= πΏππΒ―2
π0(πΌπΊ+1/2),2πΌπ1= πΏππΒ―2πΒ―2πΒ―2
π1(πΌπΊ+ 1/2), πΏπ is the Lipschitz constant of ππ/π π₯π, πΒ―π0 and πΒ―π1 are given in(4.4), and
βπΌπΊ βR>0is an arbitrary constant as in Theorem 2.5.
If π(π₯ , π‘Λ ) in (4.7) is constructed to satisfy the following convex constraint for
βπΌβR>0:
€¯
π+2 sym(π π΄Β― βππΆΒ―β€π β1πΆ) βͺ― β2πΌπΒ― βπ π½I (4.12) then Theorems 2.4 and 2.5 hold for the virtual systems(4.8)and(4.9), respectively, i.e., we have the following bounds fore =π₯Λβπ₯withπ =πand π=π/π:
β₯e(π‘) β₯ β€
β
ππβ(0)πβπΌπ‘+
Β― ππ0
β
π+πΒ―πΒ―πΒ―π1π πΌ
(1βπβπΌπ‘) (4.13)
E
β₯e(π‘) β₯2
β€ πE[ππ β(0)]πβ2πΌπ‘+ πΆπ0π+πΆπ1π π2 2πΌ
(4.14) where ππ β = β«π₯Λ
π₯
πΏπβ€π πΏπ and πβ = β«π₯Λ
π₯ β₯ΞπΏπβ₯ are given in Theorem 2.3 with π = πβ1 = Ξβ€Ξ defining a contraction metric, the disturbance bounds πΒ―π0, πΒ―π1,
Β―
ππ0, and πΒ―π1 are given in (4.3)and (4.4), respectively,πΆπ0 = πΒ―2
π0(2πΌπΊβ1+1), and πΆπ1 = πΒ―2πΒ―2πΒ―2
π1(2πΌπΊβ1+1). Note that for stochastic systems, the probability that
β₯eβ₯ is greater than or equal toπβR>0is given as P[β₯e(π‘) β₯ β₯ π] β€ 1
π2
πE[ππ β(0)]πβ2πΌπ‘+ πΆπΈ 2πΌ
(4.15) whereπΆπΈ =πΆπ0π+πΆπ1π π2.
Proof. Theorem 3.1 indicates that (4.12) is equivalent to
πΒ€ +2 sym(π π΄βπΆΒ―β€π β1πΆ) βͺ― β2πΌπβπ½I. (4.16) Computing the time derivative of a Lyapunov functionπ =πππβ€π πππwithπππ=
π π/π πfor the unperturbed virtual dynamics (4.11), we have using (4.16) that πΒ€ =πππβ€π πππ =πππβ€( Β€π +2π π΄β2 Β―πΆβ€π β1πΆ)πππ β€ β2πΌπ βπ½β₯πππβ₯2
which implies thatπ = πβ1defines a contraction metric. Since we have πβ1I βͺ― π βͺ― πβ1I,π β₯ πβ1β₯πππβ₯2, and
β₯Ξ(π₯ , π‘Λ )πππβ₯ β€ πΒ―π0/β
π+πΒ―π1πΒ―πΒ―
β π
β₯πππΊβ₯2πΉ β€ πΒ―2
π0+πΒ―2πΒ―2πΒ―2
π1π2
for π in (4.8) andπΊ in (4.9), the bounds (4.13) β (4.15) follow from the proofs of Theorems 2.4 and 2.5 [2], [3].
Remark 4.1. Although (4.12) is not an LMI due to the nonlinear term βπ π½I on its right-hand side for stochastic systems(4.4), it is a convex constraint asβπ π½ =
βππΌπ =βππΌπ0βπ3πΌπ1is a concave function forπ βR>0[3], [10].
4.2.II CV-STEM Formulation for State Estimation
The estimator (4.7) gives a convex steady-state upper bound of the Euclidean distance betweenπ₯and Λπ₯ as in Theorem 4.1 [1]β[4].
Theorem 4.4. If(4.12)of Theorem 4.3 holds, then we have the following bound:
π‘ββlim
βοΈ
E
β₯π₯Λβπ₯β₯2
β€ π0(πΌ, πΌπΊ)π+π1(πΌ, πΌπΊ)ππ (4.17) where π0 = πΒ―π0/πΌ, π1 = πΒ―πΒ―πΒ―π1/πΌ, π = 1 for deterministic systems (4.8), and π0=βοΈ
πΆπ0/(2πΌ),π1=πΆπ1/(2β
2πΌπΆπ0), andπ =2for stochastic systems(4.9), with πΆπ0andπΆπ0given asπΆπ0=πΒ―2
π0(2πΌβ1
πΊ +1)andπΆπ1= πΒ―2πΒ―2πΒ―2
π1(2πΌβ1
πΊ +1).
Proof. The upper bound (4.17) for deterministic systems (4.8) follows from (4.13) with the relation 1 β€ β
π β€ πdue toπ β€ π. For stochastic systems, we have using (4.14) that
πΆπ0π+πΆπ1π2π β€ πΆπ0(π+ (πΆπ1/(2πΆπ0))π2)2
due to 1β€ π β€ π2andπ βR>0. This gives (4.17) for stochastic systems (4.9).
Finally, the CV-STEM estimation framework is summarized in Theorem 4.5 [1]β[4].
Theorem 4.5. Suppose thatπΌ, πΌπΊ, πΒ―π0, πΒ―π1, πΒ―π0, πΒ―π1, andπΏπ in(4.12) and(4.17) are given. If the pair (π΄, πΆ) is uniformly observable, the non-convex optimization problem of minimizing the upper bound(4.17)is equivalent to the following convex optimization problem with the contraction constraint (4.12) and I βͺ― πΒ― βͺ― πI of (3.19):
π½β
πΆπ = min
πβR>0, πβR,πΒ―β»0
π0π+π1ππ +π2π(π, π,πΒ―) (4.18) s.t.(4.12) and (3.19)
whereπ0,π1, andπ are as defined in(4.17)of Theorem 4.4,π2 βRβ₯0, andπis some performance-based cost function as in Theorem 4.2.
The weight π1 for ππ indicates how much we trust the measurement π¦(π‘). Using non-zeroπ2enables finding contraction metrics optimal in a different sense in terms of π. Furthermore, the coefficients of the SDC parameterizations ππ and ππ in Lemma 3.1 (i.e., π΄ = Γ
ππ,ππ΄π andπΆ = Γ
ππ,ππΆπ in (4.5) and (4.6)) can also be treated as decision variables by convex relaxation [9], thereby adding a design flexibility to mitigate the effects of external disturbances while verifying the system observability.
Proof. The proposed optimization problem is convex as its objective and constraints are convex in terms of decision variables π, π, and Β―π (see Remark 4.1). Also, larger Β―ππ1and Β―ππ1in (4.3) and (4.4) imply larger measurement uncertainty. Thus by definition ofπ1in Theorem 4.4, the larger the weight ofπ, the less confident we are in π¦(π‘)(see Example 4.2). The last statement on the SDC coefficients for guaranteeing observability follows from Proposition 1 of [9] and Proposition 1 of [1].
Example 4.2. The weights π0 andπ1 of the CV-STEM estimation of Theorem 4.5 has an analogous trade-off to the case of the Kalman filter with the process and
sensor noise covariance matrices,πandπ, respectively, since the termπ0πin upper bound of the steady-state tracking error in(4.17)becomes dominant if measurement noise is much smaller than process noise (πΒ―π0 β« πΒ―π1 orπΒ―π0 β« πΒ―π1), and the term π1ππ becomes dominant if measurement noise is much greater than process noise (πΒ―π0 βͺ πΒ―π1orπΒ―π0βͺ πΒ―π1). In particular [2], [3],
β’ ifπ1is much greater thanπ0, large measurement noise leads to state estimation that responds slowly to unexpected changes in the measurement π¦ (i.e. small estimation gain due toπ =π β₯ β₯πβ₯), and
β’ if π1is much smaller thanπ0, large process noise leads to state estimation that responds fast to changes in the measurement (i.e. largeπ =π β₯ β₯πβ₯).
This is also because the solution π = πβ1 β» 0of the Kalman filter Riccati equa- tion [7, p. 375], πΒ€ = π΄π+π π΄β€ βππΆβ€π β1πΆ π+π, can be viewed as a positive definite matrix that defines a contraction metric as discussed in Example 2.3.