• Tidak ada hasil yang ditemukan

Chapter 2: Contraction Theory

2.1 Fundamentals

Consider the following smooth non-autonomous (i.e., time-varying) nonlinear sys- tem:

Β€

π‘₯(𝑑) = 𝑓(π‘₯(𝑑), 𝑑) (2.1)

where 𝑑 ∈ Rβ‰₯0 is time, π‘₯ : Rβ‰₯0 ↦→ R𝑛 the system state, and 𝑓 : R𝑛×Rβ‰₯0 ↦→ R𝑛 is a smooth function. Note that the smoothness of 𝑓(π‘₯ , 𝑑) guarantees the existence and uniqueness of the solution to (2.1) for a givenπ‘₯(0) =π‘₯0at least locally [9, pp.

88-95].

Definition 2.1. A differential displacement,𝛿π‘₯, is defined as an infinitesimal dis- placement at a fixed time as used in the calculus of variation [10, p. 107], and(2.1) yields the following differential dynamics:

𝛿π‘₯Β€(𝑑) = πœ• 𝑓

πœ• π‘₯

(π‘₯(𝑑), 𝑑)𝛿π‘₯(𝑑) (2.2)

where 𝑓(π‘₯(𝑑), 𝑑) is given in(2.1).

Let us first present a special case of the comparison lemma [9, pp. 102-103, pp.

350-353] to be used extensively throughout this thesis.

𝛿𝑧

Two neighboring

𝛿 αˆΆπ‘§

trajectories

𝑽

𝑑

Lyapunov Function Incremental Stability

𝑑

Figure 2.1: Illustration of contraction theory, where 𝑉 is a differential Lyapunov function𝑉 = 𝛿π‘₯βŠ€π‘€(π‘₯ , 𝑑)𝛿π‘₯, 𝛿 𝑧 = Θ(π‘₯ , 𝑑)𝛿π‘₯, and 𝑀(π‘₯ , 𝑑) = Θ(π‘₯ , 𝑑)Θ(π‘₯ , 𝑑)⊀ ≻ 0 defines a contraction metric (see Theorem 2.1).

Lemma 2.1. Suppose that a continuously differentiable function 𝑣 ∈ Rβ‰₯0 ↦→ R satisfies the following differential inequality:

Β€

𝑣(𝑑) ≀ βˆ’π›Ύ 𝑣(𝑑) +𝑐, 𝑣(0) =𝑣0, βˆ€π‘‘ ∈Rβ‰₯0

where𝛾 ∈R>0,𝑐 ∈R, and𝑣0∈R. Then we have 𝑣(𝑑) ≀ 𝑣0π‘’βˆ’π›Ύπ‘‘ + 𝑐

𝛾

(1βˆ’π‘’βˆ’π›Ύπ‘‘), βˆ€π‘‘ ∈Rβ‰₯0.

Proof. See [9, pp. 659-660].

2.1.I Contraction Theory and Contraction Metric

In Lyapunov theory, nonlinear stability of (2.1) is studied by constructing a Lyapunov function𝑉(π‘₯ , 𝑑), one example of which is𝑉 =π‘₯βŠ€π‘ƒ(π‘₯ , 𝑑)π‘₯. However, finding𝑉(π‘₯ , 𝑑) for general nonlinear systems is challenging as𝑉(π‘₯ , 𝑑)can be any scalar function of π‘₯ (e.g., a candidate𝑉(π‘₯ , 𝑑) can be obtained by solving a PDE [9, p. 211]). In con- trast, as summarized in Table 1.1, contraction theory uses a differential Lyapunov function that is always a quadratic function of𝛿π‘₯, i.e.,𝑉(π‘₯ , 𝛿π‘₯ , 𝑑) = 𝛿π‘₯βŠ€π‘€(π‘₯ , 𝑑)𝛿π‘₯, thereby characterizing a necessary and sufficient condition for incremental exponen- tial convergence of the multiple nonlinear system trajectories to one single trajectory.

Thus, the problem of finding𝑉 for stability analysis boils down to finding a finite- dimensional positive-definite matrix 𝑀, as illustrated in Figure 2.1 [1]. These properties to be derived in Theorem 2.1, which hold both for autonomous (i.e.

time-invariant) and non-autonomous systems, epitomize significant methodological simplifications of stability analysis in contraction theory.

Theorem 2.1. If there exists a uniformly positive definite matrix given as𝑀(π‘₯ , 𝑑)= Θ(π‘₯ , 𝑑)⊀Θ(π‘₯ , 𝑑) ≻ 0, βˆ€π‘₯ , 𝑑, whereΘ(π‘₯ , 𝑑) defines a smooth coordinate transforma-

tion of 𝛿π‘₯, i.e., 𝛿 𝑧 = Θ(π‘₯ , 𝑑)𝛿π‘₯, s.t. either of the following equivalent conditions holds forβˆƒπ›ΌβˆˆR>0,βˆ€π‘₯ , 𝑑:

πœ†max(𝐹(π‘₯ , 𝑑)) =πœ†max Θ€ +Ξ˜πœ• 𝑓

πœ• π‘₯

Ξ˜βˆ’1

≀ βˆ’π›Ό (2.3)

𝑀€ +𝑀

πœ• 𝑓

πœ• π‘₯ + πœ• 𝑓

πœ• π‘₯

⊀

𝑀 βͺ― βˆ’2𝛼 𝑀 (2.4)

where the arguments (π‘₯ , 𝑑) of 𝑀(π‘₯ , 𝑑) and Θ(π‘₯ , 𝑑) are omitted for notational sim- plicity, then all the solution trajectories of (2.1) converge to a single trajectory exponentially fast regardless of their initial conditions (i.e., contracting, see Defini- tion 2.3), with an exponential convergence rate𝛼. The converse also holds.

Proof. The proof of this theorem can be found in [1], but here we emphasize the use of the comparison lemma given in Lemma 2.1. Taking the time-derivative of a differential Lyapunov function of𝛿π‘₯(or𝛿 𝑧),𝑉 =𝛿 π‘§βŠ€π›Ώ 𝑧 =𝛿π‘₯βŠ€π‘€(π‘₯ , 𝑑)𝛿π‘₯, using the differential dynamics (2.2), we have

𝑉€(π‘₯ , 𝛿π‘₯ , 𝑑)=2𝛿 π‘§βŠ€πΉ 𝛿 𝑧=𝛿π‘₯⊀

𝑀€ + πœ• 𝑓

πœ• π‘₯

⊀

𝑀+𝑀

πœ• 𝑓

πœ• π‘₯

𝛿π‘₯

≀ βˆ’2𝛼𝛿 π‘§βŠ€π›Ώ 𝑧 =βˆ’2𝛼𝛿π‘₯βŠ€π‘€(π‘₯ , 𝑑)𝛿π‘₯

where the conditions (2.3) and (2.4) are used with the generalized Jacobian 𝐹 in (2.3) obtained from 𝛿𝑧€ = Ξ˜Β€π›Ώπ‘₯ +Ξ˜π›Ώπ‘₯Β€ = 𝐹 𝛿 𝑧. We get 𝑑βˆ₯𝛿 𝑧βˆ₯/𝑑 𝑑 ≀ βˆ’π›Όβˆ₯𝛿 𝑧βˆ₯ by 𝑑βˆ₯𝛿 𝑧βˆ₯2/𝑑 𝑑 = 2βˆ₯𝛿 𝑧βˆ₯𝑑βˆ₯𝛿 𝑧βˆ₯/𝑑 𝑑, which then yields βˆ₯𝛿 𝑧(𝑑) βˆ₯ ≀ βˆ₯𝛿 𝑧0βˆ₯π‘’βˆ’π›Όπ‘‘ by the comparison lemma of Lemma 2.1. Hence, any infinitesimal length βˆ₯𝛿 𝑧(𝑑) βˆ₯ and

βˆ₯𝛿π‘₯(𝑑) βˆ₯, as well as 𝛿 𝑧 and𝛿π‘₯, tend to zero exponentially fast. By path integration (see Definition 2.2 and Theorem 2.3), this immediately implies that the length of any finite path converges exponentially to zero from any initial conditions.

Conversely, consider an exponentially convergent system, which implies the follow- ing forβˆƒπ›½ >0 andβˆƒπ‘˜ β‰₯ 1:

βˆ₯𝛿π‘₯(𝑑) βˆ₯2 ≀ βˆ’π‘˜βˆ₯𝛿π‘₯(0) βˆ₯2π‘’βˆ’2𝛽𝑑 (2.5)

and define a matrix-valued functionΞ(π‘₯(𝑑), 𝑑) ∈ R𝑛×𝑛(not necessarilyΞžβ‰» 0) as Ξ =Β€ βˆ’2π›½Ξžβˆ’Ξžπœ• 𝑓

πœ• π‘₯

βˆ’ πœ• 𝑓

πœ• π‘₯

⊀

Ξ, Ξ(π‘₯(0),0) =π‘˜I. (2.6)

Note that, for𝑉 =𝛿π‘₯βŠ€Ξžπ›Ώπ‘₯, (2.6) gives𝑉€ =βˆ’2𝛽𝑉, resulting in𝑉 =βˆ’π‘˜βˆ₯𝛿π‘₯(0) βˆ₯2π‘’βˆ’2𝛽𝑑. Substituting this into (2.5) yields

βˆ₯𝛿π‘₯(𝑑) βˆ₯2 =𝛿π‘₯(𝑑)βŠ€π›Ώπ‘₯(𝑑) ≀𝑉 =𝛿π‘₯(𝑑)⊀Ξ(π‘₯(𝑑), 𝑑)𝛿π‘₯(𝑑) (2.7)

which indeed implies that Ξ βͺ° I ≻ 0 as (2.7) holds for any 𝛿π‘₯. Thus, Ξsatisfies the contraction condition (2.4), i.e., Ξ defines a contraction metric (see Defini- tion 2.3) [1].

Remark 2.1. Since 𝑀 of Theorem 2.1 is positive definite, i.e.,π‘£βŠ€π‘€ 𝑣 = βˆ₯Ξ˜π‘£βˆ₯2 β‰₯ 0, βˆ€π‘£ ∈ R𝑛, and βˆ₯Ξ˜π‘£βˆ₯2 = 0if and only if 𝑣 = 0, the equationΞ˜π‘£ = 0only has a trivial solution 𝑣 = 0. This implies thatΘ is always non-singular (i.e., Θ(π‘₯ , 𝑑)βˆ’1 always exists).

Definition 2.2. Letπœ‰0(𝑑) andπœ‰1(𝑑) denote some solution trajectories of (2.1). We say that(2.1) is incrementally exponentially stable ifβˆƒπΆ , 𝛼 > 0 s.t. the following holds [11]:

βˆ₯πœ‰1(𝑑) βˆ’πœ‰0(𝑑) βˆ₯ ≀𝐢 π‘’βˆ’π›Όπ‘‘βˆ₯πœ‰0(0) βˆ’πœ‰1(0) βˆ₯

for any πœ‰0(𝑑) andπœ‰1(𝑑). Note that, since we have βˆ₯πœ‰1(𝑑) βˆ’πœ‰0(𝑑) βˆ₯ = βˆ₯∫ πœ‰1

πœ‰0

𝛿π‘₯βˆ₯ (see Theorem 2.3), Theorem 2.1 implies incremental stability of the system(2.1).

Definition 2.3. The system(2.1)satisfying the conditions in Theorem 2.1 is said to be contracting, and a uniformly positive definite matrix𝑀that satisfies(2.4)defines a contraction metric. As to be discussed in Theorem 2.3 of Sec. 2.2, a contracting system is incrementally exponentially stable in the sense of Definition 2.2.

Example 2.1. One of the distinct features of contraction theory in Theorem 2.1 is incremental stability with exponential convergence. Consider the example given in [12]:

𝑑 𝑑 𝑑

"

π‘₯1 π‘₯2

#

=

"

βˆ’1 π‘₯1

βˆ’π‘₯1 βˆ’1

# "

π‘₯1 π‘₯2

#

. (2.8)

A Lyapunov function𝑉 = βˆ₯π‘₯βˆ₯2/2 for(2.8), where π‘₯ = [π‘₯1, π‘₯2]⊀, yields𝑉€ ≀ βˆ’2𝑉. Thus,(2.8)is exponentially stable with respect to π‘₯ =0. The differential dynamics of(2.8)is given as

𝑑 𝑑 𝑑

"

𝛿π‘₯1 𝛿π‘₯2

#

=

"

βˆ’1+π‘₯2 π‘₯1

βˆ’2π‘₯1 βˆ’1

# "

𝛿π‘₯1 𝛿π‘₯2

#

, (2.9)

and the contraction condition(2.4)for(2.9)can no longer be proven by𝑉 =βˆ₯𝛿π‘₯βˆ₯2/2, due to the lack of the skew-symmetric property of (2.8) in (2.9). This difficulty illustrates the difference between Lyapunov theory and contraction theory, where the former considers stability of (2.8) with respect to the equilibrium point, while the latter analyzes exponential convergence of any couple of trajectories in (2.8) with respect to each other (i.e., incremental stability in Definition 2.2) [1], [12].

Example 2.2. Contraction defined by Theorem 2.1 guarantees incremental stability of their solution trajectories but does not require the existence of stable fixed points.

Let us consider the following system forπ‘₯ :Rβ‰₯0↦→R:

Β€

π‘₯ =βˆ’π‘₯+𝑒𝑑. (2.10)

Using the constraint(2.4) of Theorem 2.1, we can easily verify that(2.10) is con- tracting as 𝑀 = I defines its contraction metric with the contraction rate 𝛼 = 1.

However, since(2.10)hasπ‘₯(𝑑)=𝑒𝑑/2+ (π‘₯(0) βˆ’1/2)π‘’βˆ’π‘‘as its unique solution, it is not stable with respect to any fixed point.

Example 2.3. Consider a Linear Time-Invariant (LTI) system, π‘₯Β€ = 𝑓(π‘₯) = 𝐴π‘₯. Lyapunov theory states that the origin is globally exponentially stable if and only if there exists a constant positive-definite matrix𝑃 ∈R𝑛×𝑛s.t. [13, pp. 67-68]

βˆƒπœ– > 0 s.t. 𝑃 𝐴+π΄βŠ€π‘ƒ βͺ― βˆ’πœ–I. (2.11)

Now, let 𝑝 = βˆ₯𝑃βˆ₯. Sinceβˆ’I βͺ― βˆ’π‘ƒ/𝑝, (2.11)implies that 𝑃 𝐴+ π΄βŠ€π‘ƒ ≀ βˆ’(πœ–/𝑝)𝑃, which shows that 𝑀 = 𝑃 with 𝛼 = πœ–/(2𝑝) satisfies (2.4) due to the relation

πœ• 𝑓/πœ• π‘₯ = 𝐴.

The contraction condition(2.4) can thus be viewed as a generalization of the Lya- punov stability condition(2.11) (see the generalized Krasovskii’s theorem [14, pp.

83-86]) in a nonlinear non-autonomous system setting, expressed in the differen- tial formulation that permits a non-constant metric and pure differential coordi- nate change [1]. Furthermore, if 𝑓(π‘₯ , 𝑑) = 𝐴(𝑑)π‘₯ and 𝑀 = I, (2.4) results in 𝐴(𝑑) +𝐴(𝑑)⊀ βͺ― βˆ’2𝛼I(i.e., all the eigenvalues of the symmetric matrix 𝐴(𝑑) +𝐴(𝑑)⊀ remain strictly in the left-half complex plane), which is a known sufficient condition for stability of Linear Time-Varying (LTV) systems [14, pp. 114-115].

Example 2.4. Most of the learning-based techniques involving neural networks are based on optimizing their hyperparameters by gradient descent [15]. Contraction theory provides a generalized view on the analysis of such continuous-time gradient- based optimization algorithms [16].

Let us consider a twice differentiable scalar output function 𝑓 : R𝑛× R ↦→ R, a matrix-valued function𝑀 :R𝑛 ↦→R𝑛×𝑛with𝑀(π‘₯) ≻0, βˆ€π‘₯ ∈R𝑛, and the following natural gradient system [17]:

Β€

π‘₯ =β„Ž(π‘₯ , 𝑑) =βˆ’π‘€(π‘₯)βˆ’1βˆ‡π‘₯𝑓(π‘₯ , 𝑑). (2.12)

Then, 𝑓 is geodesically𝛼-strongly convex for each𝑑in the metric defined by 𝑀(π‘₯) (i.e., 𝐻(π‘₯) βͺ° 𝛼 𝑀(π‘₯) with 𝐻(π‘₯) being the Riemannian Hessian matrix of 𝑓 with respect to 𝑀 [18]), if and only if (2.12) is contracting with rate 𝛼 in the metric defined by𝑀 as in(2.4)of Theorem 2.1, where 𝐴 =πœ• β„Ž/πœ• π‘₯. More specifically, the Riemannian Hessian verifies𝐻(π‘₯) =βˆ’( €𝑀 +𝑀 𝐴+ π΄βŠ€π‘€)/2. See [16] for details.

Remark 2.2. Theorem 2.1 can be applied to other vector norms ofβˆ₯𝛿 𝑧βˆ₯𝑝with, e.g., 𝑝 =1or𝑝 =∞[1]. It can also be shown that for a contracting autonomous system of the formπ‘₯Β€ = 𝑓(π‘₯), all trajectories converge to an equilibrium point exponentially fast.

2.1.II Partial Contraction

Although satisfying the condition (2.4) of Theorem 2.1 guarantees exponential convergence of any couple of trajectories in (2.1), proving their incremental stability with respect to a subset of these trajectories possessing a specific property could be sufficient for some cases [6], [8], [19], [20], leading to the concept of partial contraction [3].

Theorem 2.2. Consider the following nonlinear system with the stateπ‘₯ ∈Rβ‰₯0↦→R𝑛 and the auxiliary or virtual system with the stateπ‘ž ∈Rβ‰₯0 ↦→R𝑛:

Β€

π‘₯(𝑑) =g(π‘₯(𝑑), π‘₯(𝑑), 𝑑) (2.13)

Β€

π‘ž(𝑑) =g(π‘ž(𝑑), π‘₯(𝑑), 𝑑) (2.14)

where g : R𝑛 Γ— R𝑝 Γ— Rβ‰₯0 β†’ R𝑛 is a smooth function. Suppose that (2.14) is contracting with respect to π‘ž. If a particular solution of(2.14) verifies a smooth specific property, then all trajectories of(2.13)verify this property exponentially.

Proof. The theorem statement follows from Theorem 2.1 and the fact thatπ‘ž =π‘₯and a trajectory with the specific property are particular solutions of (2.14) (see [3] for details).

The importance of this theorem lies in the fact that we can analyze contraction of some specific parts of the system (2.13) while treating the rest as a function of the time-varying parameterπ‘₯(𝑑). Strictly speaking, the system (2.13) is said to be partially contracting, but we will not distinguish partial contraction from contraction of Definition 2.3 in this thesis for simplicity. Instead, we will use the variable π‘ž when referring to partial contraction of Theorem 2.2.

Examples of a trajectory with a specific property include the target trajectoryπ‘₯𝑑 of feedback control in (3.3), or the trajectory of actual dynamicsπ‘₯ for state estimation in (4.3) and (4.4) to be discussed in the subsequent chapters. Note that contraction can be regarded as a particular case of partial contraction.

Example 2.5. Let us illustrate the role of partial contraction using the following nonlinear system [12]:

Β€

π‘₯ =βˆ’π·(π‘₯)π‘₯+𝑒, π‘₯€𝑑 =βˆ’π·(π‘₯𝑑)π‘₯𝑑

whereπ‘₯ is the system state,π‘₯𝑑is the target state,𝑒is the control input designed as 𝑒 =βˆ’πΎ(π‘₯) (π‘₯βˆ’π‘₯𝑑) + (𝐷(π‘₯) βˆ’π·(π‘₯𝑑))π‘₯𝑑, and𝐷(π‘₯) +𝐾(π‘₯) ≻0. We could define a virtual system, which hasπ‘ž =π‘₯andπ‘ž =π‘₯𝑑 as its particular solutions, as follows:

Β€

π‘ž =βˆ’(𝐷(π‘₯) +𝐾(π‘₯)) (π‘žβˆ’π‘₯𝑑) βˆ’π·(π‘₯𝑑)π‘₯𝑑. (2.15) Since we haveπ›Ώπ‘žΒ€=βˆ’(𝐷(π‘₯) +𝐾(π‘₯))π›Ώπ‘žand𝐷(π‘₯) +𝐾(π‘₯) ≻0,(2.15)is contracting with𝑀 =Iin(2.4). However, if we consider the following virtual system:

Β€

π‘ž =βˆ’(𝐷(π‘ž) +𝐾(π‘ž)) (π‘žβˆ’π‘₯𝑑) βˆ’π·(π‘₯𝑑)π‘₯𝑑 (2.16) which also has π‘ž = π‘₯ and π‘ž = π‘₯𝑑 as its particular solutions, proving contraction is no longer straightforward because of the terms πœ• 𝐷/πœ• π‘žπ‘– and πœ• 𝐾/πœ• π‘žπ‘– in the differential dynamics of (2.16). This is due to the fact that, in contrast to (2.15), (2.16) has particular solutions nonlinear in π‘ž in addition to π‘ž = π‘₯ and π‘ž = π‘₯𝑑, and the condition(2.4) becomes more involved for(2.16) as it is for guaranteeing exponential convergence of any couple of these particular solution trajectories [12].

Example 2.6. As one of the key applications of partial contraction given in Theo- rem 2.2, let us consider the following closed-loop Lagrangian system [14, p. 392]:

H (q) Β₯q+ C (q,q) €€ q+ G (q) =𝑒(q,qΒ€, 𝑑) (2.17) 𝑒(q,qΒ€, 𝑑) =H (q) Β₯qπ‘Ÿ + C (q,q) €€ qπ‘Ÿ + G (q) βˆ’ K (𝑑) ( Β€qβˆ’ Β€qπ‘Ÿ) (2.18) whereq,qΒ€ ∈R𝑛,qΒ€π‘Ÿ =q€𝑑(𝑑) βˆ’Ξ›(𝑑) (qβˆ’q𝑑(𝑑)),H :R𝑛 ↦→R𝑛×𝑛,C :R𝑛×R𝑛↦→ R𝑛×𝑛, G : R𝑛 ↦→ R𝑛, K : Rβ‰₯0 ↦→ R𝑛×𝑛, Ξ› : Rβ‰₯0 ↦→ R𝑛×𝑛, and (q𝑑,q€𝑑) is the target trajectory of the state (q,q). Note thatΒ€ K,Λ≻ 0are control gain matrices (design parameters), andH βˆ’Β€ 2Cis skew-symmetric withH ≻0by construction.

By comparing with (2.17)and(2.18), we define the following virtual observer-like system ofπ‘ž(notq) that hasπ‘ž =qΒ€ andπ‘ž =qΒ€π‘Ÿ as its particular solutions:

H (q) Β€π‘ž+ C (q,q)Β€ π‘ž+ K (𝑑) (π‘žβˆ’ Β€q) + G (q)=𝑒(q,qΒ€, 𝑑) (2.19)

which givesH (q)π›Ώπ‘žΒ€+ (C (q,q) + K (Β€ 𝑑))π›Ώπ‘ž =0. We thus have that 𝑑

𝑑 𝑑

(π›Ώπ‘žβŠ€H (q)π›Ώπ‘ž) =π›Ώπ‘žβŠ€( Β€H βˆ’2C βˆ’2K)π›Ώπ‘ž =βˆ’2π›Ώπ‘žβŠ€Kπ›Ώπ‘ž (2.20) where the skew-symmetry ofH βˆ’Β€ 2C is used to obtain the second equality. Since K ≻ 0, (2.20) indicates that the virtual system (2.19)is partially contracting in π‘ž withH defining its contraction metric. Contraction of the full state (q,q)Β€ will be discussed in Example 2.7.

Note that if we treat the arguments(q,q)Β€ ofH andCalso as the virtual stateπ‘ž, we end up having additional terms such asπœ•H /πœ• π‘žπ‘–, which makes proving contraction analytically more involved as in Example 2.5.

As can be seen from Examples 2.5 and 2.6, the role of partial contraction in theo- rem 2.2 is to provide some insight on stability even for cases where it is difficult to prove contraction for all solution trajectories as in Theorem 2.1. Although finding a contraction metric analytically for general nonlinear systems is challenging, we will see in Chapter 3 and Chapter 4 that the convex nature of the contraction condition (2.4) helps us find it numerically.