Chapter 4: Analysis, Proofs, and Implementation of Neural-Fly
4.5 Stability and Robustness Formal Guarantees and Proof
We divide the proof of (3.12) into two steps. First, in Theorem 4.5.1, we show that the combined composite velocity tracking error and adaptation error, k [π ; Λπ] k, ex- ponentially converges to a bounded error ball. This implies the exponential con- vergence of π . Then in Corollary 4.5.1.1 we show that when π is exponentially bounded,πΛ is also exponentially bounded. Combining the exponential bound from Theorem 4.5.1 and the ultimate bound from Corollary 4.5.1.1 proves Theorem 3.4.1.
Before discussing the main proof, let us consider the robustness properties of the feedback controller without considering any specific adaptation law. Taking the dy- namics (3.1), control law (3.7), the composite velocity error definition (3.10), and the parameter estimation errorπΛ =πΛβπ, we find
ππ Β€+ (πΆ+πΎ)π =βππΛ+π (4.6) We can use the Lyapunov functionV =π >π π under the assumption of boundedπΛ to show that
π‘ββlim kπ k β€ supπ‘kπβππΛkπmax(π)
πmin(πΎ)πmin(π) (4.7) Taking this results alone, one might expect that any online estimator or learning algorithm will lead to good performance. However, the boundedness of πΛ is not guaranteed; Slotine and Li discuss this topic thoroughly [9]. In the full proof below, we show the stability and robustness of the Neural-Fly adaptation algorithm.
First, we introduce the parameter measurement noiseπΒ―, whereπΒ― = π¦β ππ. Thus,
Β―
π = π + π and kπΒ―k β€ kπk + kπk by the triangle inequality. Using the above closed loop dynamics (4.6), the parameter estimation error πΛ, and the adaptation
law (3.8) and (3.9), the combined velocity and parameter-error closed-loop dynam- ics are given by
"
π 0
0 πβ1
# "
Β€ π
Β€Λ π
# +
"
πΆ+πΎ π
βππ π>π β1π+π πβ1
# "
π
Λ π
#
=
"
π
π>π β1πΒ―βπβ1ππβπβ1πΒ€
#
(4.8) π
π π‘
πβ1
=βπβ1π πΒ€ β1=πβ1
2π πβπ+π π>π β1π π
πβ1 (4.9) For our stability proof, we rely on the fact thatπβ1is both uniformly positive definite and uniformly bounded, that is, there exists some positive definite, constant matri- ces π΄andπ΅such that π΄ πβ1 π΅. Dieci and Eirola [7] show the slightly weaker result that π is positive definite and finite when πis bounded under the looser as- sumptionπ 0. Following the proof from [7] with the additional assumption that πis uniformly positive definite, one can show the uniform definiteness and uniform boundedness of π. Hence, πβ1 is also uniformly positive definite and uniformly bounded.
Theorem 4.5.1. Given dynamics that evolve according to (4.8) and (4.9), uniform positive definiteness and uniform boundedness ofπβ1, the norm of
"
π
Λ π
#
exponentially converges to the bound given in (4.10) with rateπΌ.
lim
π‘ββ
"
π
Λ π
#
β€ 1
πΌπmin(M)
sup
π‘
kπk +sup
π‘
( kπ>π β1πΒ―k) +πmax(πβ1)sup
π‘
( kππ+ Β€πk)
(4.10) whereπΌandM are functions ofπ, π , π , πΎ , π andπ, and πmin(Β·)andπmax(Β·) are the minimum and maximum eigenvalues of(Β·)over time, respectively. Given Corol- lary 4.5.1.1 and (4.10), the bound in (3.12) is proven. Noteπmax(πβ1) =1/πmin(π) and a sufficiently large value ofπmin(π)will make the RHS of (4.10) small.
Proof. Now consider the Lyapunov functionV given by
V =
"
π
Λ π
#> "
π 0
0 πβ1
# "
π
Λ π
#
(4.11)
This Lyapunov function has the derivative VΒ€ =2
"
π
Λ π
#> "
π 0
0 πβ1
# "
Β€ π
Β€Λ π
# +
"
π
Λ π
#> "
πΒ€ 0
0 π π‘π πβ1
# "
π
Λ π
#
(4.12)
=β2
"
π
Λ π
#>"
πΆ+πΎ π
βππ π>π β1π+π πβ1
# "
π
Λ π
# +2
"
π
Λ π
#>"
π
π>π β1πΒ―βπβ1ππβπβ1πΒ€
#
+
"
π
Λ π
#>"
πΒ€ 0
0 π π‘π πβ1
# "
π
Λ π
#
(4.13)
=β2
"
π
Λ π
#>"
πΎ π
βππ π>π β1π+π πβ1
# "
π
Λ π
# +2
"
π
Λ π
#> "
π
π>π β1πΒ―βπβ1ππβπβ1πΒ€
#
+
"
π
Λ π
#>"
0 0
0 2π πβ1βπβ1π πβ1+π>π β1π
# "
π
Λ π
#
(4.14)
=β
"
π
Λ π
#>"
2πΎ 0
0 π>π β1π+πβ1π πβ1
# "
π
Λ π
# +2
"
π
Λ π
#> "
π
π>π β1πΒ―βπβ1ππβπβ1πΒ€
#
(4.15) where we used the fact πΒ€ β2πΆ is skew-symmetric. AsπΎ, πβ1π πβ1, π, andπβ1 are all uniformly positive definite and uniformly bounded, and π>π β1π is positive semidefinite, there exists someπΌ >0such that
β
"
2πΎ 0
0 π>π β1π+πβ1π πβ1
#
β2πΌ
"
π 0
0 πβ1
#
(4.16) for allπ‘.
Define an upper bound for the disturbance termπ·as
π· =sup
π‘
"
π
π>π β1πΒ―βπβ1ππβπβ1πΒ€
#
(4.17) and define the functionM,
M =
"
π 0
0 πβ1
#
(4.18) By (4.16), the Cauchy-Schwartz inequality, and the definition of the minimum eigen- value, we have the following inequality forVΒ€:
V β€ β2Β€ πΌV +2 s
V
πmin(M)π· (4.19)
Consider the related systems,WwhereW = V,2WWΒ€ =VΒ€, and the following three equations hold
2WW β€ β2Β€ πΌW2+ 2π·W p
πmin(M) (4.20)
W β€ βΒ€ πΌW + π· p
πmin(M) (4.21)
By the Comparison Lemma [10],
β
V =W β€eβπΌπ‘ W (0) β π· πΌ
p
πmin(M)
!
+ π·
πΌ p
πmin(M) (4.22) and the stacked state exponentially converges to the ball
π‘ββlim
"
π
Λ π
#
β€ π·
πΌπmin(M) (4.23)
This completes the proof.
Next, we present a corollary which shows the exponential convergence ofπΛwhenπ is exponentially stable.
Corollary 4.5.1.1. Ifkπ (π‘) k β€ π΄exp(βπΌπ‘) +π΅/πΌfor some constants π΄, π΅, andπΌ, andπ =πΒ€Λ+ΞπΛ, then
kπΛk β€ eβπmin(Ξ)π‘kπΛ(0) k +
β« π‘ 0
eβπmin(Ξ) (π‘βπ)π΄eβπΌπdπ+
β« π‘ 0
eβπmin(Ξ) (π‘βπ)π΅ πΌdπ
(4.24) thuskπΛk exponentially approaches the bound
π‘ββlim kπΛk β€ π΅
πΌπmin(Ξ) (4.25)
Proof. From the Comparison Lemma [10], we can easily show (4.24). This can be further reduced as follows.
kπΛk β€ eβπmin(Ξ)π‘kπΛ(0) k +π΄eβπmin(Ξ)π‘
β« π‘
0
e(πmin(Ξ)βπΌ)πdπ+
β« π‘
0
eβπmin(Ξ) (π‘βπ)π΅ πΌdπ (4.26)
β€ eβπmin(Ξ)π‘kπΛ(0) k +π΄
eβπΌπ‘βeβπmin(Ξ)π‘ πmin(Ξ) βπΌ
+ π΅
1βeβπmin(Ξ)π‘
πΌπmin(Ξ) (4.27)
Taking the limit, we arrive at (4.25)
With the following corollary, we will justify that πΌ is strictly positive even when π β‘ 0, and thus the adaptive control algorithm guarantees robustness even in the absence of persistent excitation or with ineffective learning. In practice, we expect some measurement information about all the elements of π, that is, we expect a non-zeroπ.
Corollary 4.5.1.2. Ifπβ‘ 0, then the bound in (4.10) can be simplified to lim
π‘ββ
"
π
Λ π
#
β€ supkπk +πmax(πβ1)sup( kππ+ Β€πk)
min(π, πmin(πΎ)/πmax(π))πmin(M) (4.28) Proof. Assumingπβ‘0immediately leads toπΌof
πΌ=min 1
2πmin(πβ1π),
πmin(πΎ) πmax(π)
(4.29) π β‘ 0also simplifies the πΒ€ equation to a stable first-order differential matrix equa- tion. By integrating this simplified πΒ€ equation, we can show πexponentially con- verges to the valueπ = 2π
π. This leads to bound in (4.28).
We now introduce another corollary for the Neural-Fly-Constant, when π = πΌ. In this case, the regularization term is not needed, as it is intended to regularize the linear coefficient estimate in the absence of persistent excitation, so we setπ = 0. This corollary also shows that Neural-Fly-Constant is sufficient for perfect tracking control when π is constant; though in this case, even the nonlinear baseline controller with integral control will converge to perfect tracking. In practice for quadrotors, we only expect π to be constant when the drone air-velocity is constant, such as in hover or steady level flight with constant wind velocity.
Corollary 4.5.1.3. If π β‘ πΌ,π = π πΌ, π =π πΌ,π = 0, andπ(0) = π0πΌ is diagonal, whereπ,πandπ0are strictly positive scalar constants, then the bound in (4.10) can be simplified to
lim
π‘ββ
"
π
Λ π
#
β€ 1+πβ1
supπ‘kπ βπk+π/π
πmax(π)
πmin(πΎ)πmin(M) (4.30) Proof. Under these assumptions, the matrix differential equation forπis reduced to the scalar differential equation
π π π π‘
=πβ π2/π (4.31)
where π(π‘) = π(π‘)πΌ. This equation can be integrated to find that π exponentially converges to π = β
ππ. Then by (4.16), πΌ β€ p
π/π and πΌ β€ πmin(πΎ)/πmax(π). If we choose π andπ such that p
π/π = πmin(πΎ)/πmax(π), then we can takeπΌ = πmin(πΎ)/πmax(π). Then, the error bound reduces to
lim
π‘ββ
"
π
Λ π
#
β€ π· πmax(π)
πmin(πΎ)πmin(M) (4.32) Takeπas a constant. ThenπΒ€ =0,π = π βπ, andπ·is bounded by
π· β€
1+πβ1
sup
π‘
kπ βπk +π/π (4.33)