APPLICATION β ACTUATED SLIP
6.3 Direct Approach via Backstepping Barrier Functions
(e-x) (e-y) (c)
0 2 4 6 8
0.7 0.8 0.9
(a) (b)
DSP
(d)
P1 P2
Figure 6.8: An example of the generated 3D periodic walking: (a-b) horizontal trajectories of the mass vs time (the red indicates the velocities in the DSP), (c) the vertical trajectory of the mas, (d) the leg length tracking (the red and blue are the desired and actualπΏ, respectively), and (e) the phase portraits of the mass (blue) in the π₯β π§ plane (e-x) and π¦βπ§ plane (e-y) comparing to the desired orbits of the H-LIP (black).
from the stepping controller perturbs the stepping-in-place walking into desired 3D walking. Fig. 6.8 shows the simulation result. The aSLIP starts to execute the forward walking atπ‘ =3s. As indicated in the phase plot in Fig. 6.8 (d1-d2), the translational states of the aSLIP converge closely to the desired P1 and P2 orbits of the H-LIP in each plane. 3D walking is thus generated on the aSLIP.
COM state of the aSLIP, and then present the results to realize the aSLIP walking via the H-LIP based synthesis. The periodic vertical trajectory of the swing foot π§sw = π§β πΏswcos(π½sw) is controlled via a feedback linearizing controller. Similar to that on AMBER in Chapter 5, its desired trajectory is simply constructed to be π§des
sw =π§max
sw cos(ππ‘
SSP
πβ 1
2π), whereπ§max
sw > 0 is chosen to avoid scuffing.
Backstepping-Control Lyapunov Function for Vertical Stabilization
In this part, we present the dynamics structure of the vertical mass state in each domain. Then, we show the canonical Lyapunov backstepping that guarantees to stabilize this class of control problem. Lastly, we formulate a control Lyapunov backstepping, which yields an inequality condition for the control and opens oppor- tunities to include extra constraints on the input in an optimization.
Strict-feedback Form of the Vertical State: The objective is to drive the vertical position of the mass to follow a desired trajectory. We define the output as
π :=
"
π§βπ§des(π‘)
Β€
π§β Β€π§des(π‘)
#
, (6.21)
where [π§des(π‘),π§Β€des(π‘)] is the desired vertical trajectory for the point mass of the aSLIP to follow. For walking on flat ground,π§desis a constant. Differentiating the output yields the output dynamics:
Β€
π = ππ+πππΉSSP/DSP
π§ , (6.22)
ππ :=
"
Β€ π§β Β€π§des
βgβ Β₯π§des
#
, ππ :=
"
0
1 π
#
. (6.23)
πΉSSP/DSP
π§ is the net vertical force in each domain:
SSP: πΉSSP
π§ =πΉπ cos(π½π ), (6.24)
DSP: πΉDSP
π§ =πΉπ
1cos(π½π
1) +πΉπ
2cos(π½π
2), (6.25)
where the stance leg is denoted by the subscript π . Recall π½ is the leg angle. We view πΉSSP/DSP
π§ as the fictitious input to this system in Eq. (6.22). Differentiating πΉSSP/DSP
π§ yields the affine dynamics with the actual actuation of the leg length as the input. The dynamics are different in the SSP and DSP:
πΉΒ€SSP
π§ = πSSP
π§ +ππ§πSSP
π§ , (6.26)
πΉΒ€DSP
π§ = πDSP
π§ +ππ§πDSP
π§ , (6.27)
whereππ§ =Dβ 0, and, πSSP
π§ :=cos(π½π )ππ , (6.28)
πDSP
π§ :=cos(π½π
1)ππ
1+cos(π½π
2)ππ
2. (6.29)
The expressions of πSSP
π§ and πDSP
π§ are omitted. πSSP/DSP
π§ represents the vertical component of the leg length actuation. The leg angles π½ β (βπ
2, π
2), thus given a desiredπSSP/DSP
π§ , there always exists leg length actuation for realization. Note that the desiredπDSP
π§ is not uniquely realized by the leg length actuationππ
1andππ
2. Note that here we useππ represent the leg length actuation on the stance leg in the SSP, and thusππ
1 andππ
2 represent the leg length actuation on the two stance legs in the DSP.
As a result, for walking in both domains, we can synthesizeππ§ β Rto stabilize the vertical trajectory of the mass based on the dynamics in the strict-feedback form [66]:
Β€
π= ππ+πππΉπ§, πΉΒ€π§ = ππ§+ππ§ππ§.
(6.30) (6.31) Lyapunov Backstepping: Lyapunov Backstepping [67] can be applied to stabilize the dynamics in Eq. (6.30) and (6.31). For the dynamics in Eq. (6.30) with πΉπ§ being the input, a feedback linearization controller can be synthesized:
Β― πΉπ§ = 1
ππ
2
(βππ
2+πΎIOπ), (6.32)
whereπΎIO = [πΎπ, πΎπ] is the linear feedback gain and the subscript2indicates the second element of the vector. This yields the linear closed-loop dynamics:
Β€
π = ππ+πππΉΒ―π§ =
"
0 1
πΎπ πΎπ
#
π := π΄πππ. (6.33)
Thus πΎIO is chosen with πΎπ < 0, πΎπ < 0 so that π΄ππ is stable (with negative eigenvalues). On the closed-loop dynamics, a Lyapunov function can be found:
ππ = ππππ, with π > 0 (being positive definite). π satisfies the continuous-time Lyapunov functionπ π΄ππ+π΄π
πππ =βπ, whereπ >0 is selected. It is easy to verify that:
πΒ€π =βπππ π β€ βπmin(π) ||π||2, (6.34) whereπmin(π)is the smallest eigenvalue of the matrixπ.
To synthesize the actual control fromππ§ to stabilizeπβ0, we define
πΉπΏ :=πΉπ§βπΉΒ―π§, (6.35)
and the Lyapunov function to be
π(π, πΉπ§) =ππ+ 1 2πΉ2
πΏ. (6.36)
Differentiating this yields πΒ€(π, πΉπ§) =
πππ
π π
(ππ+πππΉπ§) +πΉπΏπΉΒ€πΏ
=
πππ
π π
(ππ+πππΉΒ―π§) + πππ
π π
πππΉπΏ +πΉπΏπΉΒ€πΏ
=πΒ€π+ πππ
π π
πππΉπΏ+πΉπΏπΉΒ€πΏ (6.37)
β€ βπmin(π) ||π||2+
πππ
π π
πππΉπΏ+πΉπΏπΉΒ€πΏ. (6.38) If we choose
πΉΒ€πΏ =β
πππ
π π
ππβπ πΉπΏ, (6.39)
withπ > 0, then
πΒ€(π, πΉπ§) β€ βπmin(π) ||π||2βπ πΉ2
πΏ
β€ βmin(πmin(π), π) || [ππ, πΉπΏ]π||2. (6.40) By Lyapunovβs method, the system with (π, πΉπΏ) as states is exponentially stable to the origin(π, πΉπΏ)= (0,0). This provides a closed-formBackstepping controlleron ππ§ from Eq. (6.39). SinceπΉΒ€πΏ = ππ§+ππ§ππ§β Β€πΉΒ―π§,the controller is
ππ§ = 1 ππ§
(β
πππ
π π
ππβπ πΉπΏ+ Β€πΉΒ―π§β ππ§). (6.41) Backstepping-CLF: The closed-form controller in Eq. (6.41) appears to be able to stabilize π β 0. However, the resultant leg force πΉπ§ can be negative, which is not valid for walking. Here we develop the control Lyapunov function (CLF) of the backstepping to provide aninequality conditionon the inputππ§for stabilizingπ. The inequality motivates an optimization-based controller, in which the condition of non-negative leg forcing can be enforced via additional constraints, e.g., the control barrier function (CBF) in the next part.
Note thatπΒ€ is an affine function w.r.t. the inputππ§: πΒ€(π, πΉπ§)=πΒ€π+
πππ
π π
πππΉπΏ+πΉπΏ(ππ§+ππ§ππ§ β Β€πΉΒ―π§)
=πΒ€π+
πππ
π π
πππΉπΏ+πΉπΏ(ππ§β Β€πΉΒ―π§) +πΉπΏππ§ππ§. The exponential stability can be established via enforcing
πΒ€(π, πΉπ§) β€ βπΎπ(π, πΉπ§), (6.42) with some chosenπΎ >0. This yields aBackstepping-CLF inequality:
π΄Backstepping
CLF ππ§ β€ πBackstepping
CLF , (6.43)
whereπ΄Backstepping
CLF :=πΉπΏππ§, πBackstepping
CLF :=β Β€ππβπππ
π π πππΉπΏβπΉπΏ(ππ§β Β€πΉΒ―π§) βπΎπ .When πΉπΏ β 0, Eq. (6.43) is a constraint on ππ§. When πΉπΏ = 0, the inequality becomes πΒ€π β€ βπΎπ = βπΎππ, which is automatically satisfied as long as 0 β€ πΎ β€ πmin(π)
πmax(π). As a result, as long asππ§satisfies the backstepping-CLF inequality,πexponentially converges to 0. Note that this inequality is an affine condition onππ in SSP or ππ 1 andππ 2in DSP as indicated by Eq. (6.28). Thus, in the next part, we will formulate quadratic program (QP) based controllers that include the inequality in Eq. (6.43) with the incorporation of the control barrier functions.
Control Barrier Functions for Walking
In the application of walking, the leg forces should be positive during contact.
Moreover, in the DSP, one leg force should gracefully cross 0 to initiate lift-off.
These conditions can be described via sets and thus be enforced via control barrier function (CBF) with an inequality condition that guarantees set invariance on the dynamics. We start by introducing the CBF, show the application for the walking of the aSLIP, and finally integrate it with the Backstepping-CLF to formulate the final backstepping-barrier function based quadratic program (BBF-QP) controllers for walking.
Control Barrier Functions:The control barrier function [11] describes a condition for control input that guarantees for set invariance. We consider a super level set C of a continuously differentiable scalar function β : Rπ β R. By definition:
C = {π₯ β Rπ|β(π₯) β₯ 0}. Here we useπ₯ for a general state representation, instead of the horizontal position of the aSLIP. βis acontrol barrier functionfor the affine control systemπ₯Β€ = π(π₯) +π(π₯)π’if
sup
π’βπ
[Lπβ(π₯) + Lπβ(π₯)π’+πΌ(β(π₯))] β₯0, (6.44)
whereLπβ(π₯) = π βπ π₯ π(π₯) and Lπβ(π₯) = π βπ π₯π(π₯) are the Lie derivatives; Lπβ(π₯) + Lπβ(π₯) = βΒ€(π₯). π is the set where the inputπ’is in, andπΌ(Β·) is an extended class Kβ function1. This condition indicates that there exists an input to stabilize the set C, i.e., making sureβ(π₯) β₯0. If the state is inC, it will stay in the set forever if the CBF inequalityis satisfied:
Lπβ(π₯) + Lπβ(π₯)π’ β₯ βπΌ(β(π₯)). (6.45) This makes sure that the lower bound of the derivative βΒ€ is increasing with the decrease of β. It can be proven that the set C is exponentially stable under this condition [11].
CBF for aSLIP Walking: Eq. (6.45) represents an inequality constraint on the input to make sure thatβ β₯ 0, for which βis defined differently for the walking in the SSP and the DSP.
SSP:The stance leg force πΉπ should be non-negative, so is its vertical component πΉSSP
π§ . Thus, we let
βπ =πΉSSP
π§ , βΒ€π β₯ βπΌ(βπ ), (6.46) which provides an inequality on the inputππ :
π΄π
CBFππ β€ ππ
CBF, (6.47)
π΄π
CBF :=βππ§cos(π½π ), (6.48) ππ
CBF := πSSP
π§ +πΌ(βπ ), (6.49)
where we simply selectπΌ(Β·)to be a linear function, i.e.,πΌ(βπ ) =πΌ βπ This inequality naturally fits with the Backstepping-CLF inequality in Eq. (6.43) to formulate a backstepping-barrier function based quadratic program (BBF-QP) controller:
(ππ , πΏ)= argmin
(ππ ,πΏ)βR2
π2
π +πΏ2 s.t. π΄Backstepping
CLF πSSP
π§ β€ πBackstepping
CLF +πΏ,
π΄π
CBFππ β€ ππ
CBF,
(6.50)
whereπΏis a relaxation variable to avoid infeasibility, similar to that in the CLF-QP in Chapter 2. In the case when the CBF constraint violates the Backstepping-CLF constraint, the Backstepping-CLF constraint is relaxed and the CBF constraint is still enforced.
1πΌ:RβR,πΌ(0)=0 andπΌis strictly monotonically increasing.
DSP:Both leg forces should remain non-negative. The leg force onπ 2should remain non-negative through out the DSP. Thusβπ
2 =πΉπ
2 and the CBF inequality is π΄π 2
CBFππ
2 β€ ππ 2
CBF. (6.51)
The leg force on π 1 should gradually decrease and reach to zero to trigger the transition into the next SSP. LetπΉ0
π 1 be the leg force on π 1 in the beginning of the DSP. A desired leg force trajectory can be designed: πΉπ
π 1(π‘) = πΉ0
π 1(1β π‘
πDSP). One may consider to design a feedback controller to drive πΉπ
1 β πΉπ
π 1. However, this creates high restriction onππ
1and can lead to conflict between the Backstepping-CLF inequality in Eq. (6.43) and the CBF inequality in Eq. (6.51).
To decreaseπΉπ
1in a relaxed fashion, we create the inequality condition: (1βπ)πΉπ
π 1 β€ πΉπ
1 β€ (1+π)πΉπ
π 1, where π β (0,1) is a relaxation coefficient. As shown in Fig.
6.9, this generates an admissible force region (indicated by the blue region), which decreases as the desired force πΉπ
π 1 decreases with time. This two inequalities can be converted into a single inequality: βπ
1 = (π πΉπ
π 1)2β (πΉπ
1 β πΉπ
π 1)2 β₯ 0. Note that this barrier function is ill-defined asπΉπ
π 1 approaches to 0 (the setCbecomes trivial).
Thus we increase the relaxation by adding a positive valueΞπΉ in the inequality:
(1βπ)πΉπ
π 1 βΞπΉ β€ πΉπ
1 β€ (1+π)πΉπ
π 1 +ΞπΉ , (6.52) which generates the red admissible region. By defining
βπ
1 =(π πΉπ
π 1 +ΞπΉ)2β (πΉπ
1 βπΉπ
π 1)2 β₯ 0, (6.53) the setCis always non-trivial before lift-off. Thus we have another CBF inequality:
π΄π 1
CBFππ
1 β€ ππ 1
CBF.
Similarly, the two CBF inequalities are incorporated with the Backstepping-CLF inequality to formulate the final BBF-QP controller for the walking in DSP:
(ππ
1, ππ
2, πΏ) = argmin
(ππ
1,ππ
2,πΏ)βR3
π2
π 1+π2
π 2+πΏ2 s.t. π΄Backstepping
CLF πDSP
π§ β€ πBackstepping
CLF +πΏ
π΄π 1
CBFππ
1 β€ ππ 1
CBF, π΄π 2
CBFππ
2 β€ ππ 2
CBF.
(6.54)
Despite the complexity in the derivation, the BBF-QPs are designed simply to stabilize the vertical position of the mass to the desired trajectory and simultaneously satisfy the conditions on the leg forces during walking.
Figure 6.9: Contact force condition for lift-off in the DSP (the red region represents the admissible region for the leg force).
Results
The control procedure with the chosen parameters is presented in Algorithm 2. The stepping gain πΎ in Eq. (3.17) is chosen to be the deadbeat gain. The BBF-QP controller is solved at 1kHz. The aSLIP starts from an initial static configuration and walks to a desired pre-impact velocity π£β, which is chosen based on the orbits of the H-LIP. The aSLIP parameters are chosen to match with the robot Cassie1. A video of the results can be seen inhttps://youtu.be/fUZu6y-Gu4g.
Algorithm 2BBF-QP with H-LIP based Walking Synthesis
Initialization: Behavior: π§0 = 1m,πSSP = 0.4s, πDSP = 0.1s. Control: πΌ = 500, πΎ =10,π =10, π=0.5,ΞπΉ =20.
whileSimulation/Control loopdo if SSPthen
Desired step sizeβH-LIP stepping in Eq. (3.17).
Desired swing foot positionβEq. (6.20).
πβBBF-QP in Eq. (6.50) else
πβBBF-QP in Eq. (6.54) end if
end while
We evaluate the approach for walking on flat ground to different desired velocities.
Fig. 6.10 shows the results. The aSLIP converges closely to the desired walking of the H-LIP. The leg forces behave as expected. Fig. 6.10 (c) shows the trajectories of the horizontal velocity, which are not constant in the DSP. This contributes to the errorπ€. We numerically calculate all π€ for different walking simulations and
1π=33kg, and K=8000N/m, D=100Ns/m, which are the nonlinear leg spring parameters of Cassie atπΏβ1m; see Appendix B. The spring is chosen to be linear for generality.
DSP 0.80.5 0.2
(a)
(c) (d)
πΈ π
HβLIP
AdmissibleΒ Region
(b)
π
π£
π
ππ£
Figure 6.10: Walking using direct synthesis via BBF-QP: (a) the phase portrait of the horizontal state trajectory (blue) in the SSP for walking withπ£β =0.5m/s, and the black is the corresponding orbit of the H-LIP (green lines are the orbital lines, (b) the leg forces during walking with the gray region being the admissible region in the DSP; (c) the walking velocity and (d) the trajectories of the discrete error state eπ = [ππ, ππ£]π of walking with differentπ£β =0.2,0.5,0.8m/s (indicated by dashed lines in (c)).
inner approximateπ by a square. Since(π΄+π΅πΎ)2=0,πΈ =π β (π΄+π΅πΎ)π; we get an inner approximation ofπΈ (shown in Fig. 6.10 (d)), and all the error statese are inside πΈ, which validates the proposed approach of the H-LIP based stepping in Chapter 3. It is also important to note that, similar to the indirect synthesis on 3D-aSLIP, the direct synthesis via BBF-QP can be readily applied to 3D-aSLIP with two orthogonal planar stepping stabilizations via the H-LIP.