• Tidak ada hasil yang ditemukan

Direct Approach via Backstepping Barrier Functions

APPLICATION βŠƒ ACTUATED SLIP

6.3 Direct Approach via Backstepping Barrier Functions

(e-x) (e-y) (c)

0 2 4 6 8

0.7 0.8 0.9

(a) (b)

DSP

(d)

P1 P2

Figure 6.8: An example of the generated 3D periodic walking: (a-b) horizontal trajectories of the mass vs time (the red indicates the velocities in the DSP), (c) the vertical trajectory of the mas, (d) the leg length tracking (the red and blue are the desired and actual𝐿, respectively), and (e) the phase portraits of the mass (blue) in the π‘₯βˆ’ 𝑧 plane (e-x) and π‘¦βˆ’π‘§ plane (e-y) comparing to the desired orbits of the H-LIP (black).

from the stepping controller perturbs the stepping-in-place walking into desired 3D walking. Fig. 6.8 shows the simulation result. The aSLIP starts to execute the forward walking at𝑑 =3s. As indicated in the phase plot in Fig. 6.8 (d1-d2), the translational states of the aSLIP converge closely to the desired P1 and P2 orbits of the H-LIP in each plane. 3D walking is thus generated on the aSLIP.

COM state of the aSLIP, and then present the results to realize the aSLIP walking via the H-LIP based synthesis. The periodic vertical trajectory of the swing foot 𝑧sw = π‘§βˆ’ 𝐿swcos(𝛽sw) is controlled via a feedback linearizing controller. Similar to that on AMBER in Chapter 5, its desired trajectory is simply constructed to be 𝑧des

sw =𝑧max

sw cos(𝑇𝑑

SSP

πœ‹βˆ’ 1

2πœ‹), where𝑧max

sw > 0 is chosen to avoid scuffing.

Backstepping-Control Lyapunov Function for Vertical Stabilization

In this part, we present the dynamics structure of the vertical mass state in each domain. Then, we show the canonical Lyapunov backstepping that guarantees to stabilize this class of control problem. Lastly, we formulate a control Lyapunov backstepping, which yields an inequality condition for the control and opens oppor- tunities to include extra constraints on the input in an optimization.

Strict-feedback Form of the Vertical State: The objective is to drive the vertical position of the mass to follow a desired trajectory. We define the output as

πœ‚ :=

"

π‘§βˆ’π‘§des(𝑑)

Β€

π‘§βˆ’ €𝑧des(𝑑)

#

, (6.21)

where [𝑧des(𝑑),𝑧€des(𝑑)] is the desired vertical trajectory for the point mass of the aSLIP to follow. For walking on flat ground,𝑧desis a constant. Differentiating the output yields the output dynamics:

Β€

πœ‚ = π‘“πœ‚+π‘”πœ‚πΉSSP/DSP

𝑧 , (6.22)

π‘“πœ‚ :=

"

Β€ π‘§βˆ’ €𝑧des

βˆ’gβˆ’ Β₯𝑧des

#

, π‘”πœ‚ :=

"

0

1 π‘š

#

. (6.23)

𝐹SSP/DSP

𝑧 is the net vertical force in each domain:

SSP: 𝐹SSP

𝑧 =𝐹𝑠cos(𝛽𝑠), (6.24)

DSP: 𝐹DSP

𝑧 =𝐹𝑠

1cos(𝛽𝑠

1) +𝐹𝑠

2cos(𝛽𝑠

2), (6.25)

where the stance leg is denoted by the subscript 𝑠. Recall 𝛽 is the leg angle. We view 𝐹SSP/DSP

𝑧 as the fictitious input to this system in Eq. (6.22). Differentiating 𝐹SSP/DSP

𝑧 yields the affine dynamics with the actual actuation of the leg length as the input. The dynamics are different in the SSP and DSP:

𝐹€SSP

𝑧 = 𝑓SSP

𝑧 +π‘”π‘§πœSSP

𝑧 , (6.26)

𝐹€DSP

𝑧 = 𝑓DSP

𝑧 +π‘”π‘§πœDSP

𝑧 , (6.27)

where𝑔𝑧 =Dβ‰ 0, and, 𝜏SSP

𝑧 :=cos(𝛽𝑠)πœπ‘ , (6.28)

𝜏DSP

𝑧 :=cos(𝛽𝑠

1)πœπ‘ 

1+cos(𝛽𝑠

2)πœπ‘ 

2. (6.29)

The expressions of 𝑓SSP

𝑧 and 𝑓DSP

𝑧 are omitted. 𝜏SSP/DSP

𝑧 represents the vertical component of the leg length actuation. The leg angles 𝛽 ∈ (βˆ’πœ‹

2, πœ‹

2), thus given a desired𝜏SSP/DSP

𝑧 , there always exists leg length actuation for realization. Note that the desired𝜏DSP

𝑧 is not uniquely realized by the leg length actuationπœπ‘ 

1andπœπ‘ 

2. Note that here we useπœπ‘  represent the leg length actuation on the stance leg in the SSP, and thusπœπ‘ 

1 andπœπ‘ 

2 represent the leg length actuation on the two stance legs in the DSP.

As a result, for walking in both domains, we can synthesizeπœπ‘§ ∈ Rto stabilize the vertical trajectory of the mass based on the dynamics in the strict-feedback form [66]:

Β€

πœ‚= π‘“πœ‚+π‘”πœ‚πΉπ‘§, 𝐹€𝑧 = 𝑓𝑧+π‘”π‘§πœπ‘§.

(6.30) (6.31) Lyapunov Backstepping: Lyapunov Backstepping [67] can be applied to stabilize the dynamics in Eq. (6.30) and (6.31). For the dynamics in Eq. (6.30) with 𝐹𝑧 being the input, a feedback linearization controller can be synthesized:

Β― 𝐹𝑧 = 1

π‘”πœ‚

2

(βˆ’π‘“πœ‚

2+𝐾IOπœ‚), (6.32)

where𝐾IO = [𝐾𝑝, 𝐾𝑑] is the linear feedback gain and the subscript2indicates the second element of the vector. This yields the linear closed-loop dynamics:

Β€

πœ‚ = π‘“πœ‚+π‘”πœ‚πΉΒ―π‘§ =

"

0 1

𝐾𝑝 𝐾𝑑

#

πœ‚ := π΄π‘π‘™πœ‚. (6.33)

Thus 𝐾IO is chosen with 𝐾𝑝 < 0, 𝐾𝑑 < 0 so that 𝐴𝑐𝑙 is stable (with negative eigenvalues). On the closed-loop dynamics, a Lyapunov function can be found:

π‘‰πœ‚ = πœ‚π‘‡π‘ƒπœ‚, with 𝑃 > 0 (being positive definite). 𝑃 satisfies the continuous-time Lyapunov function𝑃 𝐴𝑐𝑙+𝐴𝑇

𝑐𝑙𝑃 =βˆ’π‘„, where𝑄 >0 is selected. It is easy to verify that:

π‘‰Β€πœ‚ =βˆ’πœ‚π‘‡π‘„ πœ‚ ≀ βˆ’πœ†min(𝑄) ||πœ‚||2, (6.34) whereπœ†min(𝑄)is the smallest eigenvalue of the matrix𝑄.

To synthesize the actual control fromπœπ‘§ to stabilizeπœ‚β†’0, we define

𝐹𝛿 :=πΉπ‘§βˆ’πΉΒ―π‘§, (6.35)

and the Lyapunov function to be

𝑉(πœ‚, 𝐹𝑧) =π‘‰πœ‚+ 1 2𝐹2

𝛿. (6.36)

Differentiating this yields 𝑉€(πœ‚, 𝐹𝑧) =

πœ•π‘‰πœ‚

πœ• πœ‚

(π‘“πœ‚+π‘”πœ‚πΉπ‘§) +𝐹𝛿𝐹€𝛿

=

πœ•π‘‰πœ‚

πœ• πœ‚

(π‘“πœ‚+π‘”πœ‚πΉΒ―π‘§) + πœ•π‘‰πœ‚

πœ• πœ‚

π‘”πœ‚πΉπ›Ώ +𝐹𝛿𝐹€𝛿

=π‘‰Β€πœ‚+ πœ•π‘‰πœ‚

πœ• πœ‚

π‘”πœ‚πΉπ›Ώ+𝐹𝛿𝐹€𝛿 (6.37)

≀ βˆ’πœ†min(𝑄) ||πœ‚||2+

πœ•π‘‰πœ‚

πœ• πœ‚

π‘”πœ‚πΉπ›Ώ+𝐹𝛿𝐹€𝛿. (6.38) If we choose

𝐹€𝛿 =βˆ’

πœ•π‘‰πœ‚

πœ• πœ‚

π‘”πœ‚βˆ’π‘˜ 𝐹𝛿, (6.39)

withπ‘˜ > 0, then

𝑉€(πœ‚, 𝐹𝑧) ≀ βˆ’πœ†min(𝑄) ||πœ‚||2βˆ’π‘˜ 𝐹2

𝛿

≀ βˆ’min(πœ†min(𝑄), π‘˜) || [πœ‚π‘‡, 𝐹𝛿]𝑇||2. (6.40) By Lyapunov’s method, the system with (πœ‚, 𝐹𝛿) as states is exponentially stable to the origin(πœ‚, 𝐹𝛿)= (0,0). This provides a closed-formBackstepping controlleron πœπ‘§ from Eq. (6.39). Since𝐹€𝛿 = 𝑓𝑧+π‘”π‘§πœπ‘§βˆ’ €𝐹¯𝑧,the controller is

πœπ‘§ = 1 𝑔𝑧

(βˆ’

πœ•π‘‰πœ‚

πœ• πœ‚

π‘”πœ‚βˆ’π‘˜ 𝐹𝛿+ Β€πΉΒ―π‘§βˆ’ 𝑓𝑧). (6.41) Backstepping-CLF: The closed-form controller in Eq. (6.41) appears to be able to stabilize πœ‚ β†’ 0. However, the resultant leg force 𝐹𝑧 can be negative, which is not valid for walking. Here we develop the control Lyapunov function (CLF) of the backstepping to provide aninequality conditionon the inputπœπ‘§for stabilizingπœ‚. The inequality motivates an optimization-based controller, in which the condition of non-negative leg forcing can be enforced via additional constraints, e.g., the control barrier function (CBF) in the next part.

Note that𝑉€ is an affine function w.r.t. the inputπœπ‘§: 𝑉€(πœ‚, 𝐹𝑧)=π‘‰Β€πœ‚+

πœ•π‘‰πœ‚

πœ• πœ‚

π‘”πœ‚πΉπ›Ώ+𝐹𝛿(𝑓𝑧+π‘”π‘§πœπ‘§ βˆ’ €𝐹¯𝑧)

=π‘‰Β€πœ‚+

πœ•π‘‰πœ‚

πœ• πœ‚

π‘”πœ‚πΉπ›Ώ+𝐹𝛿(π‘“π‘§βˆ’ €𝐹¯𝑧) +πΉπ›Ώπ‘”π‘§πœπ‘§. The exponential stability can be established via enforcing

𝑉€(πœ‚, 𝐹𝑧) ≀ βˆ’π›Ύπ‘‰(πœ‚, 𝐹𝑧), (6.42) with some chosen𝛾 >0. This yields aBackstepping-CLF inequality:

𝐴Backstepping

CLF πœπ‘§ ≀ 𝑏Backstepping

CLF , (6.43)

where𝐴Backstepping

CLF :=𝐹𝛿𝑔𝑧, 𝑏Backstepping

CLF :=βˆ’ Β€π‘‰πœ‚βˆ’πœ•π‘‰πœ‚

πœ• πœ‚ π‘”πœ‚πΉπ›Ώβˆ’πΉπ›Ώ(π‘“π‘§βˆ’ €𝐹¯𝑧) βˆ’π›Ύπ‘‰ .When 𝐹𝛿 β‰  0, Eq. (6.43) is a constraint on πœπ‘§. When 𝐹𝛿 = 0, the inequality becomes π‘‰Β€πœ‚ ≀ βˆ’π›Ύπ‘‰ = βˆ’π›Ύπ‘‰πœ‚, which is automatically satisfied as long as 0 ≀ 𝛾 ≀ πœ†min(𝑄)

πœ†max(𝑃). As a result, as long asπœπ‘§satisfies the backstepping-CLF inequality,πœ‚exponentially converges to 0. Note that this inequality is an affine condition onπœπ‘  in SSP or πœπ‘ 1 andπœπ‘ 2in DSP as indicated by Eq. (6.28). Thus, in the next part, we will formulate quadratic program (QP) based controllers that include the inequality in Eq. (6.43) with the incorporation of the control barrier functions.

Control Barrier Functions for Walking

In the application of walking, the leg forces should be positive during contact.

Moreover, in the DSP, one leg force should gracefully cross 0 to initiate lift-off.

These conditions can be described via sets and thus be enforced via control barrier function (CBF) with an inequality condition that guarantees set invariance on the dynamics. We start by introducing the CBF, show the application for the walking of the aSLIP, and finally integrate it with the Backstepping-CLF to formulate the final backstepping-barrier function based quadratic program (BBF-QP) controllers for walking.

Control Barrier Functions:The control barrier function [11] describes a condition for control input that guarantees for set invariance. We consider a super level set C of a continuously differentiable scalar function β„Ž : R𝑛 β†’ R. By definition:

C = {π‘₯ ∈ R𝑛|β„Ž(π‘₯) β‰₯ 0}. Here we useπ‘₯ for a general state representation, instead of the horizontal position of the aSLIP. β„Žis acontrol barrier functionfor the affine control systemπ‘₯Β€ = 𝑓(π‘₯) +𝑔(π‘₯)𝑒if

sup

π‘’βˆˆπ‘ˆ

[Lπ‘“β„Ž(π‘₯) + Lπ‘”β„Ž(π‘₯)𝑒+𝛼(β„Ž(π‘₯))] β‰₯0, (6.44)

whereLπ‘“β„Ž(π‘₯) = πœ• β„Žπœ• π‘₯ 𝑓(π‘₯) and Lπ‘”β„Ž(π‘₯) = πœ• β„Žπœ• π‘₯𝑔(π‘₯) are the Lie derivatives; Lπ‘“β„Ž(π‘₯) + Lπ‘”β„Ž(π‘₯) = β„ŽΒ€(π‘₯). π‘ˆ is the set where the input𝑒is in, and𝛼(Β·) is an extended class K∞ function1. This condition indicates that there exists an input to stabilize the set C, i.e., making sureβ„Ž(π‘₯) β‰₯0. If the state is inC, it will stay in the set forever if the CBF inequalityis satisfied:

Lπ‘“β„Ž(π‘₯) + Lπ‘”β„Ž(π‘₯)𝑒 β‰₯ βˆ’π›Ό(β„Ž(π‘₯)). (6.45) This makes sure that the lower bound of the derivative β„ŽΒ€ is increasing with the decrease of β„Ž. It can be proven that the set C is exponentially stable under this condition [11].

CBF for aSLIP Walking: Eq. (6.45) represents an inequality constraint on the input to make sure thatβ„Ž β‰₯ 0, for which β„Žis defined differently for the walking in the SSP and the DSP.

SSP:The stance leg force 𝐹𝑠 should be non-negative, so is its vertical component 𝐹SSP

𝑧 . Thus, we let

β„Žπ‘  =𝐹SSP

𝑧 , β„ŽΒ€π‘  β‰₯ βˆ’π›Ό(β„Žπ‘ ), (6.46) which provides an inequality on the inputπœπ‘ :

𝐴𝑠

CBFπœπ‘  ≀ 𝑏𝑠

CBF, (6.47)

𝐴𝑠

CBF :=βˆ’π‘”π‘§cos(𝛽𝑠), (6.48) 𝑏𝑠

CBF := 𝑓SSP

𝑧 +𝛼(β„Žπ‘ ), (6.49)

where we simply select𝛼(Β·)to be a linear function, i.e.,𝛼(β„Žπ‘ ) =𝛼 β„Žπ‘ This inequality naturally fits with the Backstepping-CLF inequality in Eq. (6.43) to formulate a backstepping-barrier function based quadratic program (BBF-QP) controller:

(πœπ‘ , 𝛿)= argmin

(πœπ‘ ,𝛿)∈R2

𝜏2

𝑠 +𝛿2 s.t. 𝐴Backstepping

CLF 𝜏SSP

𝑧 ≀ 𝑏Backstepping

CLF +𝛿,

𝐴𝑠

CBFπœπ‘  ≀ 𝑏𝑠

CBF,

(6.50)

where𝛿is a relaxation variable to avoid infeasibility, similar to that in the CLF-QP in Chapter 2. In the case when the CBF constraint violates the Backstepping-CLF constraint, the Backstepping-CLF constraint is relaxed and the CBF constraint is still enforced.

1𝛼:Rβ†’R,𝛼(0)=0 and𝛼is strictly monotonically increasing.

DSP:Both leg forces should remain non-negative. The leg force on𝑠2should remain non-negative through out the DSP. Thusβ„Žπ‘ 

2 =𝐹𝑠

2 and the CBF inequality is 𝐴𝑠2

CBFπœπ‘ 

2 ≀ 𝑏𝑠2

CBF. (6.51)

The leg force on 𝑠1 should gradually decrease and reach to zero to trigger the transition into the next SSP. Let𝐹0

𝑠1 be the leg force on 𝑠1 in the beginning of the DSP. A desired leg force trajectory can be designed: 𝐹𝑑

𝑠1(𝑑) = 𝐹0

𝑠1(1βˆ’ 𝑑

𝑇DSP). One may consider to design a feedback controller to drive 𝐹𝑠

1 β†’ 𝐹𝑑

𝑠1. However, this creates high restriction onπœπ‘ 

1and can lead to conflict between the Backstepping-CLF inequality in Eq. (6.43) and the CBF inequality in Eq. (6.51).

To decrease𝐹𝑠

1in a relaxed fashion, we create the inequality condition: (1βˆ’π‘)𝐹𝑑

𝑠1 ≀ 𝐹𝑠

1 ≀ (1+𝑐)𝐹𝑑

𝑠1, where 𝑐 ∈ (0,1) is a relaxation coefficient. As shown in Fig.

6.9, this generates an admissible force region (indicated by the blue region), which decreases as the desired force 𝐹𝑑

𝑠1 decreases with time. This two inequalities can be converted into a single inequality: β„Žπ‘ 

1 = (𝑐 𝐹𝑑

𝑠1)2βˆ’ (𝐹𝑠

1 βˆ’ 𝐹𝑑

𝑠1)2 β‰₯ 0. Note that this barrier function is ill-defined as𝐹𝑑

𝑠1 approaches to 0 (the setCbecomes trivial).

Thus we increase the relaxation by adding a positive valueΔ𝐹 in the inequality:

(1βˆ’π‘)𝐹𝑑

𝑠1 βˆ’Ξ”πΉ ≀ 𝐹𝑠

1 ≀ (1+𝑐)𝐹𝑑

𝑠1 +Δ𝐹 , (6.52) which generates the red admissible region. By defining

β„Žπ‘ 

1 =(𝑐 𝐹𝑑

𝑠1 +Δ𝐹)2βˆ’ (𝐹𝑠

1 βˆ’πΉπ‘‘

𝑠1)2 β‰₯ 0, (6.53) the setCis always non-trivial before lift-off. Thus we have another CBF inequality:

𝐴𝑠1

CBFπœπ‘ 

1 ≀ 𝑏𝑠1

CBF.

Similarly, the two CBF inequalities are incorporated with the Backstepping-CLF inequality to formulate the final BBF-QP controller for the walking in DSP:

(πœπ‘ 

1, πœπ‘ 

2, 𝛿) = argmin

(πœπ‘ 

1,πœπ‘ 

2,𝛿)∈R3

𝜏2

𝑠1+𝜏2

𝑠2+𝛿2 s.t. 𝐴Backstepping

CLF 𝜏DSP

𝑧 ≀ 𝑏Backstepping

CLF +𝛿

𝐴𝑠1

CBFπœπ‘ 

1 ≀ 𝑏𝑠1

CBF, 𝐴𝑠2

CBFπœπ‘ 

2 ≀ 𝑏𝑠2

CBF.

(6.54)

Despite the complexity in the derivation, the BBF-QPs are designed simply to stabilize the vertical position of the mass to the desired trajectory and simultaneously satisfy the conditions on the leg forces during walking.

Figure 6.9: Contact force condition for lift-off in the DSP (the red region represents the admissible region for the leg force).

Results

The control procedure with the chosen parameters is presented in Algorithm 2. The stepping gain 𝐾 in Eq. (3.17) is chosen to be the deadbeat gain. The BBF-QP controller is solved at 1kHz. The aSLIP starts from an initial static configuration and walks to a desired pre-impact velocity π‘£βˆ—, which is chosen based on the orbits of the H-LIP. The aSLIP parameters are chosen to match with the robot Cassie1. A video of the results can be seen inhttps://youtu.be/fUZu6y-Gu4g.

Algorithm 2BBF-QP with H-LIP based Walking Synthesis

Initialization: Behavior: 𝑧0 = 1m,𝑇SSP = 0.4s, 𝑇DSP = 0.1s. Control: 𝛼 = 500, 𝛾 =10,π‘˜ =10, 𝑐=0.5,Δ𝐹 =20.

whileSimulation/Control loopdo if SSPthen

Desired step size←H-LIP stepping in Eq. (3.17).

Desired swing foot position←Eq. (6.20).

πœβ†BBF-QP in Eq. (6.50) else

πœβ†BBF-QP in Eq. (6.54) end if

end while

We evaluate the approach for walking on flat ground to different desired velocities.

Fig. 6.10 shows the results. The aSLIP converges closely to the desired walking of the H-LIP. The leg forces behave as expected. Fig. 6.10 (c) shows the trajectories of the horizontal velocity, which are not constant in the DSP. This contributes to the error𝑀. We numerically calculate all 𝑀 for different walking simulations and

1π‘š=33kg, and K=8000N/m, D=100Ns/m, which are the nonlinear leg spring parameters of Cassie atπΏβ‰ˆ1m; see Appendix B. The spring is chosen to be linear for generality.

DSP 0.80.5 0.2

(a)

(c) (d)

𝐸 𝐞

H‐LIP

AdmissibleΒ Region

(b)

𝑝

𝑣

𝑒

𝑒𝑣

Figure 6.10: Walking using direct synthesis via BBF-QP: (a) the phase portrait of the horizontal state trajectory (blue) in the SSP for walking withπ‘£βˆ— =0.5m/s, and the black is the corresponding orbit of the H-LIP (green lines are the orbital lines, (b) the leg forces during walking with the gray region being the admissible region in the DSP; (c) the walking velocity and (d) the trajectories of the discrete error state eπ‘˜ = [𝑒𝑝, 𝑒𝑣]𝑇 of walking with differentπ‘£βˆ— =0.2,0.5,0.8m/s (indicated by dashed lines in (c)).

inner approximateπ‘Š by a square. Since(𝐴+𝐡𝐾)2=0,𝐸 =π‘Š βŠ• (𝐴+𝐡𝐾)π‘Š; we get an inner approximation of𝐸 (shown in Fig. 6.10 (d)), and all the error statese are inside 𝐸, which validates the proposed approach of the H-LIP based stepping in Chapter 3. It is also important to note that, similar to the indirect synthesis on 3D-aSLIP, the direct synthesis via BBF-QP can be readily applied to 3D-aSLIP with two orthogonal planar stepping stabilizations via the H-LIP.