当前位置：网站首页>Formula understanding in quadruped control

Formula understanding in quadruped control

2022-06-26 08:33:00 【Yu Getou】

explain ： This article is used to record the formula understanding of quadruped related papers , Due to my limited ability , The understanding of the formula comes from studying the content of the paper , A combination of online articles and personal guesses , You are welcome to criticize and correct any inaccuracies , This article will also be updated . Welcome to contact me ：[email protected]

《MIT Cheetah 3: Design and Control of a Robust, Dynamic Quadruped Robot》

1. Support leg controller

（1） Based on the centralized quality model (lumped model) Balance controller for (Balance Controller)

i. The motion equation of the robot which regards the robot as a single rigid body model :
$\underbrace{\begin{bmatrix} I_3 &\dots & I_3 \\ [p_1 p_c]\times &\dots &[p_4 p_c]\times \end{bmatrix}}_{A} F=\underbrace{\begin{bmatrix} m(\ddot{p}_c+g )\\ I_G\dot{\omega _b} \end{bmatrix}}_b$

Using the basic Newton's laws of motion

Together = ma
Resultant moment = inertia * Angular acceleration

ii. Use PD The control law calculates the centroid acceleration and angular acceleration
$\left[\begin{array}{c} \ddot{\boldsymbol{p}}_{c, d} \\ \dot{\boldsymbol{\omega}}_{b, d} \end{array}\right]=\left[\begin{array}{c} \boldsymbol{K}_{p, p}\left(\boldsymbol{p}_{c, d}-\boldsymbol{p}_{c}\right)+\boldsymbol{K}_{d, p}\left(\dot{\boldsymbol{p}}_{c, d}-\dot{\boldsymbol{p}}_{c}\right) \\ \boldsymbol{K}_{p, \omega} \log \left(\boldsymbol{R}_{d} \boldsymbol{R}^{T}\right)+\boldsymbol{K}_{d, \omega}\left(\boldsymbol{\omega}_{b, d}-\boldsymbol{\omega}\right) \end{array}\right]$
The goal is ： Get the optimal control force F, Make the center of mass tend to the ideal center of mass dynamic state
$\boldsymbol{b_d} = \begin{bmatrix} m(\ddot{\boldsymbol{p}}_{c,d}+\boldsymbol{g})\\ \boldsymbol{I_G}\dot{\boldsymbol{\omega}}_{b,d} \end{bmatrix}$
Because the robot motion equation is linear , The optimization problem can be transformed into QP problem
$\boldsymbol{F^*}=\min_{F\in R^{12}} \quad (AF-b_d)^TS(AF-b_d)+\alpha ||F||^2+\beta || F-F_{prev}||^2\\ s.t. \quad CF\le d$
significance ：

$AF-b_d)^TS(AF-b_d)$ ---- Make the center of mass state meet the expectation ,S The matrix represents the priority of rotation and displacement in the control .
$F||^2$ ---- Minimize the force
$F-F_{prev}||^2$ ---- $F_{prev}$ Represents the last force in the iteration , The goal is to limit the difference between the solutions of the two iterations .
$\alpha,\beta>0$ ---- Specify the standardization of knowledge , And filter the solution to some extent .
$CF\le d$ ---- The details are unknown , The purpose is to ensure that the solution is in the feasible region .

（2）MPC controller (Model Predictive Control)

$J=\sum_{i=0}^{k}(x_i-x_i^{ref})^TQ_i(x_i-x_i^{ref})+u_i^TR_iu_i$
The goal is ： Constantly updated $u_i$ Input to the control system , Minimize the above cost function .

The update frequency f： In this paper, the optimal solution is given by 30Hz The frequency of is input to the controller
Solving process ： Paper points out , Need to put MPC Problem into QP Solve the problem .
MPC For details of the controller, please refer to my study notes ：MPC Learning notes

2. Swing leg controller

（1） Foot drop point calculation

$p_{step,i}=p_{h,i}+\underbrace{\frac{T_{c_\phi}}{2}{\dot{p}}_{c,d}}_{Raibert \ Heuristic}+\underbrace{\sqrt{\frac{z_0}{||g||}}({\dot{p}}_c-{\dot{p}}_{c,d})}_{Capture \ Point}$
$\frac{T_{c_\phi}}{2}{\dot{p}}_{c,d}$ ---- according to 《Legged robots that balance》 Description in , The landing point of the foot end determined according to the formula , Will make the motion symmetrical , Acceleration is 0

$\sqrt{\frac{z_0}{||g||}}({\dot{p}}_c-{\dot{p}}_{c,d})$ ---- according to 《Capture point: A step toward humanoid push recovery》 Description in , The goal is to make the foothold closer capture point, At this point the legs will be balanced .

$T_{c_\varphi}$ ---- In an ideal state stance State duration
$z_0$ ---- The height of the target location point
$p_{h,i}$ ---- leg i Corresponding hip motor hip The location of

（2） Swing trajectory calculation –PD controller + Feedforward control

i. Calculation of feedforward torque
$\tau_{\mathrm{ff}, i}=\boldsymbol{J}_{i}^{\top} \boldsymbol{\Lambda}_{i}\left({ }^{\mathfrak{B}} \boldsymbol{a}_{i, \text { ref }}-\dot{\boldsymbol{J}}_{i} \dot{\boldsymbol{q}}_{i}\right)+\boldsymbol{C}_{i} \dot{\boldsymbol{q}}_{i}+\boldsymbol{G}_{i}$
$J_i$ ---- Jacobian matrix of the foot
$\Lambda _i$ ---- Inertia matrix of operation space
$^\mathfrak{B}a_{i,ref}$ ---- Reference acceleration
$q_i$ ---- Vector of joint structure
$C_i$ ---- Coriolis matrix
$G_i$ ----- Heavy moment

ii. Track following controller
$\boldsymbol{\tau}_{i}=\boldsymbol{J}_{i}^{\top}\left[\boldsymbol{K}_{p}\left({ }^{\mathfrak{B}} \boldsymbol{p}_{i, \text { ref }}-{ }^{\mathfrak{B}} \boldsymbol{p}_{i}\right)+\boldsymbol{K}_{d}\left({ }^{\mathfrak{B}} \boldsymbol{v}_{i, \mathrm{ref}}-{^\mathfrak{B}} \boldsymbol{v}_{i}\right)\right]+\boldsymbol{\tau}_{\mathrm{ff}, i}$

Control frequency f：4.5kHz

In order to make PD The controller remains stable , You need to adjust the gain $K_{p}$ Do the following
$K_{p,j}=\omega_{des}^2\Lambda_{jj}(q)$

$K_{p,j}$ ---- $K_p$ No j Diagonal terms .
$\omega_{des}$ ---- Target natural frequency
$\Lambda_{jj}$ ---- The first j Inertia matrix of diagonal terms in operation space

3. Virtual Prediction supports polygons （Virtual Predictive Support Polygon）

i. Calculate the weight factor for each leg

（1） Virtual support polygon ： Provide centroid position points under various gait
（2） Define the contribution of each leg to the prediction polygon （ Phase by phase ）： Point out which leg should be raised or touched , And the time of state switching
$\begin{array}{r} K_{c_{\phi}}=\frac{1}{2}\left[\operatorname{erf}\left(\frac{\phi}{\sigma_{c_{0}} \sqrt{2}}\right)+\operatorname{erf}\left(\frac{1-\phi}{\sigma_{c_{1}} \sqrt{2}}\right)\right] \\\\ K_{\bar{c}_{\phi}}=\frac{1}{2}\left[2+\operatorname{erf}\left(\frac{-\phi}{\sigma_{\bar{c}_{0}} \sqrt{2}}\right)+\operatorname{erf}\left(\frac{\phi-1}{\sigma_{\bar{c}_{1}} \sqrt{2}}\right)\right] \\\\ \Phi=s_{\phi} K_{c_{\phi}}+\bar{s}_{\phi} K_{\bar{c}_{\phi}} \end{array}$
$e r f (x)$ ---- Gaussian error function
$K_{c_{\phi}},K_{\bar{c}_{\phi}}$ ---- Weight factor of support phase and swing phase
$\Phi$ ---- The total weight factor of the leg
meaning ： In the supported state , The closer a leg is to the center , The more it can be used as a support point , Under the dynamic pendulum , The closer a leg is to the center , The less it can be used to balance

ii. Calculate the virtual points of each leg

$\left[\begin{array}{c} \boldsymbol{\xi}_{i}^{-} \\ \boldsymbol{\xi}_{i}^{+} \end{array}\right]=\left[\begin{array}{ll} \boldsymbol{p}_{i} & \boldsymbol{p}_{i^{-}} \\ \boldsymbol{p}_{i} & \boldsymbol{p}_{i^{+}} \end{array}\right]\left[\begin{array}{c} \Phi_{i} \\ 1-\Phi_{i} \end{array}\right]$

A positive superscript indicates counterclockwise , A negative superscript indicates clockwise

iii. Calculate the predicted polygon vertices for each leg

$\boldsymbol{\xi}_{i}=\frac{\Phi_{i} \boldsymbol{p}_{i}+\Phi_{i} \boldsymbol{\xi}_{i}^{-}+\Phi_{i+} \boldsymbol{\xi}_{i}^{+}}{\Phi_{i}+\Phi_{i^{-}}+\Phi_{i^{+}}}$

iv. Calculate the desired centroid position

$\hat{\boldsymbol{p}}_{C o M, d}=\frac{1}{N} \sum_{i=1}^{N} \boldsymbol{\xi}_{i}$

4. Attitude adjustment in sloping terrain

meaning ： Represents a plane , That is, the plane where the quadruped robot is currently located
$z(x, y)=a_{0}+a_{1} x+a_{2} y$
meaning ： Get a plane according to the current state of the robot , namely a={ $a_0,a_1,a_2$ }
$\begin{aligned} \boldsymbol{a} &=\left(\boldsymbol{W}^{T} \boldsymbol{W}\right)^{\dagger} \boldsymbol{W}^{T} \boldsymbol{p}^{z} \\ \boldsymbol{W} &=\left[\begin{array}{lll} \mathbf{1} & \boldsymbol{p}^{x} & \boldsymbol{p}^{y} \end{array}\right]_{4 \times 3} \end{aligned}$

$\boldsymbol{p^x},\boldsymbol{p^y},\boldsymbol{p^z}$ ---- Including the status of each leg of the robot , such as $\boldsymbol{p^x} = (p_1^x,p_2^x,p_3^x,p_4^x)$

5. State estimation

It uses Kalman filtering and Extended Kalman filter , I won't elaborate here .

《Highly Dynamic Quadruped Locomotion via Whole-Body Impulse Control and Model Predictive Control》

1. Hybrid controller

The goal is ： Split the position control into two simple controllers to control

（1）MPC controller

$\begin{array}{l} m \ddot{\mathbf{p}}=\sum_{i=1}^{n_{c}} \mathbf{f}_{i}-\mathbf{c}_{g}, \\\\ \frac{d}{d t}(\boldsymbol{I} \boldsymbol{\omega})=\sum_{i=1}^{n_{c}} \mathbf{r}_{i} \times \mathbf{f}_{i} \end{array}$

Similar to the above balance controller , Objects are Lumped mass model , But here we use MPC To find the optimal control force .

（2）WBIC controller （Whole-Body Impulse Control）

$\boldsymbol{A}\left(\begin{array}{c} \ddot{\mathbf{q}}_{f} \\ \ddot{\mathbf{q}}_{j} \end{array}\right)+\mathbf{b}+\mathbf{g}=\left(\begin{array}{c} \mathbf{0}_{6} \\ \boldsymbol{\tau} \end{array}\right)+\boldsymbol{J}_{c}^{\top} \mathbf{f}_{r}$
$A$ ---- quality ( inertia ) matrix
$b$ ---- Coriolis force
$g$ ---- gravity
$\tau$ ---- Joint torque
$f_r$ ---- Increased force
$J_c$ ---- Jacobian matrix of touchdown point