Author's personal copy
Automatica 45 (2009) 2982–2985
Contents lists available at ScienceDirect
Automatica journal homepage: www.elsevier.com/locate/automatica
Technical communique
On the second-order sliding mode control of nonlinear systems with uncertain control direction Giorgio Bartolini, Alessandro Pisano ∗ , Elio Usai Department of Electrical and Electronic Engineering, University of Cagliari, Cagliari, Italy
article
info
Article history: Received 5 March 2009 Received in revised form 29 May 2009 Accepted 7 September 2009 Available online 4 October 2009 Keywords: Unknown control direction Second-order sliding modes
abstract This note deals with the implementation of a second-order sliding mode control algorithm for a class of nonlinear systems in which the sign of the high-frequency gain, though constant, is unknown. A specific second-order sliding mode control algorithm, the ‘‘Suboptimal’’ algorithm, is properly modified in order to face the uncertainty in the control direction. It is shown that after a finite time the uncertain sign is identified and the standard finite time convergence takes place from that time on. Simulation results are provided. © 2009 Elsevier Ltd. All rights reserved.
1. Introduction In this note we consider a class of systems affected by bounded matching uncertainties where the sliding output has relative degree two and whose main pathology consists in having a constant uncertain control direction. This work can be then considered as an ideal continuation and generalization of the previous paper (Bartolini, Ferrara, & Giacomini, 2003) in which nonlinear uncertain systems having unknown constant high-frequency gain (HFG) sign and known relative degree 1 were considered. The main ingredient of the proposed scheme is the introduction of an adaptive gain, switching between the values +1 and −1, which premultiplies the control variable. It is shown that after a finite number of switchings the adaptive gain settles to the constant, ‘‘correct’’, value guaranteeing the finite time convergence of the system to the origin. We consider systems in normal form as follows x˙ = f (x, s, s˙)
(1) ∗
s¨ = ϕ(x, s, s˙, t ) + g γ (x, s, s˙, t )u(t ),
(2)
where s ∈ R represents the sliding variable, u ∈ R is the control input, x ∈ Rm is the internal dynamics vector. Let the internal
The material in this paper was not presented at any conference. This paper was recommended for publication in revised form by Associate Editor Zhihua Qu under the direction of Editor André L. Tits. ∗ Corresponding address: University of Cagliari, Dept. of Electrical and Electronic Engineering, Piazza d'Armi, I-09123 Cagliari, Italy. Tel.: +39 0706755760; fax: +39 0706755782. E-mail addresses:
[email protected] (G. Bartolini),
[email protected] (A. Pisano),
[email protected] (E. Usai). 0005-1098/$ – see front matter © 2009 Elsevier Ltd. All rights reserved. doi:10.1016/j.automatica.2009.09.018
dynamics (1) be input-to-state (ISS) stable (Khalil, 2002), and let the scalar uncertain drift and gain functions fulfill the following inequalities for some constants F , G1 and G2 .
|ϕ(·)| ≤ F ,
0 < G1 ≤ γ (·) ≤ G2 .
(3) ∗
The uncertain constant control gain g ∈ {−1, +1} represents the uncertainty on the HFG sign. The solutions of (1)–(2) will be understood in the Filippov sense (Filippov, 0000) in order to allow the use of discontinuous control. The above conditions imply that the control gain γ (·) is assumed to have known sign. In the adaptive control literature the case of systems with unknown HFG sign has been treated by means of the so-called Nussbaum functions (Nussbaum, 1983) which have been applied to deal with large classes of linear and nonlinear processes with relative degree one, stable zero-dynamics, and unknown control direction (Ryan, 1995). This method does not guarantee exponential convergence, and, furthermore, may frequently lead to the presence of peaking phenomena (Bartolini et al., 2003; Sussmann & Kokotovic, 1991). The generalization of the adaptive control approach to systems with relative degree two and uncertain sign of the control gain has not been as much fruitful. The paper Ryan (1995), the seminal contribution to this topic, considered the presence of a persistent viscous dissipation. More recently, in the framework of the so-called VS-MRAC (Variable Structure Model Reference Adaptive Control) approach (Oliveira, Peixoto, Nunes, & Hsu, 2007) a method has been presented, based on the ‘‘monitoring function’’ concept, for systems having a known arbitrary relative degree by using derivative estimators based on higher-order sliding modes (see Levant (2003)) to reduce it to one. The problem was also studied in Drakunov (1993) for relative degree one sliding mode control systems.
Author's personal copy
5 4 3 2 1 0 –1 –2 –3 –4 –5
2983
6 ∆1
∆4
∆3
∆2
∆5
∆1
∆2
1
2
∆4
∆3
4 dy(t)/dt
dy(t)/dt
G. Bartolini et al. / Automatica 45 (2009) 2982–2985
2 0 –2 –4
0
1
2
3
4
5
6
7
–6 –1
0
y(t)
3
4 y(t)
5
6
7
8
9
Fig. 1. The system trajectory in TEST 1 (left) and TEST 2 (right).
In this paper we provide a solution based on the second-order sliding mode approach. In Section 2 the paper’s main result is presented, and a simple academic simulation example is discussed. Section 3 shows some simulation result on a fourth-order plant which is open-loop unstable, and the Section 4 draws some concluding remarks. 2. Main result Consider system (1)–(2) and let the sliding output variable s be the only signal available for measurements. Let known constants F , G1 , G2 exist such that conditions (3) are fulfilled. The requirement of a constant value for the upper bound F might seem to imply a local, a priori bounded, domain of attraction. As shown in Bartolini, Ferrara, Pisano, and Usai (2001), by considering ‘‘sufficiently large’’ value of F the convergence domain can be enlarged arbitrarily (semi-global stability). The problem of making s(t ) identically zero after a finite transient is the main topic of this paper. It was shown in Bartolini, Pisano, and Usai (2008) that the following ‘‘Suboptimal’’ second-order sliding mode control algorithm (Bartolini et al., 2001) u(t ) = −UM sign(s(t ) − β s(tMi )), i = 0, 1, 2, . . . tMi ≤ t < tM ,i+1 UM =
ηF G1
η>1
β ∈ 1−
2G1 (η − 1)
η(G2 + G1 )
Di = {(s, s˙) ∈ R2 : ∆i ≤ |s| < ∆i+1 },
∆0 = 0,
∆1 = |s(0)| + k2 ,
i = 1, 2, . . .
(7)
∆i+1 = ∆1 (1 + µ)i
i = 1, 2, . . .
(8)
where µ > 0 and k 6= 0 are appropriate constants. It follows from (8) that
∆i+1 − ∆i = ∆1 µ(1 + µ)i−1 i > 1
(9)
which means that the ‘‘thickness’’ of domains Di is larger and larger as the index i grows. The above ‘‘increasing-thickness’’ property is instrumental in guaranteeing the convergence. The monitoring function approach in Oliveira et al. (2007) shares a similar functioning principle. By (8), at the initial time t = t0 the system belongs to the domain D0 . The constant ρ(t ) is initialized to 1, and its sign is reversed every time that the actual Di domain (i = 0, 1, 2, . . .) is left for the first time. An analytical representation of this adaptation mechanism is as follows.
(4)
,1
(5)
with tM0 ≡ t0 and tMi , i = 1, 2, . . . is the sequence of time instants at which s˙(tMi ) = 0, can stabilize in finite time the system (1)–(3) with g ∗ = 1, i.e. with known HFG sign. Remark 1. The detection of the sequence tMi (or, equivalently, of the sequence s(tMi )) can be made without making the exact reconstruction of the derivative s˙ of the switching function. In the absence of noise a possible estimation algorithm, which uses a peak detector device, has been illustrated in Bartolini, Ferrara, and Usai (1997) and Bartolini, Ferrara, and Usai (1998). The robustness of the suboptimal algorithm against the approximate detection of the sequence tMi has been also investigated there. Even if the presented peak detector algorithm does not involve divisions for small sampling times, it is sensitive against the measurement noise. The interested reader can find in Bartolini, Pisano, and Usai (2004) a modified implementation which makes the algorithm robust against the noise effect. From now on, denote the ‘‘Suboptimal’’ algorithm in compact form as u(t ) = −UM sub(s; β). To address the uncertainty on the HFG sign, let us premultiply the suboptimal control law by a timevarying gain coefficient ρ(t ): u(t ) = −UM ρ(t )Sub(s; β),
for adjusting ρ(t ), let us partition the s − s˙ phase plane by means of disjoint domains Di as follows
ρ(t ) ∈ {−1, +1} .
(6)
The gain ρ(t ) appearing in (6) can assume only the two values ρ(t ) = 1 or ρ(t ) = −1. In order to define an appropriate logic
Initialization : ρ(0) = 1; i = 1; step 1. if |s(t )| > ∆i then set ρ(t ) := −1, i := i + 1, and go to step 2. step 2. if |s(t )| > ∆i then set ρ(t ) := 1, i := i + 1, and go to step 1.
(10)
We shall prove that after a finite number of switchings of ρ(t ) (possibly zero) the origin of the state plane is reached in finite time. The following academic example highlights the importance of the progressive enlargement of the thickness of the domains Di . Let us consider the simple perturbed double integrator y¨ = ϕ + u, with initial conditions y(0) = 0.5, y˙ (0) = 1 and uncertain constant drift term ϕ = 1. The controller ignores the sign of the control gain. The algorithm (6), (10), has been implemented with the parameters UM = 4 and β = 0.5. Preliminarily (TEST 1), a modified definition of the regions Di is considered, namely the coefficients ∆i are chosen such that ∆1 = 1, ∆i = ∆i−1 + 1 (i = 2, 3, . . .). This corresponds to the regions Di in (7) having constant unitary thickness 1, i.e. ∆1 = 1, ∆2 = 2, ∆3 = 3, ∆4 = 4, etc. This means that the fundamental tuning condition (11), that will be derived later, is not fulfilled in the TEST 1. It is seen in Fig. 1left that the system trajectory diverges. The proposed formula (8) for the computation of the sequence ∆i has been considered in TEST 2, with the parameters k2 = 0.5 and µ = 1. This leads to ∆1 = 1, ∆2 = 2, ∆3 = 4, ∆4 = 8, etc. It is seen in the Fig. 1-right that the system trajectory now exhibits the desired convergence properties. The correct value of the gain ρ(t ) is identified after three switches, and region D4 is not entered anymore.
Author's personal copy
2984
G. Bartolini et al. / Automatica 45 (2009) 2982–2985
Convergence is ensured if
(∆i+2 − ∆i+1 ) >
.
.
1+µ>
Fig. 2. The contraction mechanism.
2.1. The proposed algorithm We are now in position of proving the following: Theorem 1. Given system (1)–(3), the control algorithm (6), (10) guarantees the finite time attainment of condition s = s˙ = 0 provided that the sequence ∆i is defined according to (8) with
(G2 − G1 )UM + 2F . G1 UM − F
(11)
Proof of Theorem 1. The sequence of increments and decrements of s˙ characterizes the proposed strategy. Consider the system behaviour after the domain Di has been entered at the time t = ti (i ≥ 1). At the time t = ti , |s(ti )| = ∆i . Denote |˙s(ti )| = s˙i . Without loss of generality consider the case when both s and s˙ are increasing in the domain Di (see Fig. 2). Thus, in finite time t = ti+1 the domain Di+1 is entered. At that time the control gain ρ(t ) reverses its sign, and, from this point on, s˙ decreases in modulus. By construction, s˙i+1 > s˙i and s˙i+2 < s˙i+1 . It must be proven that, if the domain Di+2 is entered the corresponding value of s˙i+2 is strictly less than s˙i . Due to the uncertainties in the system dynamics, a worst case analysis is mandatory which selects the uncertainties that maximize the modulus of s˙i+2 . Clearly, by making reference to the Fig. 2, this amounts to consider the maximal positive slope in the domain Di and the less negative slope in the domain Di+1 . In the domain Di , the trajectory having the maximal positive slope is s(t ) = ∆i +
1 s˙2 (t ) − s˙2i 2 G2 U M + F
(12)
.
s˙2i+1 = s˙2i + 2(G2 UM + F )(∆i+1 − ∆i ).
(13)
In the domain Di+1 the trajectory having the less negative slope is 1 s˙2 (t ) − s˙2i+1 . s(t ) = ∆i+1 − 2 G1 UM − F
(14)
The value of s˙i+2 is obtained by putting s(t ) = ∆i+2 into (14). By (13) s˙2i+2 = s˙2i + 2[(G2 UM + F )(∆i+1 − ∆i )
− (G1 UM − F )(∆i+2 − ∆i+1 )].
(15) s2i .
To ensure the convergence it must be proven that ˙ < ˙ This means that the bracketed term in (15) must be negative, i.e.
(G2 UM + F )(∆i+1 − ∆i ) < (G1 UM − F )(∆i+2 − ∆i+1 ).
(16)
Note that if the thickness of the adjacent domains Di and Di+1 would be constant (e.g., if ∆i+1 − ∆i = ∆i+2 − ∆i+1 = const .) the worst case sequence {˙si } diverges and no contractive behaviour can be longer guaranteed.
G2 UM + F G1 UM − F
.
(18)
It is trivial to derive from (18) the tuning inequality (11). The maximal number of switches for the gain function ρ(t ) can be computed. Let s˙1 the value of s˙ at the time instant at which the domain D1 is entered. It was shown that the bracketed term in (15) is negative. Then it can be written s˙2i+2 = s˙2i − ξ 2 . The switching process ends when s˙2i+2 becomes negative. The maximal number of switches is then NM = 1 + 2i∗ , where i∗ is the smallest integer such that s˙21 − 2ξ 2 i∗ < 0. By increasing µ it correspondingly increases the ξ constant, which implies that the maximal number of switches can be reduced by increasing µ. Once the axis s˙ = 0 is hit at the generic time instant ti∗ > ti the value of ρ(t ) will be such that sign(ρ(t ∗ )) = sign(g ∗ ), and, at the same time, the domain Di will never be left due to the suboptimal algorithm contraction property which establishes that successive horizontal axis crossing point are closer and closer to the origin (Bartolini et al., 2001). Then, from the time instant ti∗ on, the unknown sign of the high-frequency gain has been identified and the remaining convergence proof follows the standard arguments already used in previous papers (e.g. Bartolini et al. (2001)). This conclude the proof. Remark 2. By similar considerations as those made in Bartolini et al. (2008), it can be shown that the proposed technique can be also applied, without any modification, to stabilize systems with relative degree one and having unknown sign of the highfrequency gain. Remark 3. The error in estimating the sequence s(tMi ) (see the Remark 1) causes a deterioration of the ideal performance s = s˙ = 0 stated in the Theorem 1. Actually, the variables s(t ) and s˙(t ) will eventually enter a small vicinity of zero instead of converging exactly to zero (Bartolini et al., 1997, 1998). 3. Simulations
The value of s˙i+1 is obtained by putting s(t ) = ∆i+1 into (12) which yields
s2i+2
(17)
By taking into account (9) the convergence condition (17) can be expressed as follows in terms of the µ parameter.
.
µ>
(G2 UM + F ) (∆i+1 − ∆i ). (G1 UM − F )
Consider the fourth-order system
x˙ 1 = x2 , y˙ 1 = y2 ,
x˙ 2 = −x1 − x32 + y1 + y2 (19) y˙ 2 = y21 + y2 + x1 + x2 + g ∗ (3 + sin(2π t ))u
with the input term u premultiplied by the uncertain coefficient g ∗ ∈ {−1, 1} which models the assumed uncertainty on the sign of the control high-frequency gain. The internal dynamics can be easily shown to be ISS. Note that the input output dynamics may exhibit the finite escape time phenomenon if not properly controlled. The initial conditions are x(0) = y(0) = [1, 1]. Assume the bound F = 15 for the system drift term, and set coefficients G1 = 2, G2 = 4. By using the tuning formulas (5), (11) the parameter values UM = 40, β = 0.6, k2 = 0.1, µ = 1 are derived. With the selected parameters the state plane is partitioned into the strips having ∆1 = 0.6, ∆2 = 1.2, etc. Let us first simulate the system with negative high-frequency gain g ∗ = −1 (TEST 3). Fig. 3 shows that convergence is achieved after one switch of the coefficient ρ(t ), and Fig. 4-left shows the corresponding trajectory in the y1 − y2 plane. Now run the simulation test putting the positive control gain g ∗ = 1 (TEST 4). The correct value for ρ(t ) is identified with no switches for ρ(t ), and fast convergence to zero is ensured (see Fig. 4-right).
Author's personal copy
G. Bartolini et al. / Automatica 45 (2009) 2982–2985
2985
y (t) 1
1 0 –1
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
0.5
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
0.5
0
0.05
0.1
0.15
0.2
0.25
0.3
0.35
0.4
0.45
0.5
y (t) 2
20 0 –20
ρ(t)
1 0 –1
Time [sec]
Fig. 3. TEST 3 (g ∗ = −1). The state trajectories and the coefficient ρ(t ) vs. time.
Fig. 4. The phase-plane trajectory in TEST 3 (right) and in TEST 4 (right).
4. Conclusions A second-order sliding mode based control algorithm has been proposed to address the finite time stabilization problem for a class of nonlinear uncertain systems with relative degree two and uncertain sign of the control gain. The simulation results confirm the expected performance. References Bartolini, G., Ferrara, A., & Giacomini, L. (2003). A switching controller for systems with hard uncertainties. IEEE Transactions on Circuits Systems I—Fundamental Theory Applications, 50(8), 984–990. Bartolini, G., Ferrara, A., Pisano, A., & Usai, E. (2001). On the convergence properties of a 2-sliding control algorithm for nonlinear uncertain systems. International Journal of Control, 74, 718–731. Bartolini, G., Ferrara, A., & Usai, E. (1997). Output tracking control of uncertain nonlinear second-order systems. Automatica, 33(27), 2203–2212. Bartolini, G., Ferrara, A., & Usai, E. (1998). Chattering avoidance by second-order sliding mode control. IEEE Transactions on Automatic Control, 43(2), 241–246. Bartolini, G., Pisano, A., & Usai, E. (2004). An improved second-order sliding mode control scheme robust against the measurement noise. IEEE Transactions on Automatic Control, 49(10), 1731–1737.
Bartolini, G., Pisano, A., & Usai, L. (2008). On an output-feedback stabilization problem with uncertainty in the relative degree. International Journal of Robust and Nonlinear Control, 7, 741–755. Drakunov, S. (1993). Sliding mode control of the systems with uncertain direction of the control vector. In Proc. 32nd IEEE conference on decision and control CDC ´ (pp. 2477–2478). Vol. 93. Filippov, A. F. (1988). Differential equations with discontinuous right-hand side. Dordrecht, The Netherlands: Kluwer. Khalil, H. K. (2002). Nonlinear systems (third edn). Upper Saddle River, New Jersey: Prentice Hall. Levant, A. (2003). Higher-order sliding modes, differentiation and output-feedback control. International Journal of Control, 76(9/10), 924–941. Nussbaum, R. D. (1983). Some remarks on a conjecture in parameter adaptive control. Systems & Control Letters, 3, 243–246. Oliveira, T. R., Peixoto, A. J., Nunes, E. V. L., & Hsu, L. (2007). Control of uncertain nonlinear systems with arbitrary relative degree and unknown control direction using sliding modes. International Journal of Adaptive Control and Signal Processing, 21(8/9), 692–707. Ryan, E. P. (1995). Universal stabilization of a class of nonlinear systems with homogeneous vector fields. Systems & Control Letters, 26, 177–184. Ryan, E. P. (1995). Adaptive stabilization of a class of nonlinear systems of relative degree two. In Proc. 1995 European control conference (pp. 1670–1675). Sussmann, H. J., & Kokotovic, P. V. (1991). The peaking phenomenon and the global stabilization of nonlinearsystems. IEEE Transactions on Automatic Control, 36(4), 424–440.