Nonlinear Discrete-Time Observer Design with Linearizable Error Dynamics MingQing Xiao Department of Mathematics Southern Illinois University Carbondale, IL 62901/USA
[email protected]
Nikolaos Kazantzis Department of Chemical Engineering Worcester Polytechnic Institute Worcester, MA 01609 /USA
[email protected]
Costas Kravaris Department of Chemical Engineering University of Patras GR-26500/Greece
[email protected]
Arthur J. Krener Department of Mathematics University of California Davis, CA 95616/USA
[email protected]
Abstract A necessary and sufficient condition for the existence of a discrete-time nonlinear observer with linearizable error dynamics is provided. The result can be applied to any real analytic nonlinear system whose linear part is observable. The necessary and sufficient condition derived is associated with the solvability of a nonlinear functional equation. A corollary is Siegel’s theorem on the linearizability of a mapping. The method of observer design suggested by this theorem is constructive and can be applied approximately to any sufficiently smooth, linearly observable system yielding a local observer with approximately linear error dynamics.
Keywords: Discrete Time Nonlinear system, Nonlinear observer, Linearizable Error Dynamics, Output Injection, Functional Equations.
1
Introduction
We consider the problem of estimating the current state x(k) of a nonlinear discrete-time dynamical system described by a system of first-order difference equations x(k + 1) = f (x(k)) = F x(k) + O(x(k))2
(1.1)
2
(1.2)
y(k) = h(x(k)) = Hx(k) + O(x(k)) ,
from the past observations y(s), s ≤ k, where k ∈ {0, 1, 2, . . . }. The vector fields f : IRn → IRn , and h : IRn → IRp are assumed to be sufficiently smooth with f (0) = 0, h(0) = 0 and p ≤ n. Later, when showing convergence of a certain formal power series, we shall require that f, h are real analytic functions. An observer is a dynamic system driven by the observations x ˆ(k + 1) = fˆ(ˆ x(k), y(k))
(1.3)
such that the estimation error x ˜(k) = x(k) − x ˆ(k) goes to zero as k → ∞. A local observer is one whose error converges to zero for small x(0) and x ˜(0). 1
One technique of constructing an observer for continuous-time systems is to find a nonlinear change of state and output coordinates which transforms the system (1.2) into a system with linear dynamics driven by nonlinear output injection and with linear output map [10], [8], [2]. The design of an observer for such a system is relatively easy since the error dynamics can be made linear in the transformed coordinates. This approach was extended to discrete time systems in [16] , [17], [3]. For more on discrete time observer design we refer the reader to [5] and its references. Recently, Kazantzis and Kravaris [6] proposed a new approach to finding a continuous-time observer whose error dynamics is also linear in the transformed coordinates. They seek a nonlinear change of state coordinates which transforms the system (1.2) into a system with linear dynamics driven by nonlinear output injection and with an arbitrary output map. This is much easier to accomplish and they were able to do so for all linearly observable, real analytic systems whose spectrum of the linear part lies wholly in the left half plane or wholly in the right half plane. Krener and Xiao [12], [13] have extended this approach to linearly observable, real analytic systems where the spectrum of the linear part is arbitrary. They also showed that the sufficient condition for linearizable error dynamics is necessary [14] and extended the method to some systems which are not linearly observable [15]. When the system is only C r , this approach yields a local observer with approximately linear error dynamics. Kazantzis and Kravaris [7] have extended this approach to linearly observable, real analytic nonlinear discrete-time systems where the spectrum of the linear part lies wholly inside the unit disc or wholly outside the unit disc. The current paper drops the requirement on the spectrum of the linear part. The basic approach is as follows. Suppose that there exists a change of state coordinates z = θ(x), an output injection term β(y) and an n × n matrix A with eigenvalues strictly inside the unit disc such that the dynamics of (1.2) in the z-coordinates is linear and driven by the above nonlinear output injection term z(k + 1) = Az(k) + β(y(k)).
(1.4)
Then, one can construct an observer given by zˆ(k + 1) = Aˆ z (k) + β(y(k)) −1 z (k)) x ˆ(k) = θ (ˆ
(1.5)
whose error z˜ = z − zˆ dynamics is linear in the transformed coordinates z˜(k + 1) = A˜ z (k).
(1.6)
Since the eigenvalues of A are strictly inside the unit disc we are assured that z˜(k) → 0 as k → ∞. Notice, that in the original coordinates the observer is given by x(k))) + β(y) − β(h(ˆ x(k)))] . x ˆ(k + 1) = θ−1 [θ(f (ˆ
(1.7)
If one can only find a change of state coordinates z = θ(x), an output injection β(y) and an n × n Hurwitz matrix A such that the dynamics of (1.2) becomes almost linear driven by the above nonlinear output injection term z(k + 1) = Az(k) + β(y(k)) + g(z(k)) 2
(1.8)
where g(z) = O(z)d+1 , then one can construct a local observer zˆ(k + 1) = Aˆ z (k) + β(y(k)) + g(ˆ z (k)) x ˆ(k) = θ−1 (ˆ z (k))
(1.9)
with approximately linear error dynamics z˜(k + 1) = A˜ z (k) + g(z(k)) − g(ˆ z (k)) z˜(k + 1) = A˜ z (k) + O(z(k), z˜(k))d O(˜ z (k)).
(1.10)
It should be pointed out, that the observer in the original coordinates takes the same form (1.7) as before. To transform the original system to (1.4), θ(x), β(y) and A must satisfy the following functional equation: θ(f (x)) = Aθ(x) + β(h(x)).
(1.11)
When the linear part of the system (1.2) is observable, we shall solve (1.11) by constructing a formal power series for θ up to the degree of smoothness of the system for any suitable β(y) and A. We shall show that this series converges if the system and the output injection are real analytic and A is chosen to satisfy a condition that holds almost everywhere. Truncating the power series at degree d yields a solution to θ(f (x)) = Aθ(x) + β(h(x)) + g(z).
(1.12)
where g(z) = O(z)d+1 and hence z = θ(x) transforms the original system to (1.8). The analysis is similar to that of [12] but it has some subtle differences. Moreover the converse holds. If there exists an observer with linear error dynamics in the transformed coordinates, then it is because (1.11) is solvable. The paper is organized as follows. Section 2 states the main results of this paper and defines some needed concepts. The formal solution of (1.11) is given in Section 3. Furthermore, it is shown that the formal solution exists to a degree of differentiability that is equal to the degree of differentiability of the original system minus one. Section 4 provides an outline of the proof of convergence of the formal solution for real analytic systems. We also show that (1.11) has a unique solution for any choice of β(y) as long as the eigenvalues are outside a set of zero measure in C I. The necessity is proved in Section 4. Two examples are provided in the last section to illustrate the main results of the paper.
2
Statement of Main Results
Before we state the main results we must introduce some concepts. Definition 2.1. Suppose that the eigenvalues of F are λ = (λ1 , . . . , λn ). A complex number µ is multiplicatively resonant with the spectrum of F of degree d > 0 if there exist non-negative integers (m1 , m2 , . . . , mn ) with |m| = k mk = d such that mn 1 m2 λm = λm 1 λ2 · · · λn = µ.
3
Definition 2.2. Suppose that the eigenvalues of F are λ = (λ1 , . . . , λn ). Given constants C > 0, ν > 0, we say a complex number µ is of multiplicative type (C, ν) with respect to the spectrum of F if for any vector m = (m1 , m2 , . . . , mn ) of nonnegative integers, |m| = k mk , we have |µ − λm | ≥
C . |m|ν
(2.1)
Clearly if µ is of multiplicative type (C, ν) with respect to σ(F ) then it is not multiplicatively resonant with the spectrum of F of any degree. The set of µ’s which are of multiplicative type (C, ν) for some C > 0 is dense in the complex plane when ν is large enough (see Theorem 4.1). Notice, that occasionally we delete the modifiers ”multiplicative” and ”multiplicatively”. The following Theorems capture the present study’s main results. Theorem 2.1. Assume that f : IRn → IRn , h : IRn → IRp and β : IRp → IRn are C d+1 vector ∂f ∂h ∂β fields with f (0) = 0, h(0) = 0, β(0) = 0 and F = (0), H = (0), B = (0). Let ∂x ∂x ∂x −1 −1 A = T (F − T BH)T for some invertible matrix T . Suppose that none of the eigenvalues of A are multiplicatively resonant with the spectrum σ(F ) of F of degree 1 ≤ k ≤ d. Then there exists a unique degree d polynomial solution z = θ(x) to the functional equation (1.12) defined locally ∂θ around x = 0 with (0) = T , so that θ is a local diffeomorphism. If all eigenvalues of A are ∂x chosen to be inside the unit disk, then the local state observer for (1.2) given by (1.7) has locally geometrically stable error dynamics which is approximately linear in the transformed coordinates (1.10). Theorem 2.2. Assume that f : IRn → IRn , h : IRn → IRp and β : IRp → IRn are real analytic ∂f ∂h ∂β vector fields with f (0) = 0, h(0) = 0, β(0) = 0 and F = (0), H = (0), B = (0). Let ∂x ∂x ∂x A = T (F − T −1 BH)T −1 for some invertible matrix T . Suppose that there exists a C > 0, ν > 0 such that all the eigenvalues of A are of multiplicative type (C, ν) with respect to the spectrum σ(F ) of F . Then, there exists a unique analytic solution z = θ(x) to the functional equation (1.11) ∂θ defined locally around x = 0 with (0) = T , so that θ is a local diffeomorphism. If all eigenvalues ∂x of A are chosen to be inside the unit disk, then the local state observer for (1.2) given by (1.7) has locally geometrically stable error dynamics which is linear in the transformed coordinates (1.6). Theorem 2.3. Conversely, assume that f : IRn → IRn , h : IRn → IRp are C d+1 vector fields with f (0) = 0, h(0) = 0. If there exists an observer (1.3) and a C d+1 change of coordinates z = θ(x) such that the error dynamics in transformed coordinates is linear (1.6), then θ satisfies (1.11) for some A and a C d+1 output injection term β(y). If the system and the change of coordinates are real analytic, then β(y) is real analytic as well. Remarks: We have stated the above Theorems assuming (1.2) is real but all the results hold for complex systems too. If θ is a locally diffeomorphic solution to (1.11) for some A and β and the spectra of F and A are disjoint, then it is not hard to show that (H, F ) is an observable pair and (A, B) is a controllable pair [12]. On the other hand, if (H, F ) is an observable pair then one can set the spectrum of A = T (F − T −1 BH)T −1 arbitrarily by an appropriate choice of B. 4
∂θ It is with no loss of generality that we can fix (0) to be an arbitrary invertible matrix T . If θ¯ ∂x ∂ θ¯ ¯ Then (0) = T¯, then T¯ is invertible, so we can define θ = T T¯−1 θ. is a local diffeomorphism and ∂x ∂θ (0) = T . In the proofs it will be convenient to consider a particular invertible matrix T . Of ∂x course changing T changes A because A = T (F − T −1 BH)T −1 . According to Theorem 4.1 of this paper, almost all complex numbers are of type (C, ν) with respect to σ(F ) for degree 1 and higher, so this assumption on the spectrum of A and the corresponding one of no resonances are hardly restrictions on the choice of A when (H, F ) is an observable pair. It should be also emphasized, that if we let β = 0, and A = F , then Theorem 2.2 reduces to the well-known Siegel mapping theorem (Page 214, [1]) which we now state in accordance to the present context of analysis. Siegel Mapping Theorem Assume that f : IRn → IRn is a real analytic vector field with f (0) = 0 ∂f and F = (0). Assume that the eigenvalues of F are of multiplicative type (C, ν) for |m| > 1 with ∂x respect to each other, i.e. the spectrum σ(F ) of F . Then, there is a real analytic diffeomorphism satisfying functional equation (1.11) with A = F and β = 0. Notation: In this paper we consider vectors x = (x1 , x2 , . . . , xn ) with n real components xk or equivalently points x in IRn . Let |x| = max{|x1 |, . . . , |xn |}. The vectors m = (m1 , m2 , . . . , mn ) whose components are non-negative integers are called multi-indices and form a subset Zn≥0 of IRn . We define for m ∈ Zn≥0 the scalars |m| = m1 + m2 + · · · + mn ;
m! = m1 !m2 ! · · · mn !,
and for x ∈ IRn , m ∈ Zn≥0 the monomial mn 1 m2 xm = xm 1 x2 · · · xn .
In Zn≥0 we define a partial ordering by writing m≥α
whenever
mi ≥ αi
for i = 1, 2, . . . , n.
The general d-th degree polynomial in x1 , x2 , . . . , xn is of the form pm xm P (x) = |m|≤d
with m ∈ x ∈ IR , pm ∈ IR. Let f (x) be a function whose domain is an open set Ω in IRn and whose range lies in IR. For y ∈ Ω we call f real analytic at y if there exist fm ∈ IR and a neighborhood N of y such that fm (x − y)m (2.2) f (x) = Zn≥0 ,
n
m
for all x in N , where the sum is taken over all multi-indices m. The convergence of (2.2) means absolute convergence for each x ∈ N . We say f is real analytic in Ω if f is real analytic at each y ∈ Ω. A vector f (x) = (f1 (x), . . . , fn (x)) defined in Ω is called real analytic if each of its components is real analytic. Convergence then means component-wise absolute convergence. 5
3
The Formal Solution of the Functional Equation
In this section we shall prove Theorem 2.1. We expand the terms of (1.11) in power series as follows f (x) = F x + f [2] (x) + f [3] (x) + · · · + f [d] (x) + O(x)d+1 , β(h(x)) = BHx + β [2] (x) + β [3] (x) + · · · + β [d] (x) + O(x)d+1 , θ(x) = T x + θ[2] (x) + θ[3] (x) + · · · + θ[d] (x), where f [d] , β [d] , and θ[d] are homogeneous polynomial vector fields of degree d in x. It is assumed, that β(y), T have been chosen such that A = T (F − T −1 BH)T −1 . Therefore, the known quantities are f, h, β, A and the unknown ones are the higher degree terms θ[2] , θ[3] , . . . . Since {F, H} is a linearly observable pair we can choose B to fix the spectrum of A. We assume that we can choose x coordinates so that F is diagonal. When the Jordan form of F is not diagonal the proofs below need to be modified. In this case, the proofs become algebraically involved, and thus, they are omitted for brevity. We assume that we can choose z coordinates so that A is also diagonal by an appropriate choice of T . We can always choose A to have distinct eigenvalues and hence its Jordan form will be diagonal. Complex eigenvalues introduce no difficulties. In fact all the results hold for complex x, z, f, h, A, β, θ, . . . as well. Proof of Theorem 2.1 The proof is based on induction for d ≥ 2. The terms of degree 2 of (1.11) are Aθ[2] (x) − θ[2] (F x) = T f [2] (x) − β [2] (x).
(3.1)
Notice, that the left hand-side is a linear function of θ[2] and the right hand-side is known. In fact, the map θ[2] (x) → Aθ[2] (x) − θ[2] (F x)
(3.2)
is a linear operator on the space of quadratic vector fields. Let ek be the k th unit vector in z mn k 1 space. If m = (m1 , . . . , mn ) then xm = xm 1 . . . xm . The linear operator (3.2) maps e xi xj to k (µk − λi λj )e xi xj . Hence if there is no multiplicative resonance of degree 2, the operator (3.2) is invertible. If θkij ek xi xj (3.3) θ[2] (x) = 1≤k≤n 1≤i≤j≤n
and
T f [2] (x) − β [2] (x) =
γkij ek xi xj
(3.4)
1≤k≤n 1≤i≤j≤n
then θkij
=
γkij . µk − λi λj
Assume that the unique solution θ(x) = θ[1] (x) + θ[2] (x) + θ[3] (x) + · · · + θ[d−1] (x) 6
(3.5)
to (1.12) through terms up to d − 1 has been found where θ[1] (x) = T x. The next term θ[d] (x) must satisfy Aθ (x) − θ (F x) = [d]
[d]
d−1
[d] − β [d] (x).
[i]
θ (f (x))
(3.6)
i=1
The notation [. . . ][d] means the degree part of the bracketed expression. Again, the left hand-side is linear in the unknown θ[d] (x), whereas the right hand-side only involves known or previously computed terms. The map θ[d] (x) → Aθ[d] (x) − θ[d] (F x)
(3.7)
is a linear operator on the space of degree d vector fields. In particular, it maps ek xm to (µk − λm )ek xm where |m| = d. Hence if there is no multiplicative resonance of degree d, then (3.6) has a unique solution. If θk,m ek xm (3.8) θ[d] (x) = 1≤k≤n |m|=d
and d−1
[d] θ[i] (f (x))
− β [d] (x) =
γkm ek xm
(3.9)
1≤k≤n |m|=d
i=1
then θkm =
γkm . µk − λm
(3.10)
The rest of Theorem 2.1 follows easily from the discussion in the introduction.
4
Convergence of the Formal Solution
When f, h, β are real analytic we may apply Theorem 2.1 and conclude that there is an infinite series θ(x) =
∞
θ[d] (x)
d=1
which satisfies (1.12) for all d. We call this a formal solution to (1.11). To prove Theorem 2.2 we must show that this series converges and that it satisfies (1.11). The equation (3.6) defines θ[d] (x) in terms of f , β [d] (x) and θ[1] (x), . . . , θ[d−1] (x). But it is difficult to use it to estimate the size of θ[d] (x) because of the terms in the sum on the right hand-side. Therefore, we sum the series in a different fashion. Let f (x) = F x + f¯(x) ¯ β(y) = BHx + β(x). 7
As before we assume that x coordinates can be chosen so that F is diagonal. We choose B to set ¯ the spectrum of A. The rest of the output injection β(x) is an arbitrary real analytic function. We assume that T can be chosen so that A is diagonal. We construct a sequence of functions {θk (x)}∞ 1=2 which satisfy the homological equations θ1 (x) = T x ¯ Aθ2 (x) − θ2 (F x) = T f¯(x) − β(x) Aθk (x) − θk (F x) = θk−1 (f (x)) − θk−1 (F x).
(4.1)
for k ≥ 3. Clearly θ2 (x) starts with terms of degree two and it is easy to show by induction that θk (x) starts with terms of degree k. We have shown that the sum θ1 (x) + θ2 (x) + θ3 (x) + . . .
(4.2)
converges to an analytic function which solves (1.11) in some polydisc around the origin. Due to space limitation, we omit the proof of the convergence of (4.2). The detail can be found in the full version of the paper [18]. As a final, yet important remark, and before closing this section we shall prove that almost all µ ∈ C I are of multiplicative type (C, ν) with respect to the spectrum of F . Theorem 4.1. For fixed ν > n2 and any C > 0, define M (C) as the set of all µ which are not of multiplicative type (C, ν) with respect to the spectrum of F for degree d ≥ 1. Then measure M (C) ≤ k(n, ν)C 2 , where k(n, ν) is a constant which depends only on n and ν. Therefore measure M (C) = 0 C>0
Proof. Clearly, M (C) =
Ball λm ,
|m|≥1
C |m|ν
where Ball(p, r) stands for an open ball in C I centered at p ∈ C I with radius r. The measure of m C πC 2 the Ball λ , |m|ν is . There are no more than (d + 1)n−1 choices of m = (m1 , m2 , . . . , mn ) |m|2ν such that |m| = d. To see this note that each of m1 , . . . , mn−1 must lie between 0 and d and then mn = d − m1 − . . . − mn−1 . Since (d + 1) ≤ 2d, we have
meas
Ball λm ,
|m|=d
C ≤ πC 2 (2d)n−1−2ν . |m|ν
Therefore, if n − 1 − 2ν < −1 then meas
|m|>0
Ball λm ,
C 2 n−1−2ν (2d) , ≤ πC |m|ν ∞
d=1
and the result follows. 8
5
Necessity
In this section we prove the converse statement of the error dynamics linarization problem of interest. Proof of Theorem 2.3 Suppose that the change of coordinates z = θ(x) transforms the original system (1.2) to z(k + 1) = g(z(k)) y(k) = h(z(k)) = h(θ−1 (z(k)))
(5.3)
where 0 = θ(0), g(0) = 0 and h(0) = 0. Suppose there exists a nonlinear observer zˆ(k + 1) = gˆ(ˆ z (k), y(k))
(5.4)
such that the error dynamics is linear (1.6). Assuming z = 0, (1.6) yields Aˆ z = gˆ(ˆ z , 0).
(5.5)
Assume now that z˜ = 0. Then (1.6) implies g(z) = gˆ(z, h(z)).
(5.6)
Define β(ˆ z , y) = gˆ(ˆ z , 0) − gˆ(ˆ z , y) then A˜ z = g(z) − gˆ(ˆ z , h(z)) = gˆ(z, h(z)) − gˆ(ˆ z , h(z)) = gˆ(z, 0) − β(z, h(z)) − gˆ(ˆ z , 0) + β(ˆ z , h(z)) = Az − β(z, h(z)) − Aˆ z + β(ˆ z , h(z)).
(5.7)
Therefore β(z, h(z)) = β(ˆ z , h(z)).
(5.8)
Notice, that the left hand-side of this equation does not depend on zˆ, hence β(z, y) = β(y). Furthermore gˆ(ˆ z , y) = Aˆ z − β(y) g(z) = Az − β(h(z)).
(5.9)
so θ(x) must satisfy functional equation (1.11). Clearly β(y) is C d+1 .
6
Illustrative Examples
Example 1: Consider the following nonlinear discrete-time dynamic system: x2 (k) ) 1 + x1 (k) + x2 (k) − 1 − 0.4x2 (k) − 0.5 ln (1 + x1 (k) + x2 (k)) 1 + x2 (k) x2 (k + 1) = 0.5 ln (1 + x1 (k) + x2 (k)) + 0.4x2 (k)
x1 (k + 1) = exp (1.3
y(k) = x2 (k)
(6.10) 9
with (x1 , x2 ) = (0, 0) being an equilibrium point of (6.10) and 1 + x1 + x2 > 0 for all x1 , x2 ∈ R. The Jacobian matrix F of (6.10) evaluated at the above equilibrium point is:
0 0.9 F = . (6.11) 0.5 0.9 The eigenvalues of F are: λ1 = −0.358 and λ2 = 1.258, and therefore the above system is unstable (λ2 > 1). Furthermore, notice that the nonlinear discrete-time observer design method developed by Kazantzis and Kravaris [7] can not be applied to this case since the eigenvalues of F do not belong to the Poincar´e domain. Notice also that the system’s linear part is observable. Consider now the following matrix A:
0.5 0.3 (6.12) A= 0.5 0.4 with eigenvalues: µ1 = 0.8405 and µ2 = 0.0595. Under the above choice for A, its eigenvalues are of multiplicative type (C, ν) with respect to the spectrum σ(F ) of F . Moreover, the following output injection map was chosen:
y 1.3( 1+y ) − 0.3y β(y) = (6.13) 0 such that the pair (A, B) is controllable. Under the above remarks and suitable selection of the pair (A, B), the functional equation (1.11) is uniquely solvable in the class of real analytic functions (Theorem (2.2)) and assumes the following form:
x2 √ ) 1 + x1 + x2 − 1 − 0.4x2 − 0.5 ln (1 + x1 + x2 ), 0.5 ln (1 + x1 + x2 ) + 0.4x2 θ1 exp (1.3 1 + x2 = 0.5θ1 + 0.3θ2 + β1 (x2 )
x2 √ ) 1 + x1 + x2 − 1 − 0.4x2 − 0.5 ln (1 + x1 + x2 ), 0.5 ln (1 + x1 + x2 ) + 0.4x2 θ2 exp (1.3 1 + x2 = 0.5θ1 + 0.4θ2 θ1 (0, 0) = 0 θ2 (0, 0) = 0
(6.14)
where the last two equations (initial conditions) merely reflect the fact that equilibrium properties should be preserved under the nonlinear coordinate transformation. Indeed, the above functional equation (6.14) admits a unique analytic solution in closed form: θ1 (x1 , x2 ) = ln (1 + x1 + x2 ) θ2 (x1 , x2 ) = x2
(6.15)
Notice that θ1 (0, 0) = 0 and θ2 (0, 0) = 0, and therefore the initial conditions are satisfied. Moreover, since:
∂θ 1 1 J= (0, 0) = (6.16) ∂x 0 1 10
and det[J] = 0, the above solution θ(x) is indeed locally invertible around the equilibrium point (x1 , x2 ) = (0, 0) (diffeomorphism). The proposed discrete-time observer in z-coordinate is given by: z1 (k) + 0.3ˆ z2 (k) + 1.3( zˆ1 (k + 1) = 0.5ˆ
y(k) ) − 0.3y(k) 1 + y(k)
zˆ2 (k + 1) = 0.5ˆ z1 (k) + 0.4ˆ z2 (k) z1 (k)) − zˆ2 (k) − 1 x ˆ1 (k) = exp (ˆ x ˆ2 (k) = zˆ2 (k).
(6.17)
ˆ1 (k) and e2 (k) := x2 (k) − x ˆ2 (k). The error dynamics is shown in Fig.1, where Let e1 (k) := x1 (k) − x ˆ1 (1) = x ˆ2 (1) = 0. x1 (1) = 0.5, x2 (1) = 0.8 and x Error Dynamics of Example 1 1
0.9
0.8
0.7
0.6
0.5
0.4 e
1
0.3
0.2 e2
0.1
0
0
5
10
15
20
25 k
30
35
40
45
50
Figure 1: Error Dynamics for k = 50. Example 2: Consider the following nonlinear discrete-time dynamic system: y(k) ) − 0.9w(k) 1 + y(k) y(k + 1) = y(k) 1 − 0.5( ) + 0.9w(k) 1 + y(k) w(k + 1) = w(k) 0.5(
(6.18)
with y being the measured state variable and w an unknown constant parameter or disturbance term that needs to be estimated. The Jacobian matrix F of (6.18) evaluated at the origin is:
0.5 −0.9 F = . (6.19) 0 1 The eigenvalues of F are: λ1 = 0.5 and λ2 = 1. Notice that the nonlinear discrete-time observer design method developed by Kazantzis and Kravaris [7] can not be applied to this case, since one of the eigenvalues of F lies on the unit circle. Notice also that the system’s linear part is observable. Consider now the following matrix A:
0 0 A= (6.20) 0 0.1 11
with eigenvalues: µ1 = 0 and µ2 = 0.1. Furthermore, the following nonlinear output injection map was chosen: y(k) 0.5( 1 + y(k) ) . β(y) = (6.21) y(k) 1 + y(k) ∂β (0)) is controllable. ∂y Under the above remarks the functional equation (1.11) is uniquely solvable in the class of real analytic functions (Theorem (2.2)) and assumes the following form: Notice that the pair (A, B =
y ) − 0.9w
y 1+y θ1 ,w = 0.5( ) y 1 + y 1 − 0.5( ) + 0.9w 1+y y ) − 0.9w 0.5(
y 1+y , w = 0.1θ2 + θ2 y 1+y 1 − 0.5( ) + 0.9w 1+y θ1 (0, 0) = 0 0.5(
θ2 (0, 0) = 0
(6.22)
The above functional equation (6.22) admits a unique analytic solution: y + 0.9w 1+y 5 y ( + w) 2 1+y
θ1 (y, w) = θ2 (y, w) =
(6.23)
Notice that θ1 (0, 0) = 0 and θ2 (0, 0) = 0, and therefore the initial conditions are satisfied. Moreover, since:
∂θ 1 0.9 J= (0, 0) = (6.24) ∂x 2.5 2.5 and det[J] = 0, the above solution θ(x) is indeed locally invertible around the origin. The proposed discrete-time observer is then given by the following dynamic equations: y(k) ) 1 + y(k) y(k) z2 (k) + zˆ2 (k + 1) = 0.1ˆ 1 + y(k) 10ˆ z1 (k) − 3.6ˆ z2 (k) yˆ(k) = 1 − 10ˆ z1 (k) + 3.6ˆ z2 (k) z1 (k). w(k) ˆ = 4ˆ z2 (k) − 10ˆ zˆ1 (k + 1) = 0.5(
(6.25)
Let ey (k) := y(k) − yˆ(k) and ew := w(k) − w(k). ˆ Then the simulation of error dynamics is shown in Fig.2, where y(1) = 10, w(1) = −2π and zˆ1 (1) = zˆ2 (1) = 1. 12
Error Dynamics of Example 2 10
8
6 ew 4
2 e
y
0
−2
−4
−6
−8
1
2
3
4
5
6
7
8
9
10
k
Figure 2: Error Dynamics for k = 10.
7
Final Remarks
We have given a necessary and sufficient condition for a smooth nonlinear discrete-time system to admit an observer with linear error dynamics in smoothly transformed coordinates. The condition is that the functional equation (1.11) admits a smooth solution for some A and a smooth output injection term β(y). We have also shown that, for real analytic systems with observable linear part, the functional equation (1.11) is locally solvable for any real analytic β(y) that makes the eigenvalues of A of multiplicative type (C, ν) with respect to the spectrum of the linear part F of the dynamics. This allows one to construct a local observer with linear error dynamics in the transformed coordinates. If the system is only smooth, then the functional equation (1.11) is made formally solvable up to the desirable degree of smoothness by choosing the eigenvalues of A to be nonresonant with the spectrum of the linear part of the dynamics. This allows one to construct a local observer with approximately linear error dynamics in the transformed coordinates.
References [1] V. I. Arnol’d. Geometrical Methods in the Theory of Ordinary Differential Equations. SpringerVerlag, Berlin, 1988. [2] D. Bestle and M. Zeitz. Canonical form observer design for nonlinear time-variable systems. Int. J. Control, Vol. 38, No. 2, 1983, pp. 419-431. [3] S.T. Chung and J.W. Grizzle. Sampled-data observer error linearization. Autom., Vol. 26, 1990, pp. 997. [4] P. Glendinning. Stability, Instability and Chaos: An Introduction to the Theory of Nonlinear Differential Equations. Cambridge University Press, 1994. [5] H. J. C. Huijberts, T. Lilge and H. Nijmeijer. Nonlinear discrete-time synchronization via extended observers. Int. J. Bifurcations and Chaos, 7, 2001, pp. 1997-2006. 13
[6] N. Kazantzis and C. Kravaris. Nonlinear observer design using Lyapunov’s auxiliary theorem. Systems & Control Letters, 34, 1998, pp. 241-247. [7] N. Kazantzis and C. Kravaris. Discrete-time nonlinear observer design using functional equations. Systems & Control Letters, 42, 2001, pp. 81-94. [8] A. J. Krener and A. Isidori. Linearization by output injection and nonlinear observers. Systems & Control Letters, 3, 1983, pp. 47-52. [9] A. J. Krener. Approximate Linearization by state feedback and coordinate change. Systems & Control Letters, 5, 1984, pp. 181-185. [10] A. J. Krener and W. Respondek. Nonlinear observers with linearizable error dynamics. SIAM J. Control and Optimization, Vol. 23, No. 2, 1985, pp. 197-216. [11] A. J. Krener. Nonlinear stabilizability and detectability. In Systems and Networks: Mathematical Theory and Applications, U. Helmke, R. Mennicken and J. Saurer (eds.), Akademie Verlag, Berlin, pp. 231-250, 1994. [12] A. J. Krener and M. Xiao. Nonlinear observer design in the Siegel domain. SIAM J. Control and Optimization, to appear, 2002. [13] A. J. Krener and M. Xiao. Nonlinear observer design in the Siegel domain through coordinate transformation. Proc. of the 5th International Federation of Automatic Control, SaintPetersburg, Russia, 2001, pp. 557-562. [14] A. J. Krener and M. Xiao. Necessary and Sufficient Condition for Nonlinear Observer with linearizable Error Dynamics. Proc. of the 40th IEEE Conference on Decision and Control, Orlando, Florida, 2001. [15] A. J. Krener and M. Xiao. Observers for Linearly Unobservable Nonlinear Systems. Systems & Control Letters, to appear, 2002. [16] H. G. Lee and K. Nam. Observer design for autonomous discrete-time nonlinear systems. Systems & Control Letters, 17, 1991, pp. 49-58. [17] W. Lin and C. I. Byrnes. Remarks on linearization of discrete-time autonomous systems and nonlinear observer design. Systems & Control Letters, 25, 1998, pp. 31-41. [18] M. Xiao, N. Kazantzis, C. Kravaris, and A. J. Krener. Nonlinear Discrete-Time Observer Design with Linearizable Error Dynamics. Preprint, September, 2001.
14