Positional Control of a Flexible Structure using

0 downloads 0 Views 158KB Size Report
888-999, July 20-22, 1999 - ITA, São José dos Campos - SP - Brazil. 001. Positional Control of a Flexible Structure using Neural Networks. Wilson Rios Neto1 ...
Proceedings Proceedings of of theIV IVBrazilian BrazilianConference Conferenceon onNeural NeuralNetworks Networks--IV IVCongresso Congresso Brasileiro Brasileiro de de Redes Redes Neurais Neurais pp.888-999, 378-383,July July20-22, 20-22, 1999 1999 -- ITA, ITA, São São José José dos dos Campos Campos -- SP SP -- Brazil pp.

Positional Control of a Flexible Structure using Neural Networks Wilson Rios Neto1, Cairo Lúcio Nascimento Júnior2 e Luiz Carlos Sandoval Góes1 1 Divisão de Engenharia Mecânica-Aeronáutica - IEM 2 Divisão de Engenharia Eletrônica - IEE Instituto Tecnológico de Aeronáutica - ITA 12228-900 - São José dos Campos - SP E-mails: [email protected], [email protected], [email protected] Abstract

2. Analytical Models

This paper presents an adaptive inverse control approach for the positional control of an unconstrained multibody system with flexible appendages. The approach is called Feedback-ErrorLearning and it is based on the output of a feedback controller with fixed parameters to adapt a neural network which acts as a feedforward controller. The results are demonstrated by simulations using a high fidelity dynamic model of a experimental setup available at the ITA-IEMP Dynamics Laboratory.

2.1. Modelling the Unconstrained Multibody System A schematic view of the experimental setup is shown below in figure 1. The unconstrained system under consideration is composed of two flexible appendages attached to a rigid hub and is driven by a brushless DC motor. The sensors used are a tachometer and a potentiometer which measure the hub angular velocity and position, respectively. The unconstrained characteristic results from the natural motion without external influences, i.e., all the structure is allowed to vibrate and its solution involves both the inertia and the stiffness of the flexible parts.

1. Introduction The investigation of methodologies using artificial neural networks for control of lightweight materials with distributed flexibility in advanced space applications and in the construction of robotic manipulators has been the subject of intensive research in the recent years. Based on their inherent learning ability and in the massively parallel architecture neural networks are considered promising controller candidates, particularly for nonlinear and uncertain systems. In most space applications, the dynamic behavior deviates considerably from the analytical model and the plant state cannot be physically measured or resolved without model dependent state estimation. These characteristics have motivated the present research towards the development of neural network control methodologies that make use of real-time controller tuning and of output measurements. This effort resulted in the adaptive inverse control approach called Feedback-Error-Learning ([1], [2]) that uses the output of a previously adjusted feedback controller with fixed parameters to adapt a neural network which acts as a feedforward controller. This paper describes the analytical model of a unconstrained multibody system ([3],[4]) and the adaptive control approach. The simulations of the positional control with potentiometer feedback of the experimental setup at the ITA-IEMP Dynamics Laboratory are also presented.

fle xible appendages

acc elerom e ter

D /A - PC A /D

potentiom eter

hub

air

strain-gag e

flexible coupling air supply

DC m otor + tachom eter

Figure 1 - Experimental setup Applying Hamilton's principle and introducing the transformation of variables z (x , t ) = y (x , t ) + x θ(t ) , we determine the differential equations for this hybrid system:([3], [4]) ρ z + E I z iv = 0

001 378

(1)



L

I hub θ + ρ x z dx + m t L z L = τ 













0





ρ φ r φ s dx + I hub φ′r (0)φ′s (0) + macel φr (L )φ s (L ) = δ rs





(10) The discrete model of the system is obtained by Ritz's Assumed Modes Method. In this method the elastic displacement can be described as: y (x , t ) =

(3) (4)

ρ ω2 ρ ω2 φ and λ4 = where φ = . EI EI The above problem admits unconstrained shape function:



 Iθ  M =  M η1θ  M  η2 θ

possible

q = [θ η1 The constants A , B , C and D are chosen for normalization purposes. Evaluating the boundary conditions in equation (7), we obtain a set of homogeneous equations. For a nontrivial solution, the determinant of the coefficients must vanish, given the following characteristic equation: [3]

I hub

Iθ = Jm +

0

  0   K η 2η2 

(13)

0

[

η 2 ]T , F = Fθ

Fη1

(14)

Fη 2

]T

(15)

2

L

∑ ∫ m ( x ) (x + r ) dx 2

i

+ m t (L + r )2 (16)

M η1θ = m (x ) (x + r ) φ1 (x 1 ) dx 1 + m t (L + r ) φ1 (l) (17)

∫ 0

L

M η2 θ = m ( x ) (x + r ) φ(x 2 )dx 2



acl

(18)

0

3

L

(8)

M η1η1 = m (x ) (x + r ) φ2 (x1 ) dx1 + m t (L + r ) φ2 (L ) (19)



5

acl

3

M η1η1

i =1 0

5

3

T M η 2θ   0   M η2η 2  

T

L

hub

hub

M η1θ

with:

6

3

(12)



0 0  K = 0 K η1η1  0 0 

φ (x ) = Asin(λx ) + B cos (λx ) + Csinh(λx ) + D cosh (λx ) (7)

[I λ (− 1 − cos(λL)cosh(λL)) λ + (γλLρ − λ Lm )sin(λL ) + (γI λ − λ ρ)cosh(λL ) sin(λL ) + (Lλ m + γLλρ )sinh(λL ) + (λ ρ − γI λ )cos(λL ) sinh(λL )

(11)

j

where:

iv

2

j

M q+K q = F

(6)

the

∑ φ (x )η (t ), 0 ≤ x ≤ L

where φ(x ) are unconstrained shape functions and η(t ) are time-varying coefficients. Applying Lagrange method in equations (1) and (2) and using the orthogonality relationships, the following matrix equation is obtained for the modes:

If we use: θ(t ) = θ cos(ωt ) and z (x , t ) = φ(x )cos(ωt ) , where θ is the modal amplitude of the rotational movement and φ(x ) the unconstrained shapes function; in equation (1) then: E I φiv = ρ φ ω2

N

j=1

(5)



L

0

with the following momentum balance of the hub [3]: EIz ′′(0) − I hub z ′(0) + τ m = 0

(9)

0

where L is the length of appendage, x is the position coordinate in the beam, y is the appendage deformation, θ is the constrained hub inertial rotation and the line (’) and dot (⋅) denote partial derivative with respect to space and time, respectively. The boundary conditions for a clamped-free system at x = 0 and x = L are: z (0) = 0 , z ′(0 ) = θ E I z ′′(L ) = 0 , E I z ′′′(L ) − macel z L = 0

L

E I φ′r′ φ′s′ dx = ω 2r δ rs

(2)

0

3

L

hub

M η2 η 2 = m (x ) φ2 (x 2 ) dx 2



− 2γρ sin(λL ) sinh(λL )] = 0

0

with the following orthogonality relationships:

002 379

(20)

FF Controller

refk

Neural Network xk+1 = f(xk , uk)

ukNN Z-1 refk

Z-1

refk-1

refk-L +

+

-M

Z

refk-M

ek

Feedback Controller



xk+1

+ ukFB

uk

yk

ykd = refk-M

xk Z-1

Plant

xk

Sensor

Figure 2 - Adaptive inverse control approach Feedback-Error-Learning

2.2. The Neural Network Combining this equations, we have: The figure 2 shows the adaptive control approach used in this work. In relation with the approach proposed by Kawato et al. ([1], [2]) we introduced two modifications: a) the second order derivatives of the reference signal were replaced by a tapped delay line of length L, and b) a delay of M sampling periods were added to the reference signal. Assuming that the plant is a linear dynamic system, stable, SISO (single input single output), time-invariant with transfer function in the discrete domain G(z) given by:

[

Y (z ) G (z ) z − M G FB (z ) + G NN (z ) = Ref (z ) 1 + G (z )G FB (z )

[

U FB G FB (z ) z − M − G (z )G NN (z ) = Ref (z ) 1 + G (z )G FB (z )







]

(27)

Y (z ) = z− M Ref (z )

(28)

U FB (z ) E(z ) = =0 Ref (z ) Ref (z )

(29)



However, since it is assumed that the neural network G NN (z ) has the structure defined by the equation (22) with a finite number (L+1) of coefficients, the neural network could be seen as a truncated representation of the pulse response of the inverse model of the plant delayed by M sample periods. Assuming that the closed-loop pulse response of the plant when using just the feedback controller (without the neural network) is stable, then the polynomials γ (z ) e φ (z ) , defined below, converge:

where Na and Nb are non-negative integers, and Na ≥ Nb. Let's assume that the feedforward controller is a linear filter with finite impulse response [5], that is, a linear neural network. Therefore the transfer function of neural network will be given by: G NN (z ) =

(26)

If G NN (z ) = z − M G (z ) , then from eqs. (26) e (27):

Y(z ) b0 z Nb + b1z Nb −1 + + b Nb = U(z ) a 0 z Na + a1z Na −1 + a 2 z Na − 2 + + a Na (21) ∞ B (z ) −i −1 −2 = = βi z = β0 + β1z + β2 z + A(z ) i = 0

G (z ) =

]

U NN (z ) = α 0 + α1z −1 + Ref (z ) 

+ α L z − L (22)

And from figure 1: E(z ) = z G

FB

−M

Ref (z ) − Y (z )

FB (z) = U (z ) E(z )

U(z ) = U FB (z ) + U NN (z )

γ (z ) = (23) φ(z ) =

(24) (25)

G FB (z )

1 + G (z )G FB (z ) G (z )G FB (z )

1 + G (z )G

FB

(z )

=



∑γ z

−j

j

=



∑φ z

−j

j

j =0

Combining eqs. (30) e (31) with eq. (27):

003 380

(30)

j =0

(31)

U

FB

(z ) = [ z

−M

γ (z ) − φ(z )G

NN

(z )] Ref (z )

[(

(32)

 ∞    ∞  Fil′j′ = E   φ i ref k −i − (i′−1)   φ j ref k − j−( j′−1)       j=0 î  i =0  (40)



=

( ) ref

where:

α = [α0 *

ref k* = [ref k

α1

T

* k 

2

αL ] 

refk −1 

[ (

{[

][

= E φ(z )ref k* φ(z )ref k*

)]

] }∆ F T

l

(37)



∑∑ γ φ ρ [ i

j

i − j+ M − (i ′−1)]

2.3. The Feedback-Error-Learning Rule Considering αˆ i (k ) the estimated value of the neural network coefficient α i at time step k, a possible simple approach is to use an algorithm like gradient descent. In this approach the neural network coefficients are changed in the direction that decreases the cost function J , that is:

The matrix F l is the correlation matrix of vector φ(z ) ref k* with dimension L+1 by L+1. From eqs. (32) and (36): F lα * = F r



   ∞   φ j ref k − j−(i′ −1)      j=0   (41)

where 1 ≤ i′ ≤ L + 1 e 1 ≤ j′ ≤ L + 1 . Notice that: 1) F l is a symmetric matrix with the Toeplitz (banded) structure common to covariance matrices, and 2) eqs. (40) and (41) assume that the polynomials γ (z ) and φ (z ) converge. The linear system equation (38) will have a unique solution if the reference signal is exciting enough such that F l is positive definite. The calculated neural network parameters α i (0 ≤ i ≤ L) will be the set that minimizes the mean value of the square of the feedback controller output. The specific values of L and M will determine how small the minimum of the cost function J is. However, the matrices F r e F l can only be evaluated if the transfer function of the plant G(z) is precisely known. If the transfer function G(z) is not precisely known or is slowly time-variant, we can use a learning algorithm to searches for the neural network parameters that minimize the cost function J .

The cost function J will have minimum if the matrix ∂ 2 J ∂α *2 is positive definite, where:

∂α *2

i − j+ i′− j′]

j

(35)

 FB ∂u FB  ∂J * T k = = E u FB E u k k − φ(z )refk = 0 (36) * *  ∂α ∂ α  

∂2J



i =0 j=0

0]T = 0T , where:



i

=

Notice that α e are column vectors with L + 1 components. It is important to show, that under certain conditions: 1) the scalar cost function J has a unique minimum; 2) the parameters of the neural network that minimize the cost function J could be used as an approximation of the delayed inverse model of the plant G (z ) . The stationary points of J are given by ∂J ∂α* = [0 0

∑∑ φ φ ρ [ ∑

ref k*

*



 ∞  Fi′r = E   γ i ref k −i − M î  i =0 (34)

ref k − L ]T





i =0 j=0

(33)

  

T

(39)

the element (i ′) of F r could be written as:

( )

1  FB 2  E uk  2  1  − M = E  z γ (z )ref k − φ(z ) α * 2 

)]

where F r is a column vector of L+1 rows. Considering the reference signal as stationary, i.e., l ′ ′ E[ref k ref k ±i ] = ρ i = ρ −i , the elements (i , j ) of F and

The estimation of the coefficients of the neural network G NN (z ) in eq. (22) can be defined as an optimization problem. In this case it is desired to find the coefficients of neural network that minimize the mean value of the square of the feedback controller output, defined as the following scalar cost function J: J=

)(

F r = E z −M γ (z ) ref k φ(z ) ref k*

(38)

where:

 ∂J  ˆ i (k + 1) = α ˆ i (k ) − η  α   ∂α i  α* = αˆ * (k )

381 004

(42)

where η is the learning rate; i = 0, 1, ..., L; and:

αˆ * (k ) = [αˆ 0 (k ) αˆ 1 (k )

αˆ L (k )]T 



∑φ

j

ref k -i - j

0 0 0   K = 0 5543.12066 0  0 0 34672.54922 

(49)

(44) Assuming that the reference signal ref k is such that ρ i = ρ− i = 0 for i ≥ 2 , we can rewrite the equations (40) e (41) as:

j= 0

However the polynomial φ(z ) , the closed loop transfer function without the neural network, cannot be assumed to be known a priori, since it would imply the knowledge of G(z). Therefore we propose to replace the polynomial φ(z ) by a guessed known polynomial λ (z ) , where λ (z ) should be a coarse approximation of φ(z ) [6]. So in this work, instead of using eq. (41) as our learning algorithm, we proposed to use the following rule, called Feedback-Error-Learning Rule:

Fil′j′ = ñ0



∑ i =0

φ i φ i + (i′− j′ ) + ñ1



∑ φ [φ i

i=0

i −1+ (i ′− j′ )

Fir′ = ñ0



∑ã φ

i i − i′ + M +1 + ñ1

i=0



∑ ã [φ i

i −i′+ M

+ φi −i′+ M + 2 ]

i =0

where, by definition, φi = 0 for i < 0 . This reference signal refk was generated by:

(45)

ref k = S 0 s k + S1 s k −1

where:

λ* = [λ0 λ1





+ λNL z − NL

λNL ]T

]

(51)

j= 0

λ (z ) = λ0 + λ1z −1 +

+ φ i +1+ (i′− j′ )

(50)

NL

αˆi (k + 1) = αˆi (k ) + η u FB k ∑ λ j ref k − i − j

(48)

(43)

From eq. (16): ˆ i (k + 1) = α ˆ i (k ) + η u FB α k

0.00767 0.0417 0.06255   M = 0 1 0     0 0 1 

(46)

(52)

where sk is a white noise sequence uniformly distributed between –1 e 1. Setting S 0 = 1 and S1 = 0.7 , then, by definition:

(47)

Our experiments show that in many cases we can simply use λ* = λ0 = 1 , for example, in plants where the feedback controller was adjusted in such a manner that the polynomial φ(z ) converge quickly without oscillations. Note that, as the quality of the feedback controller increases, the approximation error introduced by using λ (z ) instead of φ(z ) decreases, since φ(z ) will converge more rapidly to zero.

1   ρ = , para i = 0 E[sk sk ±i ] =  s 3 î 0, para i ≠ 0

(

(53)

)

1.49  2 2  S0 + S1 ρs = 3 , se i = 0  0.7  E[ref k ref k ±i ] = ρi = S0 S1 ρs = , se i = 1 (54) 3  0, se i > 1  î

3 - Numerical Simulations

Using a PID controller as the feedback controller, then:

In this section we show the simulation of the positional control of the unconstrained multibody system with potentiometer feedback. The input and output of the plant are respectively the applied torque Fθ and the angular position θ measured by the potentiometer. Therefore the plant transfer function is given as G (z ) = Fθ (z ) / θ(z ) . The mass and the stiffness matrices of the unconstrained system, eqs. (13) and (14), were calculated using MATHEMATICA, a symbolic manipulator program:

G

FB

(z ) =

(K p + K i + K d ) z 2 − (K p + 2 K d ) z + K d z2 − z (55)

and the following gains were selected: K p = 1.0 , K i = 0.0 , K d = 9 . The parameters L and M were chosen respectively as 35 and 1.

382 005

The theoretical solution for the neural network parameters α * can be calculated using eqs. (38), (50) and (51). The plant transfer function G(z) can be calculated using eq. (12). The approach Feedback-Error-Learning was simulated during 2600 seconds, the first 2500 seconds were the learning phase of the neural network and the other 100 seconds were the test phase. The neural network coefficients were initialized as zero such that in the beginning of the learning phase only the feedback controller was used. During the learning phase the Feedback-Error-Learning Rule (eq. (45)) was applied using λ* = λ0 = [0 1] , η = 0.105 , and the reference signal refk was generated as describe in eq. (52). Figure 3 shows the error signal ek (input of the feedback controller) during the learning and test phases. Figure 4 shows the theoretical neural network parameters α * and their estimated values áˆ* (experimental solution) at the end of the learning phase.

4. Conclusions This paper shows how to perform positional control of a unconstrained multibody system using neural networks trained by the Feedback-Error-Learning Rule. The simulation results show that small tracking errors and good convergence for the neural network parameters can be achieved. [6] Rios Neto et al. [7] and Nascimento Jr. [8] show respectively how this neural control approach can be applied to non-minimum-phase linear and non-linear plants as well. Future research will investigate the performance of this neural control approach when used to suppress vibration in real-time.

References [1] Kawato, M., Furukawa, K. & Suzuki, R. (1987). “A Hierarchical Neural Network Model for Control and Learning of Voluntary Movement”, Biological Cybernetics, vol. 57, pp. 169-185. [2] Kawato, M. (1990). Computational Schemes and Neural Network Models for Formation and Control of Multijoint Arm Trajectory, in W. T. Miller, R. S. Sutton & P. J. Werbos (Eds.), Neural Networks for Control, pp. 197228, Cambridge, USA, Bradford Books/MIT Press. [3] Góes, L. C. S., Negrão, R. G., Rios Neto, W. (1998). Modeling and Control of Multibody System with Flexible Appendages., in: Balthazar, J. M., Gonçalves, P. B. and Clayssen, J. (Editors), Nonlinear Dynamics, Chaos, Control and Their Applications to Engineering Sciences. Vol 2: Vibrations with Measurements and Control, published by Brazilian Society of Mechanical Sciences ABCM, Brazilian Society of Computational and Applied Mathematics - SBMAC and Society for Industrial and Applied Mathematics - SIAM, pp. 75-91. [4] Barbieri, E., Özgüner, Ü. (1988). Unconstrained and Constrained Mode Expansions for a Flexible Slewing Link. Journal of Dynamic Systems, Measurement, and Control. vol. 110, pp. 417. [5] Widrow, B. & Stearns, S. (1985). Adaptive Signal Processing, Prentice-Hall, USA. [6] Rios-Neto, W., (1998). Controle de um Sistema com Apêndices Flexíveis usando Redes Neurais (Control of a System with Flexible Appendages using Neural Networks), Master Thesis, Instituto Tecnológico de Aeronáutica, São José dos Campos, Brazil. [7] Rios Neto, W., Nascimento Jr., C. L., Góes, L. C. S. (1998). Controle Adaptativo Inverso usando FeedbackError-Learning (Inverse Adaptive Control using Feedback-Error-Learning), Proceedings of the 12th Brazilian Automatic Control Conference, vol. I, pp. 351356, 14-18 Sept., Uberlândia, Brazil. [8] Nascimento Jr., Cairo L., (1994). Artificial Neural Networks in Control and Optimization, PhD Thesis, Control Systems Centre, UMIST, UK.

Figure 3 - The error signal ek (input of the feedback controller) during the learning and test phases 200

100

0

-100

-200

-300

-400 0

5

10

15

20

25

30

35

40

Figure 4 - The neural network parameters α * (o) and their estimated values áˆ* (+)

383 006

Suggest Documents