Neuroengineering Model of Human Limb Control - IEEE Xplore

Proceedings of the 46th IEEE Conference on Decision and Control New Orleans, LA, USA, Dec. 12-14, 2007

FrC01.5

Neuroengineering model of human limb control - Gainscheduled feedback control approach Kazutaka Takahashi and Steve G. Massaquoi Abstract— Recent neuroengineering modeling suggests that human control of limb movements may take advantage of simple linear control modules that are gainscheduled according to limb state, intended limb state, or both. The scheduling variable(s) appear to consist of limb position and possibly velocity. Thus it is possible that limb control can be modeled effectively using Linear Parameter Varying (LPV) techniques for designing gainscheduled linear control systems. We demonstrate the efficacy of an LPV controller based on a model of cerebrocerebellar neuroengineering models in simulating human arm control of moderate speed, direction-changing movements in the horizontal plane. The technique shows how position dependent dynamics can be stably and realistically controlled using a simple convex interpolation of linear controller modules. The finding could provide a systematic method of representing human motor control using an established engineering technique. It may also support the controversial view that the motor control system does not require internal models of body dynamics to achieve satisfactory performance.

I. I NTRODUCTION The cerebrocerebellar system is central to motor control [1], [2], [3] and has been characterized in terms of its anatomical connections among the areas in the system [1], [3] and in terms of the physiology of each area or network of cerebral cortical areas [4], [5], [6], [7]. The cerebellum is intimately connected to almost all major motor and sensory areas. Furthermore, cerebellar pathology usually results in uncoordinated movements [8] or errors of directions, force [9], amplitudes and delayed movement initiations [10]. Therefore, an importance of the cerebellum in motor control should be emphasized in the analysis and modeling of cerebrocerebellar system. A number of studies have examined simple spike (SS) firing in cerebellar Purkinje cells (PCs) [11], [12], [13]. Those analyzing the firing patterns in anterior intermediate and lateral cerebellum (motor cerebellum) have generally found correlations between PC SS frequency and position and/or velocity. However, a functional model that accounts for the temporal details of these signals during arm movement control is still lacking. In regard to modeling, it is important to note that PCs have been found to fire in relation to both passive and active motion of the body part, though perhaps not as vigorously in the former condition as in the latter [14], [15]. This suggests that PCs may be involved in both monitoring and controlling K. Takahashi is with Department of Organismal Biology and Anatomy, University of Chicago, 1027 E. 57th St. Rm. 202, Chicago, IL 60637, USA

[email protected] S. G. Massaquoi is with the Department of EECS, LIDS, CSAIL, MIT-Harvard Division of Health Sciences and Technology, Massachusetts Institute of Technology, 32 Vassar St. Rm. 214, Cambridge, MA 02139, USA [email protected]

1-4244-1498-9/07/$25.00 ©2007 IEEE.

body parts. Because the cerebellum is a site of considerable convergence of both peripheral sensory information via the spinocerebellar tracts [16] and brainstem counterparts, and copies of motor outflow via pontine nuclei [17] it is natural to consider that PC activity may be a function of both sensory information and motor outflow. This would be consistent with the observation that interpositus (IP) and dentate firing activity modulates during point-to-point movement control [18], [19], passive body movement [15] and postural maintenance [20]. Thhis observation is not of trivial consequence because important motor control models emphasizing feedforward inverse dynamics control based on desired, rather than sensed, movement trajectories (e.g., [21], [12], [22]) do not predict that sensory signals in the principal cerebellar movement control signals. Other formulations [23], [24], [25] suggest sensory signals could be prominently represented in at least some PCs. Importantly [12] showed that PC SS firing activity can be fit by a linear combination of signals needed for dynamic control of the eyes, the data do not specifically exclude the possibly strong dependence of the PC signal on sensory information. Thus, while the precise mechanism of cerebellar motor control has not been established, it is very likely to have a strong dependence on feedback signals. However, if, as has been conjectured more recently [23], [26], [27], [28] the control does not involve feedforward models of inverse dynamics, it remains to be demonstrated how a heavily feedback-dependent control system alone could afford sufficient control despite signal transmission delays. A. RIPID and RICSS models Recently, the Recurrent Integrator PID (RIPID) cerebrocerebellar control model posits that certain recurrent signals from cerebellum stabilize long-loop proprioceptive responses by processing error-like signals so that they may participate strongly in both postural maintenance [29], [28] and pointto-point movement control. Combining the studies of intracortical connectivity [30] with recent data on cerebrocerebellar anatomy [3], and established concept of distributed signal representation as population vectors [4], a slightly more detailed but still simple picture of the cerebrocerebellar architecture is proposed as in Fig. 2. The Recurrent Integrator Cerebellar Simple Spike (RICSS) model (Figs 2, 3) is demonstrated to account well for SS activity in PCs of behaving primates [26] while remaining functionally consistent with the RIPID model. Fig 3. depicts the cerebellar component. An array of PCs as would lie within a microzone [31] projects to a single group of deep cerebellar nuclei to form a functional corticonuclear microcomplex [32]. PCs are

5826

46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 Area 4

Area 5

θcom (t)

+

+

θtarget (t)

u(t)

+

Ia /s −

FrC01.5

MC

4

−

tef f

P (u)

θ(t)

+

F3

ecb

+

2

edir· cb

Gb s Gk

1 −

−

I2/s

I1 /s

F2

Area 1,2,3

I3/s

Cerebellum

ucb ucb +

ucb

+

3 +

ρ

taf f

Fig. 1. One version of RIPID model. Colored circles designate functional anatomic areas related to the RICSS Model (Fig 2). Inner feedback loop from area 1 (green) containing block I2/s (”recurrent” integrator) provides phase lead that stabilizes the long outer feedback loop.

SMC-1

MC

ecb(t) r

φdir r

θ com(t)

Area 4

φe

2 units based on behavioral intent (”motor set” by [2]). Thus, it is conceived that different subpopulations of PCs are engaged according to the activity of the group of selPFs. If PCs implement the control gains Gb, Gk, I1 and I2 in Fig 1, and selPF activity reflects the state of the plant and of behavioral intent, then the circuitry in Fig 3 can be viewed as a general mechanism for context-dependent scheduling of control gains according to the scheduling signals ρ(t).

Shoulder SMC-3

SMC-2 r′

SMC-4 Elbow

F2θ(t − taf f )

Area 3a

edir− cb

Fig. 3. Cerebellar cortical architecture proposed to underlie PC SS activity.

edir+ cb

ucb

B. LPV

Cerebellum

Fig. 2. Cerebral cortical component of RICSS model from the perspective of a single cerebellar PC. Recurrent integrator loop is not shown. 8 cerebral cortical columns in Sensorimotor Cortical Area 3a (SMC-1, area 1 in Fig.1) implementing a neural population-based representation, e.g. [4], of the tracking error-like vector (red arrow) and subsequent distribution after cerebellar processing.

assumed to receive two types of parallel fiber (PF) input. Descending signals edir· cb (t) from area 3a or 4 travel by mossy fibers designated here as signal mossy fibers (sigMF) to reach signal PFs (sigPF) that are considered here to be those whose ascending axons synapse multiply on proximal PC dendrites to afford a strong excitatory connection [33], [34]. A given set of sigPF inputs is presumed to synapse on many PCs within the microzone and on the associated deep nuclei. On the other hand, the distal dendrites of each PC are influenced more subtly by passing PFs [33], [34]. These are termed here selector PFs (selPFs). The principal hypothesized action of selPFs is to inhibit laterally adjacent PCs via basket cells (solid red in Fig. 3). Thus, each PC is potentially suppressible by ’beams’ of active PFs to either side. Conversely, PC activity along an active beam of selPFs is comparatively preserved. This mechanism is consistent with the experimental observations of active centers and inhibitory surrounds [35]. The quantitative formulation is very similar to that in [29] but updated to be more consistent with recent work [33], [34]. SelPFs are assumed to be supplied by spinocerebellar tracts among other pathways and thus to carry body state information [36] and other context variables. Based on the structure above, PC SS firing depends on both descending sigPF inputs and modulation by coincident selPF activity. It is assumed that in general the descending cortical inputs may come from different SMC-

As it appears that scheduling control gains may be a mechanism by which the nominally linear RIPID/RICSS circuitry could affect general control throughout extended workspaces, an analytical input-output description is desirable. Linear parameter varying (LPV) formulations have a similar structure to that suggested by the RICSS model and possesses theoretical tractability. Therefore, LPV models were tested on data from human subjects performing double step tasks that required a rapid change in movement direction. The LPV synthesis procedure discussed here applies to affine parameter-dependent plants described below:   x˙ P (·, ρ) = y   z

= A(ρ)x + B 1 (ρ)w + B 2 u = C 1 (ρ)x + D11 (ρ)w + D12 u = C 2 x + D 21 w + D 22 u

(1)

where

ρ(t) = [ρ1 (t), . . . , ρn (t)],

ρi ≤ ρi (t) ≤ ρi

(2)

is a time-varying vector of scheduling variables each component of which is bounded by its minimum and maximum values ρi and ρi respectively, n is the number of the scheduling variables, A(·), B1 (·), C1 (·), and D11 (·), are affine functions of ρ(t). Note that ρ(t) may contain part of the state vector itself , assuming that the corresponding states are accessible to measurement. If the scheduling parameter vector ρ(t) takes values in a box of Rn with corners {Πi }, i = 1, . . . , N = 2n , the plant P (·, ρ) ranges in a matrix polytope with vertices V (Πi ). Namely, given any convex decomposition

5827

ρ(t) =

N X i=1

αi Πi ,

αi ≥ 0,

N X i=1

αi = 1

(3)

46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007

FrC01.5

of ρ over the corners of the parameter box, then the parameter dependent system is given by P (·, ρ) =

N X

αi V (Πi ).

where ˜ 12 = N

(4)

i=1

ρ

r(t)

+

G(ρ)

P (ρ)

y(t)

−

ρ = R(y(t))

Fig. 4. A realization of RIPID/RICSS structure as a gain-scheduling control system. For the current study, R(y(t)) = ρ(θe ) as shown in Eqn’s 20, 21.

Thus, we would like to find a set of parameter dependent controllers taking the following form: G(·, ρ) =

(

ξ˙ = AG (ρ)ξ + B G (ρ)y , u = C G (ρ)ξ + D G (ρ)y

(5)

with the following vertex property: PN Given the convex decomposition ρ(t) = i=1 αi Πi of the current parameter value ρ(t), the values of AG (ρ), B G (ρ), C G (ρ), and D G (ρ) are expressed as a linear combination of AG (Πi ), B G (Πi ), C G (Πi ), and DG (Πi ), i = 1, · · · , N at the corners of the parameter box by

AG (ρ) B G (ρ) C G (ρ) D G (ρ)

=

N X

αi

i=1

AG (Πi ) B G (Πi ) C G (Πi ) DG (Πi )

•

•

0

and all the admissible trajectories ρ.

N 21 0 ˜ 21 = N , 0 I A(Πi ) B(Πi ) C 1 (Πi ) D11 (Πi )

∞

where S and T are a sensitivity function and a complementary sensitivity function defined as −1

S

= (I + P (ρ)G(ρ))

T

= P (ρ)G(ρ) (I + P (ρ)G(ρ))

−1

.

II. LPV FORMULATION FOR THE TWO - LINK ARM PLANT Formulation here considers the nonlinearity only due to inertia and viscosityi.e., no nonlinearity in muscle dynamics:

.

the vertex property (Eq.(6)) the closed-loop system is stable for all admissible parameter trajectories ρ(t) a guaranteed L2 -gain bound γ > 0 for the closed loop system from the generalized disturbance signal w to the error signal z, i.e., Z T Z T z T z dt ≤ γ 2 wT w dt, ∀T ≥ 0 (7) 0

N 12 0 , 0 I Ai B 1i = C 1i D 11i

for i = 1, . . . , N and N 12 and N 21 are bases of the null spaces of (B T2 , DT12 ) and (C 2 , D 21 ), respectively. Recall that N = 2n where n is the number of the scheduling parameters. Thus, there will be 2n + 1 linear matrix inequalities (LMI) such as ones in Eqn’s 8 ∼ 10. The assumption that B 2 , C 2 , D12 , and D 21 to be independent of ρ(t) will be satisfied by placing a lowpass filter, an excitation-contraction (EC) coupling filter to characterize a gradual force development [27], so that the overall plant is realized as shown in Eq (1). To enforce the performance and robustness requirements, we can use the loop shaping criterion summarizing RMS gain constraint

T

(11)

W 1S W 2T

< 1,

(6) The objective is to design a gain-scheduled controller G(·, ρ) that satisfies: •

τ (t) τ (t)

¨ + C(θ(t), θ(t)) ˙ ˙ = H(θ(t))θ(t) θ(t) = Ku(t)

(12) (13)

u(t)

= EC (r(t) − θ(t − taf f ))

(14)

˙ are the inertia and viscosity where H(θ) and C(θ, θ) matrices respectively, and K is a constant muscle stiffness matrix, and the torque τ is a linear function of the input signal u which is the output of the EC filter. Eq. (14), should be interpreted as a mapping of the difference between the reference signal r(t) and the delayed afferent joint angles θ(t − taf f ), where taf f is the afferent delay amount, and the output of the EC filter, u through the LTI system EC. The inertia matrix can be seen as an affine function of the elbow angle θe :

Theorem [37]: Suboptimal scaled H∞ problem is solvable H(θ) = H 0 + cos θe H 1 , (15) if and only if there exist pairs of symmetric matrices L ∈ where Rn×n and M ∈ Rn×n such that   h1 + h2 + m2 l12 h2 Ai L + LATi LC T1i B 1i H0 = , (16) T h2 h2 C 1i L −γI D 11i  N˜12 < 0, (8) N˜12  2 1 B T1 DT11i −γI . H 1 = m2 l 1 l 2  T  1 0 T Ai M + M Ai M B 1i C 1i T B T1i M −γI D T11i  N˜21 < 0, (9) N˜21  Following the similar manner, the viscosity matrix ˙ can be affinely decomposed as follows: C 1i D11i −γI C(θ, θ) L I ≥ 0, (10) ˙ = C 0 + sin θe θ˙s C 1 + sin θe θ˙e C 2 , (17) C(θ, θ) I M

5828

46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 where C 0 = 0 and 0 C 1 = m2 l 1 l 2 1

−1 0

, C 2 = m2 l 1 l 2

−1 −1 0 0

.

(18)

Thus, the arm dynamics can be expressed as: ¨ = −C(θ, θ) ˙ θ˙ + Ku, H(θ)θ ˙ = Ap (ρ)Θ + B p (ρ)u, Θ = θ˙ ⇒ Θ θ = C p Θ.

θ

T

,

The input to the EC filter, uEC is related to the output, u as in the following transfer function κ2 1 0 U EC (s), U (s) = (s + κ)2 0 1

where U (s) and U EC (s) are the Laplace transforms of u and uEC respectively. Its relation in the state space form is: ζ˙ = AEC ζ + B EC uEC u

= C EC ζ.

Thus the augmented parameterized plant from the input of the EC filter to the joint angles is AP (ρ) B P (ρ)C EC 0 ˙ ζa = uEC ζa + 0 AEC B EC | | {z } {z } Ba

Aa (ρ)

θ

=

C p 0 ζ a , where ζ a = | {z } Ca

Θ ζ

T

,

where only Aa (ρ) is parameter dependent. In the analysis below, only the configuration dependency, i.e., the effect of the elbow angle, θe through the inertial matrix H(θ) is examined. This is because of the following reasons. First, including angular velocity terms requires exponential addition of LMI’s to be solved and implementation of corresponding controllers. As there are two parameters in the viscosity matrix in Eqn 17, i.e., sin θe θ˙s and sin θe θ˙e , total increase of the number of the scheduling parameters would be 2 × 2 where additional 2 comes from the inversion of the inertial matrix. Secondly, the viscosity matrix contains terms with joint velocities, sin θe θ˙s and sin θe θ˙e . Thus, the size of workspace alone is not sufficient to estimate the bounds of the scheduling parameters. Therefore, in the case under the consideration, i.e., θ˙e = θ˙s = 0, there is effectively only one parameter, θe characterizing the scheduling variables. Thus, Ap (ρ) is now a constant matrix, i.e., Ap (ρ) = [ 0I 00 ] . Yet, H(θ)−1 in B p (ρ) contains two parameters both of which are functions of θe : h2 −h2 −1 H(θ) = ρ1 (θe ) −h2 h1 + h2 + m2 l12 0 −m2 l1 l2 (19) +ρ2 (θe ) −m2 l1 l2 2m2 l1 l2 1 ρ1 (θe ) = , (20) 2 h1 h2 + h2 m2 l1 − (m2 l1 l2 cos θe )2 cos θe ρ2 (θe ) = . (21) h1 h2 + h2 m2 l12 − (m2 l1 l2 cos θe )2

FrC01.5 Thus, R(·) in Fig. 4 for this particular formulation is T R(y(t)) = [ρ1 (θe (t)); ρ2 (θe (t))] . Note that this formulation does not account for the rate variation of ρ1 and ρ2 directly in addition to excluding the joint velocities. Thus, the arm dynamics here is yet a subset of the arm dynamics only containing the inertial effect. In order to use a polytope based LPV synthesis, we need to set, or find, the lower and upper bounds for all the scheduling parameters, ρi to define the vertices. To find the bounds on ρi , the minimum and maximum values of ρi are taken over the possible range of the elbow angles computed from the size of the workspace through the inverse kinematics such that ρi = min ρi , ρi = max ρi , ρi ≤ ρi (t) ≤ ρi , ∀i , ∀t. Low frequency tracking performance can be enhanced with a choice of the weighting W 1 being a lowpass filter. The following parameter independent frequency weighting is used: 25 I. W 1 (s) = (s + 5)2 The weighting filter for the complementary sensitivity function, W 2 (s) is set to be unity. To compare the performance of a resultant LPV controller, one H∞ controller based on a linearized plant around the center of the workspace was designed with the same weightings. III. H UMAN DOUBLE STEP MOVEMENT EXPERIMENT Each subject sat on a chair with his/her body stabilized by a back support as well as a four-point seat belt that restricted body motion substantially to two degrees of freedom of the arm in a horizontal plane. Each subject held a handle of the InMotion2TM two-link manipulundum to control the location of a yellow cursor shown on a monitor.

5829

I

II

III

IV

V

VI

120 deg

Fig. 5.

Human double-step reaching protocol.

46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 Hand speed

Hand path

0.7

0.6

0.6

0.55

Hand speed − m/s

0.5

Y−m

0.5

0.4

0.3

0.45

0.2

0.4 0.1

−0.04 −0.02

0

0.02 X−m

0.04

0.06

0

0.08

0

500

1000

1500

Time − msec

Fig. 6. An example of kinematic fit of the data with response of the LPV system. Data from a subject performing S-C-R task. Experimental data (blue), LPV fit (red), and Reference command for the LPV system (green dashed). Left: Hand path. Right: Hand speed. Hand speed

Hand path

0.8

0.6 0.7

0.6

0.55

Hand speed − m/s

0.5 0.5 Y−m

After 40 ∼ 120 practice trials there were four blocks of 140 test trials. Within each block the starting point, either North (N) or South (S), and the final target, either on the Right (R) or the Left (L) side, all relative to the center (C), remained fixed. Both starting targets are located 15 cm vertically away from the C respectively. The task was to move sequentially from a start location toward the center then toward a final target according to visual cues. An example of an action sequence in each trial is shown in Fig. 5 and is explained here. Each trial was initiated by a subject moving the small yellow cursor to a larger yellow circle at one of the two start locations, N or S (Fig. 5-I,-II). The subject was instructed to hold the cursor in the green circle for 1 ∼ 1.5 seconds until the circle disappeared (Fig. 5-III), and the first central target turned red to cue a brisk movement to C (Fig.5IV). The second target which was blue appeared randomly 100 to 400 ms later and one at one of sevan possible locations (dashed circles) (Fig.5-V) such that the number of trials to each possible target was equal to 20. Kinematic data, hand position and velocity, from each trial was recorded. Then, with obtained or estimated parameters for the arm plant for each subject, an LPV controller is designed. A series of two minimum jerk (MJ) profiles in series was fed as a reference command to the closed loop LPV system. Then, in order to fit the experimental hand position and velocity with the output from the closed loop LPV system, genetic algorithm command in Matlab, ga was used to find a set of coefficients characterizing two MJ profiles in series [38].

FrC01.5

0.4

0.3

0.45

IV. R ESULTS 0.2

Data were recorded from four right handed subjects in this study. Each data set consisted of 560 trials total, 140 trials for each of the combinations of the starting point (N,S) and final targets (L,R). Due to various kinematic variability, only five trials from each subject for the S-C-R task and another set of five trials for the S-C-L task, total of 10 trials were used to explore the quality of the fit of the LPV controller. Out of 40 trials, ga algorithm converged for 39 trial data. Fig.6 shows an example of a data fit well by the LPV model. A minor curvature of the hand path in the first segment is not extremely accurately captured and there is a slight offset at the end point of the movement, but overall the hand path is fit by the LPV model well. The extrema of the hand speed slightly deviate from the data, but the LPV response faithfully captures the hand speed of the data whose second segment command starts roughly at the middle of the first segment. Fig. 7 illustrates an example of a fit of the movement of the target sequence S-C-L. Both the hand path and speed are reproduced in S-C-L movements as well as in S-C-R movements. The contribution of LPV gainscheduling to dynamic control is illustrated in Fig. 8 which shows an performance comparison between H∞ and LPV responses. During the first segment of the movement, the responses from both systems do not differ much either in the hand path or speed,

0.4 0.1

−0.16 −0.14 −0.12

−0.1

−0.08 −0.06 −0.04 −0.02 X−m

0

0

500

1000

1500

Time − msec

Fig. 7. An example of kinematic fit of the data with response of the LPV system. Data from a subject performing S-C-L task. Line types follow Fig.6.

but in the second segment it appears that H∞ response yields slightly late initiation and slight directional offset to the left in the second segment. Furthermore, the end point error of H∞ controller is much larger than that of the LPV. Therefore, even though the task was not very demanding dynamically, the LPV controller usually resulted in a better fit to the experimental data. V. D ISCUSSION AND C ONCLUSIONS The simplicity and linearity of the RIPID formulation suggests that some method of modifying its function to address nonlinearities and temporal variation in plant dynamics would be required for it to account for human limb control. The RICSS model suggests a particular multiplicative relation between the error-like signal and state information at the PCs. This mechanism was abstracted to a well established LPV gainscheduling control scheme. To test the feasibility of such a formulation, the double step task was performed

5830

46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 Hand speed

Hand path

0.7

0.6

0.6

0.55 0.5

Y−m

Hand speed − m/s

0.5

0.4

0.3

0.45

0.2

0.4 0.1

−0.06

−0.04

−0.02

0 0.02 X−m

0.04

0.06

0

0

500

1000

1500

Time − msec

Fig. 8. Difference between the single H∞ controller response and the LPV response. Data from a subject performing S-C-L task. Experimental data (blue), LPV fit (red), and H∞ fit (yellow). Left: Hand path. Right: Hand speed.

to human subjects. An LPV model that was gainscheduled according to limb state accounted fairly well for some of the human data. This provides a preliminary indication that simple scheduling of the RIPID gains according to limb state may be adequate to manage limb dynamics. Moreover, it is intriguing to note the structural similarity between an affine representation of the current schedule variable in an LPV formulation and a hypothesized selPF which is an affine function of afferent limb states and their derivatives. Thus, it is possible that limb control can be modeled effectively using LPV techniques. The finding potentially provides a systematic method of representing human control using an established and flexible engineering technique. It also supports the controversial view that the human motor control system does not require internal models of body dynamics to achieve satisfactory control of body motion. In order to critically test if an LPV or gainscheduling system is better than a single H∞ , or even a necessity of gainscheduling to explain behavioral data, more data with faster speeds would be required. In addition, the schedule variables in Eqn’s (20,21) are nonlinear functions of a limb state, θe (t) which is a natural choice as an argument for the scheduling variables here because it is the variable that survives linearization around equilibrium points. However, the RICSS model suggests that selPF’s are thresholded affine functions of limb states and their derivatives. Thus, it is critical to approximate the limb dynamics as an affine function of θe for an LPV model. In addition, although it is not necessary at speeds tested here, scheduling based on velocity may be necessary. The data from [39] implies that some form of velocity gainschedule may be present as PCs appear to be sensitive to different movement speeds. How to incorporate both position and velocity as scheduling variables in an optimal control synthesis will await further study. Furthermore, to avoid ambiguity of the effects between the responses of dynamical systems and those of optimization

FrC01.5 procedures to minimize the difference between the simulation responses and the experimental data, a better method of combining dynamical controller design and data fitting need to be developed without exhaustively searching finely gridded spaces of command and controller parameters. That the kinematic data obtained here can be often fit adequately by a feedback-based control structure does not in itself exclude the possibility of other feedforward inverse dynamics control that has been proposed for cerebellar function by many [40], [25]. However, taken together with the observation by the physiologists that most PC units responded to passive manipulation argues strongly for the presence of feedback signals in PC firing activity, as used by the RICSS model, and against purely feedforward cerebellar control models [21], [40]. The possibly multiplicative relationship between kinematics and cerebellar signals used in RICSS had not been emphasized before [41], although purely linear formulations such as in [42], [12], do not appear to consider linearity as a fundamental requirement. Moreover, other proposals [43], [44] are sufficiently general to be consistent with PC data used here. However, these models have not yet been explicitly reconciled with cerebrocerebellar circuitry and cerebellar signals recorded during arm movement. The debate on whether cerebellum acts like a feedforward inverse dynamics controller [25], [43] or feedback controller remains still active. Two recent physiological studies attempted to answer the question by applying force fields. One recent study [45] showed that the SS firing of majority of PCs was not significantly modulated by the force nor was their spatial tuning affected as long as kinematic performance remained the same. Thus, these results do not support the hypothesis that PCs represent the output of an inverse dynamics model of the arm. Instead these neurons provide a kinematic and possibly kinematic error representation of arm movements. The other study [46] showed that, by applying two different force fields, SS activities in many PCs differed distinctly depending on the type of force field. Furthermore, difference in averaged PC SS activities increased approximately 40 ms before the difference appeared in EMG activities. The authors conclude that PC SS in the intermediate part of cerebellar lobules V-VI encode movement dynamics. As we are not sure about the tracking error in these studies, it is still difficult to clearly reconcile these findings with the RIPID/RICSS formulation. More detailed information regarding the internal signals will be needed. In any case, the current study suggests that scheduled linear control based on kinematic tracking error may be adequate to achieve satisfactory control of body motion. Internal models of body dynamics may not be required. ACKNOWLEDGMENT The authors would like to thank Prof. Tim Ebner at Univ. of Minnesota and Dr. Alex Roitman at UCSF for the primate physiology and kinematic data, and Prof. Munther Dahleh at MIT for discussion on LPV and its application to limb control.

5831

46th IEEE CDC, New Orleans, USA, Dec. 12-14, 2007 R EFERENCES [1] G. Allen and N. Tsukahara, “Cerebrocerebellar communication systems,” Physiol Rev, vol. 54, no. 4, pp. 957–1006, Oct 1974. [2] V. B. Brooks, The neural basis of motor control. Oxford University Press, 1986. [3] R. M. Kelly and P. L. Strick, “Cerebellar loops with motor cortex and prefrontal cortex of a nonhuman primate,” J. Neurosci., vol. 23, pp. 8432–44, 2003. [4] A. P. Georgopoulos, J. F. Kalaska, R. Caminiti, and J. T. Massey, “On the relations between the direction of two-dimensional arm movements and cell discharge in primate motor cortex,” J Neurosci, vol. 2, no. 11, pp. 1527–1537, 1982. [5] J. Kalaska, D. Cohen, M. Prud’homme, and M. Hyde, “Parietal area 5 neuronal activity encodes movement kinematics, not movement dynamics.” Exp Brain Res., vol. 80, no. 2, pp. 351–64, 1990. [6] L. Sergio and J. Kalaska, “Changes in the temporal pattern of primary motor cortex activity in a directional isometric force versus limb movement task,” J Neurophysiol., vol. 80, pp. 1577–83, 1998. [7] B. Marconi, A. Genovesio, A. Battaglia-Mayer, S. Ferraina, S. Squatrito, M. Molinari, F. Lacquaniti, and R. Caminiti, “Eye-hand coordination during reaching. i. anatomical relationships between parietal and frontal cortex.” Cereb Cortex, vol. 11, no. 6, pp. 513–27, 2001. [8] A. J. Bastian, T. A. Martin, J. G. Keating, and W. T. Thach, “Cerebellar ataxia: abnormal control of interaction torques across multiple joints,” J Neurophysiol, vol. 76, no. 1, pp. 492–509, 1996. [9] M. Maschke, C. M. Gomez, T. J. Ebner, and J. Konczak, “Hereditary Cerebellar Ataxia Progressively Impairs Force Adaptation During Goal-Directed Arm Movements,” J Neurophysiol, vol. 91, pp. 230– 238, 2004. [10] G. Holmes, “The cerebellum of main,” Brain, vol. 62, pp. 1–30, 1939. [11] Q. Fu, D. Flament, J. Coltz, and T. Ebner, “Relationship of cerebellar purkinje cell simple spike discharge to movement kinematics in the monkey.” J Neurophysiol, vol. 78, no. 1, pp. 478–91, July 1997. [12] H. Gomi, M. Shidara, A. Takemura, Y. Inoue, K. Kawano, and M. Kawato, “Temporal firing patterns of purkinje cells in the cerebellar ventral paraflocculus during ocular following responses in monkeys I. Simple spikes,” J Neurophysiol, vol. 80, no. 2, pp. 818–831, 1998. [13] C. L. Ojakangas and T. J. Ebner, “Purkinje cell complex and simple spike changes during a voluntary arm movement learning task in the monkey,” J Neurophysiol, vol. 68, no. 6, pp. 2222–2236, 1992. [14] E. Bauswein, F. Kolb, B. Leimbeck, and F. Rubia, “Simple and complex spike activity of cerebellar purkinje cells during active and passive movements in the awake monkey,” J Physiol, vol. 339, pp. 379–94, 1983. [15] W. A. MacKay and J. T. Murphy, “Responses of interpositus neurons to passive muscle stretch,” J Neurophysiol, vol. 37, pp. 1410–1423, 1974. [16] G. Bosco and R. E. Poppele, “Proprioception From a Spinocerebellar Perspective,” Physiol. Rev., vol. 81, no. 2, pp. 539–568, 2001. [17] S. Kakei, J. Yagi, T. Wannier, J. Na, and Y. Shinoda, “Cerebellar and cerebral inputs to corticocortical and corticofugal neurons in areas 5 and 7 in the cat,” J Neurophysiol, vol. 74, no. 1, pp. 400–412, 1995. [18] P. A. Fortier, J. F. Kalaska, and A. M. Smith, “Cerebellar neuronal activity related to whole-arm reaching movements in the monkey,” J Neurophysiol, vol. 62, no. 1, pp. 198–211, 1989. [19] W. MacKay, “Unit activity in the cerebellar nuclei related to arm reaching movements.” Brain Res., vol. 442, no. 2, pp. 240–54, 1988. [20] F. B. Horak and H. C. Diener, “Cerebellar control of postural scaling and central set in stance,” J Neurophysiol, vol. 72, pp. 479–493, 1994. [21] J. Contreras-Vidal, S. Grossberg, and D. Bullock, “A neural model of cerebellar learning for arm movement control: cortico-spino-cerebellar dynamics.” Learn Mem., vol. 3, no. 6, pp. 475–502, Mar-Apr 1997. [22] J. Houk and S. Wise, “Distributed modular architectures linking basal ganglia, cerebellum, and cerebral cortex: their role in planning and controlling action.” Cereb Cortex, vol. 5, pp. 95–110, 1995. [23] S. G. Massaquoi and J. Slotine, “The intermediate cerebellum may function as a wave-variable processor.” Neurosci Lett, vol. 215, pp. 60–4, 1996. [24] M. Paulin, “The role of the cerebellum in motor control and perception.” Brain Behav Evol, vol. 41, no. 1, pp. 39–50, 1993. [25] D. Wolpert and M. Kawato, “Multiple paired forward and inverse models for motor control.” Neural Netw, vol. 11, pp. 1317–1329, Oct 1998.

FrC01.5 [26] K. Takahashi, “Modeling cerebrocerebellar control in horizontal planar arm movements of humans and the monkey,” Ph.D, MIT, Cambridge, MA, 2007. [27] S. G. Massaquoi, “Modeling the function of the cerebellum in scheduled linear servo control of simple horizontal planar arm movements,” Ph.D, MIT, 1999. [28] S. Jo and S. G. Massaquoi, “A model of cerebrocerebellospinomuscular interaction in the sagittal control of human walking,” Biological Cybernetics, vol. 96, no. 3, pp. 279–307, Mar 2007. [29] ——, “A model of cerebellum stabilized and scheduled hybrid longloop control of upright balance,” Biol Cybern, vol. 91, no. 3, pp. 188– 202, September 2004. [30] H. Asanuma and I. Rosen, “Topographical organization of cortical efferent zones projecting to distal forelimb muscles in the monkey.” Exp Brain Res, vol. 14, no. 3, pp. 243–56, 1972. [31] O. Oscarsson, “Functional units of the cerebellum-sagittal zones and microzones,” Trends Neurosci, vol. 2, pp. 143–145, 1979. [32] M. Ito, The cerebellum and neural control. Raven Press, 1984. [33] J. M. Bower, “The organization of cerebellar cortical circuitry revisited: Implications for function,” Ann NY Acad Sci, vol. 978, pp. 135– 55, 2002. [34] F. Santamaria, D. Jaeger, E. De Schutter, and J. M. Bower, “Modulatory effects of parallel fiber and molecular layer interneuron synaptic activity on purkinje cell responses to ascending segment input: a modeling study.” J Comput Neurosci, vol. 13, no. 3, pp. 217–235, Nov-Dec 2002. [35] D. Cohen and Y. Yarom, “Cerebellar on-beam and lateral inhibition: Two functionally distinct circuits,” J Neurophysiol, vol. 83, no. 4, pp. 1932–40, 2000. [36] R. E. Poppele, G. Bosco, and A. M. Rankin, “Independent representations of limb axis length and orientation in spinocerebellar response components,” J Neurophysiol, vol. 87, no. 1, pp. 409–422, 2002. [37] P. Apkarian and P. Gahinet, “A convex characterization of gain scheduled H∞ controllers,” IEEE Trans. Aut. Contr., vol. 40, pp. 853– 863, 1995. [38] T. Flash and E. Henis, “Arm trajectory modifications during reaching towards visual targets,” J Cogn Neurosci., vol. 3, 1991. [39] A. V. Roitman, S. Pasalar, M. T. V. Johnson, and T. J. Ebner, “Position, direction of movement, and speed tuning of cerebellar purkinje cells during circular manual tracking in monkey,” Journal of Neuroscience, vol. 25, no. 40, pp. 9244–9257, 2005. [40] M. Kawato, K. Furukawa, and R. Suzuki, “A hierarchical neuralnetwork model for control and learning of voluntary movement.” Bilo Cybern, vol. 57, no. 3, pp. 169–85, 1987. [41] A. V. Roitman, S. G. Massaquoi, K. Takahashi, and T. J. Ebner, “Kinematic Analysis of Manual Tracking in Monkeys: Characterization of Movement Intermittencies During a Circular Tracking Task,” J Neurophysiol, vol. 91, no. 2, pp. 901–911, 2004. [42] A. Pellionisz and R. Llinas, “Space-time representation in the brain. the cerebellum as a predictive space-time metric tensor.” Neuroscience, vol. 7, no. 12, pp. 2949–70, 1982. [43] M. Kawato, “Internal models for motor control and trajectory planning.” Curr Opin Neurobiol, vol. 9, no. 6, pp. 718–27, Dec 1999. [44] N. Schweighofer, J. Spoelstra, M. Arbib, and M. Kawato, “Role of the cerebellum in reaching movements in humans. ii. a neural model of the intermediate cerebellum.” Eur J Neurosci, vol. 10, 1998. [45] S. Pasalar, A. Roitman, W. Durfee, and T. Ebner, “Force field effects on cerebellar purkinje cell discharge with implications for internal models.” Nat Neurosci., vol. 9, no. 11, pp. 1404–11, Nov 2006. [46] K. Yamamoto, M. Kawato, S. Kotosaka, and S. Kitazawa, “Encoding of movement dynamics by purkinje cell simple spike activity during fast arm movements under resistive and assistive force fields.” J Neurophysiol., vol. 97, no. 2, pp. 1588–99, Feb 2007.

5832