An Adaptive Recurrent Architecture for Learning Robot ... - Google Sites
Recommend Documents
Learning Management System in Aviation Security ... Learning Management Systems (LMS) are one .... used to compare with some a-priori model [4] to see.
Janse and Warner ten Kate. REFERENCES. [1] J.F.K. Buford, Architectures and Issues for Distributed. Multimedia Systems, in J. F. K. Buford(Ed.), Multimedia.
Jul 12, 2006 - http://msdn.microsoft.com/library/default.asp?url=/library/en- ... [15] Nelson, M., Serial Communications Developer's Guide, New Jersey: John.
Human Factors, Standardization. Keywords. Adaptive Hypermedia Services, Learning Environments,. Integration, e-Learning, Content Interworking, Data Model.
intelligent systems by executing hybrids of algorithms based on different computational ..... cognitive self-regulation principle motivates a significantly different ...
ratios. Therefore, these devices have become more susceptible to external phenomena (such as ... corrupting a wide variety of memory elements accessible to the operating system and to ... limited, as most test logic requires that the system be put in
Keywords: automatic adaptive defense, multi-loop feedback control, social network, ...... emails from a sender for some information but he might send some ...
May 13, 1999 - Systems and Computer Engineering. Carleton University ... the same infrastructure is increasingly used to
More precisely, a device is an piece of capital equipment which (1) may be ... the symbolic representation of the operation, and an optional list of one or more modifiers /5/. ... There are atomic expressions, including characters, strings, integers,
Communication Services), CNS-0516807 (Preventing Voice Spamming), .... In particular the MLS considers the past experience (history) with source and .... Dantu and Kolan [14] devised a technique for classifying the emails as âspamâ,.
An Adaptive Architecture for Physical Agents. Pat Langley. Computational
Learning Laboratory. Center for the Study of Language and Information. Stanford
...
monitoring [6]. While the proposed benefits of adaptive software is promising, developing adaptive software also raises some challenging questions. First, in.
personal (name, email address, phone, etc...), preferences. (language .... .
Nov 7, 2014 - vertisement, etc. Automatically mining and learning user- .... randomly sampled triple (u, i, j), which an
International Journal on Social Robotics. Preprint, finally ... Dominic Heger · Felix Putze · Tanja Schultz. Abstract This article ... E-mail: [email protected].
validated using a vision- and sonar-based Pioneer I robot in non-stationary conditions, in the context of a multi-robot foraging task. Results show the effective-.
Middleware, Interoperability, Service-oriented architecture,. Component-Based Architecture ... all software programming activities are executed. In this paper,.
PQ-learning, for intelligent behavior acquisition by an autonomous robot. ... proposed a PartiGame algorithm for solving puzzle problems, in which the agent learns to ..... [6] G. Gordon, âStable function approximation in dynamic programmingâ, ..
PQ-learning, for intelligent behavior acquisition by an autonomous robot. .... episode only, all state transitions in history so that the consistency of value functions .... 640 1280 2560 5120. Number of learning episodes. P e rc e n ta g e o. f s ta
PQ-learning, for intelligent behavior acquisition by an autonomous robot. ... Al- though these methods have achieved impressive robotic intelligence in some fields, more ... In [12], a decision tree based adaptive state partitioning method was.
communication, done via User Datagram Protocol (UDP) and Transmission .... Correction of a robot tool frame according to the nominal frame, referenced to a global .... In order to debug the connection, the Wireshark® free, open-source network ... Wi
a calculus student using short-answer questions. In this article, we show .... mathematical expressions using a pen tablet. ... As you can see, a one-point criterion only involves a ... mathematical software, it does not require transformations of.
with a software simulator which runs interactive examples on-line, and is fully controlled .... Additionally, an electronic white board (smart board) makes the presenta- ... browsing the WEB in search of real data to download to the simulator and ...
An Adaptive Recurrent Architecture for Learning Robot ... - Google Sites
be accessed by more than one arm configuration. ⢠cerebellar connectivity is intrinsically modular and its complexity
Recurrent cerebellar loops simplify adaptive control of redundant and non-linear motor systems John Porrill & Paul Dean Centre for Signal Processing in Neural Imaging & Systems Neuroscience University of Sheffield
We have described elsewhere an adaptive filter model of cerebellar learning in which the cerebellar microcircuit acts to decorrelate motor commands from their sensory consequences (Dean, Porrill & Stone, Proc Roy Soc B, 2002). We found that stable learning for generic motor plants required the cerebellar microcircuit to be embedded in a recurrent loop; this arrangement lead to a simple and modular adaptive control architecture when applied to the linearised 3D vestibular ocular reflex (Porrill, Dean & Stone, Proc Roy Soc B, 2004). Here we investigate the properties of recurrent loop connectivity for the case of redundant and non-linear motor systems using the example of kinematic control of a robot arm. We will demonstrate that •
the learning rule does not depend on the availability of 'motor error' signals or require complex neural 'reference structures' to estimate such signals. It is thus a genuinely local learning architecture in which proximal rather than distal error can be used for learning
•
the form of the learning rule is biologically plausible, independent of the details of the controlled plant and in particular is unaffected by motor plant nonlinearities
•
control of redundant systems is not subject to the 'convexity problem' in which incorrect average motor commands are learnt for end-effector positions which can be accessed by more than one arm configuration
•
cerebellar connectivity is intrinsically modular and its complexity scales linearly with the dimensionality N of output space rather than with the product of N and the (for highly redundant biological motor systems usually much higher) dimensionality of motor command space.
These properties suggest that the highly specific closed cerebellar loops which have been shown to be ubiquitous in motor systems (Kelly & Strick, 2003) may play a central role in simplifying the adaptive control problem.
Figure 1. Recurrent cerebellar architecture applied to kinematic control of a robot arm. Robot arm motor plant (forward kinematics) is denoted by P. Desired end effector positions x(t) are processed by a recurrent loop containing a fixed approximation B to the inverse kinematics P-1 and an adaptive element C = Σ wi Gi representing the cerebellar microcircuit. Reaching error e = x − x is used to train the weights wi using the local learning rule w i = − β e Gi ( y ) . Convergence follows since sum square synaptic error V = 1 2Σ ( wi − wi opt ) 2 is a Lyapounov function with V = − β e 2 . This behaviour contrasts
with conventional forward loop architectures where an approximation to the Jacobian of P is required to enable learning.
Figure 2. Adaptive re-calibration of a two joint robot arm. Left panel shows simulated two joint arm.
Initially B (see Fig 1) is an exact inverse kinematics for arm lengths L1 =1, L2 = 2 and C is an RBF net with weights wi=0: in this case the arm reaches accurately to points within the workspace outlined by the blue grid. If arm lengths are changed to L1 =0.9, L2 = 2.2 the arm reaches to inaccurate positions shown by the distorted pre-training (red) grid. Performance after 104 training examples is shown by the post-training (dotted green) grid. Top right panel shows stochastic decrease of RMS reaching error during training. Bottom right panel shows monotonic decrease in V (exactly as predicted by the Lyapounov analysis). Note that much faster convergence can be obtained using optimised bases Gi.