• Tidak ada hasil yang ditemukan

A Neuromorphic Motion Controller for a Biped Robot

N/A
N/A
Protected

Academic year: 2023

Membagikan "A Neuromorphic Motion Controller for a Biped Robot"

Copied!
1
0
0

Teks penuh

(1)

A Neuromorphic Motion Controller for a Biped Robot

Michele Folgheraiter, Amina Keldibek, Bauyrzhan Aubakir, Shyngys Salakchinov

1

, Giuseppina Gini, Alessio Mauro Franchi

2

Humans beings, during their life, are exposed to dynamic environments that force them to quickly react to external stimulus, adapt to new situations and learn new sensory- motor control strategies. In an analogous way a humanoid robot which is supposed to operate in an unstructured envi- ronment is also required to learn and adapt while operating and performing different tasks.

Continuous Time Recurrent Neural Networks (CTRNN) represent a powerful paradigm to control mobile robotics systems [1]. They can be used to predict sensory inputs, for image processing, speech recognition and to generate motion trajectories [2]. Different learning methods can be applied to tune the CTRNN’s synapses such as back-propagation through time [3], real-time recurrent learning [4], Kalman fil- ters, etc. In particular, the reward-modulated Hebbian learn- ing rule [5] represents a more biological plausible method where the adaptation depends on a binary signal encoding an instantaneous reward based on the average performance of the neural controller.

Here we propose a neuromorphic control system for a medium size humanoid robot under development in the Robotics and Mechatronics Department at Nazarbayev Uni- versity (Fig.1) and in cooperation with Politecnico di Mi- lano. The high level layer decides which motion primitives to generate according to the task. The middle level layer coordinates the low level modules setting the gait frequency, the step size and height. Finally the low level modules, that mimic the structure of microcircuits located in the primates cortex [6], generate the joints trajectories. More specifically, each module is represented by a CTRNN of 200 neurons randomly and sparsely connected with fixed synapses and capable to learn and reproduce tunable periodic trajectories.

Each neuron in the circuit is modeled by a first order differential equation with a sigmoidal activation function.

The adaptation takes place in the synapses of readout units that realize a linear combination of the neurons output in order to reproduce the target signals.

1School of Science and Technology, Robotics and Mechatronics Department,Nazarbayev University, Kaza-

khstan. [email protected],

[email protected], [email protected], [email protected]

2 DEIB, Politecnico di Milano, piazza L. da Vinci

32, Milano, Italy [email protected],

[email protected]

*This work is supported by the Ministry of Education and Science of the Republic of Kazakhstan under the grant and target funding scheme agreement#220/073-2015.

150 155 160 165 170

⌃1.5

⌃1

⌃0.5 0 0.5 1 1.5

Time [s]

RNNOutput1

150 155 160 165 170

⌃1.5

⌃1

⌃0.5 0 0.5 1 1.5

Time [s]

RNNOutput2

150 155 160 165 170

⌃1.5

⌃1

⌃0.5 0 0.5 1 1.5

Time [s]

RNNOutput3

150 155 160 165 170

⌃1.5

⌃1

⌃0.5 0 0.5 1 1.5

Time [s]

RNNOutput4

150 155 160 165 170

⌃1.5

⌃1

⌃0.5 0 0.5 1 1.5

Time [s]

RNNOutput5

Adaptable Excitatory Inhibitory

3

4

5

A)

B)

C)

Fig. 1. A) A low level CTRNN module, B) Biped prototype under development at Nazarbayev University: 10DOFs, 4.5Kg, 0.6m heightC) Learned trajectories for the five joints of the biped right leg obtained from a step simulation, the vertical blue dashed line indicates the learning time.

REFERENCES

[1] R. D. Beer, H. J. Chiel, R. D. Quinn, K. S. Espenschied, and P. Larsson,

“A distributed neural network architecture for hexapod robot locomo- tion,”Neural Computation, vol. 4, no. 3, pp. 356–365, May 1992.

[2] P. Joshi and W. Maass, “Movement generation with circuits of spiking neurons,”Neural Computation, vol. 17, no. 8, pp. 1715–1738, 2005.

[3] P. Werbos, “Backpropagation through time: what it does and how to do it,”Proceedings of the IEEE, vol. 78, no. 10, pp. 1550–1560, Oct 1990.

[4] R. J. Williams and D. Zipser, “Experimental Analysis of the Real-time Recurrent Learning Algorithm,”Connection Science, vol. 1, no. 1, pp.

87–111, 1989.

[5] G. M. Hoerzer, R. Legenstein, and W. Maass, “Emergence of complex computational structures from chaotic neural networks through reward- modulated hebbian learning,”Cerebral Cortex, vol. 24, no. 3, pp. 677–

690, 2012.

[6] J. Wessberg, C. R. Stambaugh, J. D. Kralik, P. D. Beck, M. Laubach, J. K. Chapin, J. Kim, S. J. Biggs, M. A. Srinivasan, and M. A. L.

Nicolelis, “Real-time prediction of hand trajectory by ensembles of cortical neurons in primates,”Nature, vol. 408, no. 6810, pp. 361–365, 11 2000.

Human Performance and Robotics, Satellite Workshop of IEEE Humanoid 2016

Referensi

Dokumen terkait

The primary scope of this research involves the developing a stable walking motion for humanoid robot by using cubic polynomial method which refers to the

The big difference of hopping and jumping type robot is jumping robot can make only one big jump moving performance.. In contrast, the hopping robot can generate the

a) To design and develop the mechanical structure of legged motion platform of a mobile robot suitable for terrain grass, sand and floor for military purpose. b) To

Experimental Evaluation of a Saturated Output Feedback Controller Using RBF Neural Networks for SCARA Robot IBM 7547 Mohammad Pourrahim Khoshnam Shojaei Abbas Chatraei Omid Shah

Mohamed Elouni Adaptive PID Fault-Tolerant Tracking Controller for Takagi-Sugeno Fuzzy Systems with Actuator Faults: Application to Single-Link Flexible Joint Robot designing a

Footstep transition model: left - The reachable region of the swing foot; right - a set of selected feasible footstep placements are predefined correspond to the robot stable strategy

Therefore, an educational desktop-based application, iLeap: Interactive Learning for Autism Spectrum Disorder ASD Children using Leap Motion Controller is developed together with Leap

In order to avoid instability caused by unmodeled phase behaviour, the robust controller design is divided into two stages: H’ controller design and checking closed loop sensitivity