Neural reinforcement learning for behaviour synthesis

Claude Touzet

doi:10.1016/S0921-8890(97)00042-0

Article Dans Une Revue Robotics and Autonomous Systems Année : 1997

Neural reinforcement learning for behaviour synthesis

(1)

Claude Touzet

Fonction : Auteur correspondant
PersonId : 9191
IdHAL : claude-touzet
IdRef : 031952259

Connectez-vous pour contacter l'auteur

Laboratoire de Neurosciences intégratives et adaptatives

Résumé

We present the results of a research aimed at improving the Q-learning method through the use of artificial neural networks. Neural implementations are interesting due to their generalisation ability. Two implementations are proposed: one with a competitive multilayer perceptron and the other with a self-organising map. Results obtained on a task of learning an obstacle avoidance behaviour for the mobile miniature robot Khepera show that this last implementation is very effective, learning more than 40 times faster than the basic Q-learning implementation. These neural implementations are also compared with several Q-learning enhancements, like the Q-learning with Hamming distance, Q-learning with statistical clustering and Dyna-Q.

Mots clés

Neural Q-learning reinforcement learning obstacle avoidance behaviour self-organising map autonomous robotics

Domaines

Intelligence artificielle [cs.AI] Apprentissage [cs.LG] Réseau de neurones [cs.NE] Robotique [cs.RO]

Fichier principal

Jars_97(1).pdf (472.86 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Claude Touzet : Connectez-vous pour contacter le contributeur

https://amu.hal.science/hal-01337989

Soumis le : lundi 27 juin 2016-16:30:28

Dernière modification le : vendredi 24 mars 2023-14:53:02

Dates et versions

hal-01337989 , version 1 (27-06-2016)

Identifiants

HAL Id : hal-01337989 , version 1
DOI : 10.1016/S0921-8890(97)00042-0

Citer

Claude Touzet. Neural reinforcement learning for behaviour synthesis. Robotics and Autonomous Systems, 1997, ⟨10.1016/S0921-8890(97)00042-0⟩. ⟨hal-01337989⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-AMU LNIA

87 Consultations

647 Téléchargements

Neural reinforcement learning for behaviour synthesis

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager