
Distributed Lazy Q-learning for Cooperative Mobile Robots

Abstract: Compared to single-robot learning, cooperative learning adds the challenges of a much larger search space (the combination of the individual search spaces), awareness of the other team members, and the synthesis of the individual behaviors with respect to the task given to the group. Over the years, reinforcement learning has emerged as the main learning approach in autonomous robotics, and lazy learning has become the leading bias, reducing the time required by an experiment to the time needed to test the performance of the learned behavior. These two approaches have been combined into what is now called lazy Q-learning, a very efficient single-robot learning paradigm. We propose an extension of this paradigm to teams of robots: the "pessimistic" algorithm, which computes for each team member a lower bound on the utility of executing an action in a given situation. We use the cooperative multi-robot observation of multiple moving targets (CMOMMT) application as an illustrative example, and study the efficiency of the pessimistic algorithm in its task of inducing the learning of cooperation.
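The pessimistic lower-bound idea described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the memory-based (lazy) utility estimate, the nearest-neighbor scheme, and all names and state encodings here are assumptions. Each robot keeps a memory of experienced situations with the joint action taken and the observed utility; at decision time it estimates, for each of its own actions, the worst case over the teammates' possible actions, and picks the action whose lower bound is highest.

```python
import math

# Lazy (memory-based) Q-learning sketch: store experienced samples as
# (situation, own_action, others_action, q) and defer all generalization
# to decision time via nearest-neighbor lookup.
memory = []

def estimate_q(situation, own_action, others_action, k=3):
    """Average q of the k stored samples closest to `situation` that
    match the given joint action (hypothetical lookup scheme)."""
    matches = [(math.dist(situation, s), q)
               for s, a, o, q in memory
               if a == own_action and o == others_action]
    if not matches:
        return 0.0  # no experience yet: neutral default (an assumption)
    matches.sort(key=lambda t: t[0])
    nearest = matches[:k]
    return sum(q for _, q in nearest) / len(nearest)

def pessimistic_action(situation, own_actions, others_actions):
    """Choose the own action maximizing a lower bound on utility:
    the worst case over the teammates' possible actions."""
    def lower_bound(a):
        return min(estimate_q(situation, a, o) for o in others_actions)
    return max(own_actions, key=lower_bound)
```

Note the contrast with greedy action selection: an action that is excellent under one teammate behavior but disastrous under another gets a low lower bound, so the pessimistic robot prefers actions that are safe regardless of what the rest of the team does.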

Cited literature: 19 references

https://hal-amu.archives-ouvertes.fr/hal-01337605
Contributor: Claude Touzet
Submitted on: Monday, June 27, 2016 - 4:03:37 PM
Last modification on: Thursday, January 18, 2018 - 1:42:37 AM
Long-term archiving on: Wednesday, September 28, 2016 - 11:16:44 AM

File

Distributed_lazy_Q.pdf
Publisher files allowed on an open archive


Citation

Claude Touzet. Distributed Lazy Q-learning for Cooperative Mobile Robots. International Journal of Advanced Robotic Systems, InTech, 2004, 1, pp.5-13. ⟨10.5772/5614⟩. ⟨hal-01337605⟩


Metrics

Record views: 138
File downloads: 240