Q-learning for Robots

Claude Touzet

Chapitre D'ouvrage Année : 2003

Q-learning for Robots

(1)

Claude Touzet

Fonction : Auteur
PersonId : 9191
IdHAL : claude-touzet
IdRef : 031952259

Laboratoire de Neurosciences intégratives et adaptatives

Résumé

Robot learning is a challenging – and somewhat unique – research domain. If a robot behavior is defined as a mapping between situations that occurred in the real world and actions to be accomplished, then the supervised learning of a robot behavior requires a set of representative examples (situation, desired action). In order to be able to gather such learning base, the human operator must have a deep understanding of the robot-world interaction (i.e., a model). But, there are many application domains where such models cannot be obtained, either because detailed knowledge of the robot’s world is unavailable (e.g., spatial or underwater exploration, nuclear or toxic waste management), or because it would be to costly. In this context, the automatic synthesis of a representative learning base is an important issue. It can be sought using reinforcement learning techniques – in particular Q-learning which does not require a model of the robot-world interaction. Compared to supervised learning, Q-learning examples are triplets (situation, action, Q value), where the Q value is the utility of executing the action in the situation. The supervised learning base is obtained by recruiting the triplets with the highest utility.

Domaines

Apprentissage [cs.LG] Robotique [cs.RO] Réseau de neurones [cs.NE]

Fichier principal

Q-learning_for_Robots(1).pdf (58.22 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Claude Touzet : Connectez-vous pour contacter le contributeur

https://amu.hal.science/hal-01338045

Soumis le : lundi 27 juin 2016-17:14:11

Dernière modification le : vendredi 24 mars 2023-14:53:02

Dates et versions

hal-01338045 , version 1 (27-06-2016)

Identifiants

HAL Id : hal-01338045 , version 1

Citer

Claude Touzet. Q-learning for Robots. M. Arbib. The Handbook of Brain Theory and Neural Networks (Second Edition), MIT Press, pp. 934-937, 2003. ⟨hal-01338045⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS UNIV-AMU LNIA

126 Consultations

201 Téléchargements

Q-learning for Robots

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager