Reinforcement Learning with Restrictions on the Action Set

Mario Bravo; Mathieu Faure

doi:10.1137/130936488

Journal article, SIAM Journal on Control and Optimization, Year: 2015


Mario Bravo

Role: Author

Universidad de Santiago de Chile [Santiago]

Mathieu Faure

Role: Author

Groupement de Recherche en Économie Quantitative d'Aix-Marseille

Abstract

Consider a two-player normal-form game repeated over time. We introduce an adaptive learning procedure, where the players only observe their own realized payoff at each stage. We assume that agents do not know their own payoff function and have no information on the other player. Furthermore, we assume that they have restrictions on their own actions such that, at each stage, their choice is limited to a subset of their action set. We prove that the empirical distributions of play converge to the set of Nash equilibria for zero-sum and potential games, and games where one player has two actions.
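The setting described in the abstract can be illustrated with a small simulation. What follows is a minimal sketch under stated assumptions, not the procedure analyzed in the paper: the restriction rule (a player may only stay on the previous action or move to an adjacent one), the running-average payoff estimates, the logit choice rule, and every name in the code (`allowed_actions`, `logit_choice`, `simulate`, `temperature`) are hypothetical choices made for illustration. Each player sees only its own realized payoff and never reads the payoff matrix when choosing.

```python
import math
import random


def allowed_actions(prev, n_actions):
    # Hypothetical restriction: from the previous action, a player may only
    # stay put or move to an adjacent action on a line 0, 1, ..., n_actions-1.
    return [a for a in (prev - 1, prev, prev + 1) if 0 <= a < n_actions]


def logit_choice(estimates, allowed, temperature, rng):
    # Assumed choice rule: logit (softmax) over the currently allowed actions,
    # using the player's own payoff estimates.
    weights = [math.exp(estimates[a] / temperature) for a in allowed]
    return rng.choices(allowed, weights=weights, k=1)[0]


def simulate(payoff_1, payoff_2, steps=5000, temperature=0.1, seed=0):
    # payoff_1[a1][a2] and payoff_2[a1][a2] are the payoffs of players 1 and 2.
    # They are used only to generate realized payoffs, never for decisions.
    rng = random.Random(seed)
    sizes = (len(payoff_1), len(payoff_1[0]))
    estimates = [[0.0] * n for n in sizes]  # running average payoff per action
    counts = [[0] * n for n in sizes]       # number of times each action was played
    prev = [0, 0]                           # arbitrary initial actions
    for _ in range(steps):
        actions = [
            logit_choice(estimates[i], allowed_actions(prev[i], sizes[i]),
                         temperature, rng)
            for i in range(2)
        ]
        a1, a2 = actions
        realized = (payoff_1[a1][a2], payoff_2[a1][a2])
        for i in range(2):
            a = actions[i]
            counts[i][a] += 1
            # Incremental mean of the payoffs observed when playing action a.
            estimates[i][a] += (realized[i] - estimates[i][a]) / counts[i][a]
        prev = actions
    # Empirical distribution of play for each player.
    return [[c / steps for c in counts[i]] for i in range(2)]


if __name__ == "__main__":
    # Rock-paper-scissors as an example of a zero-sum game.
    rps = [[0, -1, 1], [1, 0, -1], [-1, 1, 0]]
    neg = [[-x for x in row] for row in rps]
    print(simulate(rps, neg))
```

The function returns the empirical frequency of each action for both players, which is the quantity the convergence result in the abstract refers to. This sketch makes no claim about convergence; it only shows how restricted choice sets and payoff-only feedback fit together in a repeated game.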

Keywords

Quantitative economics

Fields

Humanities and Social Sciences, Economics and Finance

Contributor: Elisabeth Lhuillier

https://amu.hal.science/hal-01457301

Submitted on: Monday, February 6, 2017, 13:50:32

Last modified on: Monday, March 18, 2024, 10:24:07

Dates and versions

hal-01457301, version 1 (06-02-2017)

Identifiers

HAL Id: hal-01457301, version 1
arXiv: 1306.2918
DOI: 10.1137/130936488

Cite

Mario Bravo, Mathieu Faure. Reinforcement Learning with Restrictions on the Action Set. SIAM Journal on Control and Optimization, 2015, 53 (1), pp. 287-312. ⟨10.1137/130936488⟩. ⟨hal-01457301⟩


Collections

CNRS UNIV-AMU EHESS GREQAM EC-MARSEILLE AMSE
