On the Refinement of Conflict History Search Through Multi-Armed Bandit - Aix-Marseille Université Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

On the Refinement of Conflict History Search Through Multi-Armed Bandit

Mohamed Sami Cherif
  • Fonction : Auteur
  • PersonId : 1290539
  • IdHAL : sami-cherif
Djamal Habet
  • Fonction : Auteur
  • PersonId : 1090275
Cyril Terrioux

Résumé

Reinforcement learning has shown its relevance in designing search heuristics for backtracking algorithms dedicated to solving decision problems under constraints. Recently, an efficient heuristic, called Conflict History Search (CHS), based on the history of search failures was introduced for the Constraint Satisfaction Problem (CSP). The Exponential Recency Weighted Average (ERWA) is used to estimate the hardness of constraints and CHS favors the variables that often appear in recent failures. The step parameter is important in CHS since it controls the estimation of the hardness of constraints and its refinement may lead to notable improvements. The current research aims to achieve this objective. Indeed, a Multi-Armed Bandits (MAB) framework can select an appropriate value of this parameter during the restarts performed by the search algorithm. Each arm represents a CHS with a given value for the step parameter and it is rewarded by its ability to improve the search. A training phase is introduced earlier in the search to help MAB choose a relevant arm. The experimental evaluation shows that this approach leads to significant improvements regarding CHS and other state-ofthe-art heuristics.
Fichier principal
Vignette du fichier
mab.pdf (1.02 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03132228 , version 1 (04-02-2021)

Identifiants

Citer

Mohamed Sami Cherif, Djamal Habet, Cyril Terrioux. On the Refinement of Conflict History Search Through Multi-Armed Bandit. IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI), Nov 2020, Baltimore, United States. pp.264-271, ⟨10.1109/ICTAI50040.2020.00050⟩. ⟨hal-03132228⟩
165 Consultations
92 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More