CAMEO: Curiosity Augmented Metropolis for Exploratory Optimal Policies

Mohamed Alami Chehboune; Rim Kaddah; Luca Martino; Fernando Llorente; Jesse Read

Communication Dans Un Congrès Année : 2022

CAMEO: Curiosity Augmented Metropolis for Exploratory Optimal Policies

(1, 2) , (2) , , , (1)

1
2

Mohamed Alami Chehboune

Fonction : Auteur
PersonId : 754696
IdHAL : mohamed-alami-chehboune
ORCID : 0000-0002-9091-7095

Département d'informatique de l'École polytechnique

IRT SystemX

Rim Kaddah

Fonction : Auteur
PersonId : 1274688
IdHAL : kaddahri

IRT SystemX

Luca Martino

Fonction : Auteur
PersonId : 1134979

Fernando Llorente

Fonction : Auteur
PersonId : 1134980

Jesse Read

Fonction : Auteur
PersonId : 751910
IdHAL : jesse-read
ORCID : 0000-0002-1013-6724

Département d'informatique de l'École polytechnique

Résumé

Reinforcement Learning has drawn huge interest as a tool for solving optimal control problems. Solving a given problem (task or environment) involves converging towards an optimal policy. However, there might exist multiple optimal policies that can dramatically differ in their behaviour; for example, some may be faster than the others but at the expense of greater risk. We consider and study a distribution of optimal policies. We design a curiosity-augmented Metropolis algorithm (CAMEO), such that we can sample optimal policies, and such that these policies effectively adopt diverse behaviours, since this implies greater coverage of the different possible optimal policies. In experimental simulations we show that CAMEO indeed obtains policies that all solve classic control problems, and even in the challenging case of environments that provide sparse rewards. We further show that the different policies we sample present different risk profiles, corresponding to interesting practical applications in interpretability, and represents a first step towards learning the distribution of optimal policies itself.

Mots clés

MCMC Reinforcement Learning

Domaines

Informatique [cs] Mathématiques [math] Statistiques [stat]

Fichier principal

MCMC_EUSIPCO(4).pdf (848.86 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Simo Alami Chehboune : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03675575

Soumis le : lundi 23 mai 2022-11:25:57

Dernière modification le : lundi 7 août 2023-11:56:19

Dates et versions

hal-03675575 , version 1 (23-05-2022)

hal-03675575 , version 2 (14-02-2023)

Identifiants

HAL Id : hal-03675575 , version 1

Citer

Mohamed Alami Chehboune, Rim Kaddah, Luca Martino, Fernando Llorente, Jesse Read. CAMEO: Curiosity Augmented Metropolis for Exploratory Optimal Policies. EUSIPCO 2022, Aug 2022, Belgrade, Serbia. ⟨hal-03675575v1⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

107 Consultations

34 Téléchargements

CAMEO: Curiosity Augmented Metropolis for Exploratory Optimal Policies

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Partager