Loading...

Safety-Guaranteed, Accelerated Learning in MDPs with Local Side Information

In environments with uncertain dynamics, synthesis of optimal control policies mandates exploration. The applicability of classical learning algorithms to real-world problems is often limited by the number of time steps required for learning the environment model. Given some local side information a...

Fuld beskrivelse

Na minha lista:
Bibliografiske detaljer
Udgivet i:Proc Am Control Conf
Main Authors: Thangeda, Pranay, Ornik, Melkior
Format: Artigo
Sprog:Inglês
Udgivet: 2020
Fag:
Online adgang:https://ncbi.nlm.nih.gov/pmc/articles/PMC7676387/
https://ncbi.nlm.nih.gov/pubmed/33223606
https://ncbi.nlm.nih.govhttp://dx.doi.org/10.23919/acc45564.2020.9147372
Tags: Tilføj Tag
Ingen Tags, Vær først til at tagge denne postø!