Carregant...

Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes

We introduce a new formulation of the Hidden Parameter Markov Decision Process (HiP-MDP), a framework for modeling families of related tasks using low-dimensional latent embeddings. Our new framework correctly models the joint uncertainty in the latent parameters and the state space. We also replace...

Descripció completa

Guardat en:
Dades bibliogràfiques
Publicat a:Adv Neural Inf Process Syst
Autors principals: Killian, Taylor, Daulton, Samuel, Konidaris, George, Doshi-Velez, Finale
Format: Artigo
Idioma:Inglês
Publicat: 2017
Matèries:
Accés en línia:https://ncbi.nlm.nih.gov/pmc/articles/PMC6814194/
https://ncbi.nlm.nih.gov/pubmed/31656388
Etiquetes: Afegir etiqueta
Sense etiquetes, Sigues el primer a etiquetar aquest registre!