Carregant...

Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that increase an agent's reward. Unfortunately, most POMDPs are defined with a large number of parameters which are diffi...

Descripció completa

Guardat en:
Dades bibliogràfiques
Autors principals: Doshi, Finale, Pineau, Joelle, Roy, Nicholas
Format: Artigo
Idioma:Inglês
Publicat: 2008
Matèries:
Accés en línia:https://ncbi.nlm.nih.gov/pmc/articles/PMC2868199/
https://ncbi.nlm.nih.gov/pubmed/20467572
https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1901/jaba.2008.301-256
Etiquetes: Afegir etiqueta
Sense etiquetes, Sigues el primer a etiquetar aquest registre!