A carregar...

Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that increase an agent's reward. Unfortunately, most POMDPs are defined with a large number of parameters which are diffi...

ver descrição completa

Na minha lista:
Detalhes bibliográficos
Main Authors: Doshi, Finale, Pineau, Joelle, Roy, Nicholas
Formato: Artigo
Idioma:Inglês
Publicado em: 2008
Assuntos:
Acesso em linha:https://ncbi.nlm.nih.gov/pmc/articles/PMC2868199/
https://ncbi.nlm.nih.gov/pubmed/20467572
https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1901/jaba.2008.301-256
Tags: Adicionar Tag
Sem tags, seja o primeiro a adicionar uma tag!