A carregar...
Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs
Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that increase an agent's reward. Unfortunately, most POMDPs are defined with a large number of parameters which are diffi...
Na minha lista:
| Main Authors: | , , |
|---|---|
| Formato: | Artigo |
| Idioma: | Inglês |
| Publicado em: |
2008
|
| Assuntos: | |
| Acesso em linha: | https://ncbi.nlm.nih.gov/pmc/articles/PMC2868199/ https://ncbi.nlm.nih.gov/pubmed/20467572 https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1901/jaba.2008.301-256 |
| Tags: |
Adicionar Tag
Sem tags, seja o primeiro a adicionar uma tag!
|