Carregant...
Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs
Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that increase an agent's reward. Unfortunately, most POMDPs are defined with a large number of parameters which are diffi...
Guardat en:
| Autors principals: | , , |
|---|---|
| Format: | Artigo |
| Idioma: | Inglês |
| Publicat: |
2008
|
| Matèries: | |
| Accés en línia: | https://ncbi.nlm.nih.gov/pmc/articles/PMC2868199/ https://ncbi.nlm.nih.gov/pubmed/20467572 https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1901/jaba.2008.301-256 |
| Etiquetes: |
Afegir etiqueta
Sense etiquetes, Sigues el primer a etiquetar aquest registre!
|