Carregant...
Simple learning rules to cope with changing environments
We consider an agent that must choose repeatedly among several actions. Each action has a certain probability of giving the agent an energy reward, and costs may be associated with switching between actions. The agent does not know which action has the highest reward probability, and the probabiliti...
Guardat en:
| Autors principals: | , , , , , |
|---|---|
| Format: | Artigo |
| Idioma: | Inglês |
| Publicat: |
The Royal Society
2008
|
| Matèries: | |
| Accés en línia: | https://ncbi.nlm.nih.gov/pmc/articles/PMC3226992/ https://ncbi.nlm.nih.gov/pubmed/18337214 https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1098/rsif.2007.1348 |
| Etiquetes: |
Afegir etiqueta
Sense etiquetes, Sigues el primer a etiquetar aquest registre!
|