Lanean...
Temporal-Difference Reinforcement Learning with Distributed Representations
Temporal-difference (TD) algorithms have been proposed as models of reinforcement learning (RL). We examine two issues of distributed representation in these TD algorithms: distributed representations of belief and distributed discounting factors. Distributed representation of belief allows the beli...
Gorde:
Egile Nagusiak: | , |
---|---|
Formatua: | Artigo |
Hizkuntza: | Inglês |
Argitaratua: |
Public Library of Science
2009
|
Gaiak: | |
Sarrera elektronikoa: | https://ncbi.nlm.nih.gov/pmc/articles/PMC2760757/ https://ncbi.nlm.nih.gov/pubmed/19841749 https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1371/journal.pone.0007362 |
Etiketak: |
Etiketa erantsi
Etiketarik gabe, Izan zaitez lehena erregistro honi etiketa jartzen!
|