Lanean...

Temporal-Difference Reinforcement Learning with Distributed Representations

Temporal-difference (TD) algorithms have been proposed as models of reinforcement learning (RL). We examine two issues of distributed representation in these TD algorithms: distributed representations of belief and distributed discounting factors. Distributed representation of belief allows the beli...

Deskribapen osoa

Gorde:
Xehetasun bibliografikoak
Egile Nagusiak: Kurth-Nelson, Zeb, Redish, A. David
Formatua: Artigo
Hizkuntza:Inglês
Argitaratua: Public Library of Science 2009
Gaiak:
Sarrera elektronikoa:https://ncbi.nlm.nih.gov/pmc/articles/PMC2760757/
https://ncbi.nlm.nih.gov/pubmed/19841749
https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1371/journal.pone.0007362
Etiketak: Etiketa erantsi
Etiketarik gabe, Izan zaitez lehena erregistro honi etiketa jartzen!