
Temporal-Difference Reinforcement Learning with Distributed Representations

Temporal-difference (TD) algorithms have been proposed as models of reinforcement learning (RL). We examine two issues of distributed representation in these TD algorithms: distributed representations of belief and distributed discounting factors. Distributed representation of belief allows the beli...

Bibliographic Details
Main Authors: Kurth-Nelson, Zeb; Redish, A. David
Format: Article
Language: English
Published: Public Library of Science 2009
Online Access: https://ncbi.nlm.nih.gov/pmc/articles/PMC2760757/
https://ncbi.nlm.nih.gov/pubmed/19841749
http://dx.doi.org/10.1371/journal.pone.0007362