Lanean...

Linear Fitted-Q Iteration with Multiple Reward Functions

We present a general and detailed development of an algorithm for finite-horizon fitted-Q iteration with an arbitrary number of reward signals and linear value function approximation using an arbitrary number of state features. This includes a detailed treatment of the 3-reward function case using t...

Deskribapen osoa

Gorde:

Xehetasun bibliografikoak
Egile Nagusiak:	Lizotte, Daniel J., Bowling, Michael, Murphy, Susan A.
Formatua:	Artigo
Hizkuntza:	Inglês
Argitaratua:	2012
Gaiak:	Article
Sarrera elektronikoa:	https://ncbi.nlm.nih.gov/pmc/articles/PMC3670261/ https://ncbi.nlm.nih.gov/pubmed/23741197
Etiketak:	Etiketa erantsi Etiketarik gabe, Izan zaitez lehena erregistro honi etiketa jartzen!

Linear Fitted-Q Iteration with Multiple Reward Functions

Antzeko izenburuak