A Generalization Error for Q-Learning

Planning problems that involve learning a policy from a single training set of finite horizon trajectories arise in both social science and medical fields. We consider Q-learning with function approximation for this setting and derive an upper bound on the generalization error. This upper bound is i...

Bibliographic details
Main author:	Murphy, Susan A.
Format:	Article
Language:	English
Published:	2005
Subjects:	Article
Online access:	https://ncbi.nlm.nih.gov/pmc/articles/PMC1475741/ https://ncbi.nlm.nih.gov/pubmed/16763665
