A Generalization Error for Q-Learning
Planning problems that involve learning a policy from a single training set of finite horizon trajectories arise in both social science and medical fields. We consider Q-learning with function approximation for this setting and derive an upper bound on the generalization error. This upper bound is i...
| Main author: | |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | 2005 |
| Subjects: | |
| Online access: | https://ncbi.nlm.nih.gov/pmc/articles/PMC1475741/ https://ncbi.nlm.nih.gov/pubmed/16763665 |