A carregar...

Balancing Exploration and Exploitation in Self-imitation Learning

Sparse reward tasks are always challenging in reinforcement learning. Learning such tasks requires both efficient exploitation and exploration to reduce the sample complexity. One line of research called self-imitation learning is recently proposed, which encourages the agent to do more exploitation...

ver descrição completa

Na minha lista:
Detalhes bibliográficos
Publicado no:Advances in Knowledge Discovery and Data Mining
Main Authors: Kang, Chun-Yao, Chen, Ming-Syan
Formato: Artigo
Idioma:Inglês
Publicado em: 2020
Assuntos:
Acesso em linha:https://ncbi.nlm.nih.gov/pmc/articles/PMC7206262/
https://ncbi.nlm.nih.govhttp://dx.doi.org/10.1007/978-3-030-47436-2_21
Tags: Adicionar Tag
Sem tags, seja o primeiro a adicionar uma tag!