Reactive Reinforcement Learning in Asynchronous Environments

The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored. Frequently used models of the interaction between an agent and its environment, such as Markov Decision Processes (MDP) or Semi-Markov Decision Processes (SMDP), do not capture the fact tha...

Bibliographic Details
Published in: Front Robot AI
Main authors: Travnik, Jaden B., Mathewson, Kory W., Sutton, Richard S., Pilarski, Patrick M.
Format: Article
Language: English
Published: Frontiers Media S.A. 2018
Online access: https://ncbi.nlm.nih.gov/pmc/articles/PMC7805616/
https://ncbi.nlm.nih.gov/pubmed/33500958
http://dx.doi.org/10.3389/frobt.2018.00079