A carregar...

A Study of Continuous Maximum Entropy Deep Inverse Reinforcement Learning

The assumption of IRL is that demonstrations are optimally acting in an environment. In the past, most of the work on IRL needed to calculate optimal policies for different reward functions. However, this requirement is difficult to satisfy in large or continuous state space tasks. Let alone continu...

ver descrição completa

Na minha lista:
Detalhes bibliográficos
Main Authors: Xi-liang Chen, Lei Cao, Zhi-xiong Xu, Jun Lai, Chen-xi Li
Formato: Artigo
Idioma:Inglês
Publicado em: Hindawi Limited 2019-01-01
Colecção:Mathematical Problems in Engineering
Acesso em linha:http://dx.doi.org/10.1155/2019/4834516
Tags: Adicionar Tag
Sem tags, seja o primeiro a adicionar uma tag!