Gravar-mail: Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards