Gravar-mail: Memory-assisted reinforcement learning for diverse molecular de novo design