Gravar-mail: Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions