Gravar-mail: Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning