Gravar-mail: Joint Multimodal Embedding and Backtracking Search in Vision-and-Language Navigation