Gravar-mail: Toward a clinical text encoder: pretraining for clinical natural language processing with applications to substance misuse