NLProt: extracting protein names and sequences from papers