Gravar-mail: Evaluating the effect of annotation size on measures of semantic similarity