Practical impacts of genomic data “cleaning” on biological discovery using surrogate variable analysis