Gravar-mail: The good, the bad, and the ugly in chemical and biological data for machine learning