The problem with using data from dubious sources, especially when it has been obtained in bad faith & with questionable consent, is that it gives incentive to the continuation of said practices. It's far easier to harvest data if you have a complete disregard for people's rights.
The only way to guarantee that data harvesters follow good practice is to make data obtained through questionable practices untouchable.