Reply to post: Unbiased?

Fake it until you make it: Can synthetic data help train your AI model?

Mike 137 Silver badge


"we can generate whatever distribution of ethnicities, ages, genders you want in your data, so we are not biased in any way"

The moment you specify a distribution up front, you implement a bias (whether or not you're smart enough to recognise that), because your specification is based on your prior expectation.

The reason for sampling at random from a real population is that the sample then carries none of your prior expectations. Statistics 101.
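For anyone who wants to see the difference rather than take it on trust, here is a minimal sketch. Everything in it is an assumption made up for illustration: the two groups "A" and "B", the 30% true share, the 50/50 specification, and the function names `sample_share` and `synthetic_share`. It only shows that a random sample recovers something close to the population's real share, while generated data hands back whatever share was specified.

```python
import random

def sample_share(population, n, rng):
    """Estimate the share of group 'A' from a simple random sample of size n."""
    sample = rng.sample(population, n)  # sampling without replacement
    return sample.count("A") / n

def synthetic_share(spec_share, n, rng):
    """Generate n synthetic labels from a share chosen up front, then measure it."""
    data = ["A" if rng.random() < spec_share else "B" for _ in range(n)]
    return data.count("A") / n

rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical real population: 30% group A. The analyst does not know this.
population = ["A"] * 3000 + ["B"] * 7000

# Random sampling: the estimate is driven by the population itself.
estimated = sample_share(population, 2000, rng)

# Synthetic generation: the analyst's assumed 50/50 split goes in...
generated = synthetic_share(0.5, 2000, rng)

# ...and comes straight back out, whatever the real population looks like.
print(f"random sample estimate of A: {estimated:.3f}")
print(f"synthetic data share of A:   {generated:.3f}")
```

The synthetic figure tells you about the specification, not about the world. Whether a 50/50 split is the "right" target is a separate argument, but it is a choice someone made, which is the point.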
