Inside the 1TB ImageNet data set used to train the world's AI: Naked kids, drunken frat parties, porno stars, and more


...little incentive...

“The data set creators allow some of these 'contaminants' to remain simply because there is little incentive to spend the resources eradicating them all and they have minimal overall effect on the training of machine learning models.”

In other words, they didn't think they would have to do it, and now that the data set is created it's really, really hard to fix, so they don't want to. Nice.

And the "effect on the training of machine learning models" is irrelevant to the privacy concerns.
