Reply to post: Re: Trust of black box systems is overrated

Inside the 1TB ImageNet data set used to train the world's AI: Naked kids, drunken frat parties, porno stars, and more

juice

Re: Trust of black box systems is overrated

> False : New images can be made/found which have full prior consent

That's not the point. The issue being flagged up here is that without the original data sets, it'll be impossible to recreate the AI systems which were trained on them. Which in turn means it's impossible to assess how much of a part the excised images played when it comes to their processing capabilities and/or potential biases thereof.

Admittedly, there is a rebuttal to this, in that you could just retrain the AI with the new data sets and then compare/contrast. But this costs time and money, and by it's very black-box/random-weighting nature (combined with the fact that the new system will probably be trained on different/newer hardware), AI training isn't guaranteed to be reproducible.

Equally, it's debatable how useful the original material is when it comes to dissecting AI logic paths, given how many iterations and weighting actions occur, and how many layers are in the AI black box.

But still, there's definitely some merit to the concerns being raised!

POST COMMENT House rules

Not a member of The Register? Create a new account here.

  • Enter your comment

  • Add an icon

Anonymous cowards cannot choose their icon