Reply to post: Re: AI indexing

Google: We've achieved quantum supremacy! IBM: Nope. And stop using that word, please

Il'Geller

Re: AI indexing

"In RL, a software agent takes sequential actions aiming tocmaximize a reward function, or a negative cost function, that embodies the target problem. Successful training of an RL agent depends on balancing exploration of unknown territory with exploitation of existing knowledge." https://www.nature.com/articles/s41534-019-0141-3.pdf

POST COMMENT House rules

Not a member of The Register? Create a new account here.

  • Enter your comment

  • Add an icon

Anonymous cowards cannot choose their icon