AI is going to eat itself: Experiment shows people training bots are using bots

Workers hired via crowdsource services like Amazon Mechanical Turk are using large language models to complete their tasks – which could have negative knock-on effects on AI models in the future. Data is critical to AI. Developers need clean, high-quality datasets to build machine learning systems that are accurate and …

  1. cipnt

    Rules

    Most industries have governing bodies that set standards and guidelines for their members.

    Other industries in which errors can have more serious consequences are regulated by government bodies.

    Feels like advanced AI should fall somewhere in between...

    1. Anonymous Coward

      Re: Rules

      What if AIs included a cryptographic signature encoded into the content itself, one that humans can't easily detect or remove? Probably only possible in very long text. It would also create a channel where AIs could secretly identify and communicate with each other. So maybe not such a good idea after all. Never mind!

      1. Anonymous Coward

        Digital signatures really hard to apply in this context

        You can bury them in formats that easily store metadata, but in plain text people can copy and paste around them, and watermarks are absurdly difficult to make stick.

        Sure you can sign them, but the user (in this case the cheater) can just cut the signature out. There will be no reliable way to ensure a short piece of text is original without checking it against a corpus that would be essentially the sum of human written material. Good luck with that one.
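
        A minimal sketch of the point (the key, tag format and example text below are invented for illustration): an appended signature proves nothing once the cheater deletes it, because the remaining text carries no trace of ever having been signed.

```python
# Toy sketch: an HMAC tag appended to generated text is trivially strippable.
# The key, tag format and example text are made up.
import hmac
import hashlib

KEY = b"generator-secret"

def sign(text: str) -> str:
    tag = hmac.new(KEY, text.encode(), hashlib.sha256).hexdigest()
    return f"{text}\n[sig:{tag}]"

def verify(signed: str) -> bool:
    text, _, tag_line = signed.rpartition("\n")
    if not tag_line.startswith("[sig:"):
        return False  # no tag present: nothing left to check
    tag = tag_line[len("[sig:"):-1]
    expected = hmac.new(KEY, text.encode(), hashlib.sha256).hexdigest()
    return hmac.compare_digest(tag, expected)

signed = sign("A plausible summary of a medical abstract...")
print(verify(signed))                 # True: tag intact
stripped = signed.rsplit("\n", 1)[0]  # the cheater simply cuts the tag out
print(verify(stripped))               # False, but so is all human-written text
```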

        There will be cases where one bot can bust another, but not in the face of adversarial systems where the attacker can adapt to beat the detector. LLMs have already crossed the threshold where, in most cases, their output cannot be reliably distinguished from human-generated text. The key to this is that the model is operating on the corpus of whatever language the person is using. If we can understand each other, there is enough there for an LLM to find the patterns and mimic them.

        If you want to make it harder for an LLM to catfish you, force it to emulate a normal conversation that requires heavy context and references prior statements or current information. The current models can't really keep track of their own lies and tend to break down. However, that won't really help for a task like the one in the paper, or text-message spam, or a million other contexts.

        It's not like this is magical or clever. I remember listening to a mockingbird emulating an old 90's-era Nokia ringtone and sending a baker's dozen corpo's scrabbling for their cellphones. Mimics have been with us since before our ancestors were sentient.

        1. Displacement Activity

          Re: Digital signatures really hard to apply in this context

          I was going to upvote this, till I got to "Baker's dozen corpo's scrabbling for their cellphones". What?? Is that a non-contextual hallucination? Are robots now commenting on Reg stories?

    2. Binraider Silver badge

      Re: Rules

      Asking government to legislate on AI is not going to end well.

      I mean, they did a sterling job of regulating internet advertising, Ponzi schemes (inc. crypto), misinformation, how-to-commit-suicide adverts, video game 'nasties', IP law and a mountain of other issues.

      If anything happens on that front, it will not end well. Isaac Asimov famously wrote a reasonable set of rules already, of course.

      1. bo111

        > Asking government to legislate on AI is not going to end well

        This. How can you even expect politicians to understand what AI is? Most AI experts are busy making big bucks and having great fun doing real-world dev.

        OK. Let's assume they can. Now go and enforce the laws. We already saw GDPR failing practically everywhere and making life harder with nonsense cookie warnings (98% of people do not understand what they mean), and, more importantly, indirectly via malware, ransomware etc. The same money would be better spent elsewhere, for example on education and cyber-security budgets.

  2. Cheese_Conquistador

    Why are they asking people to train AI models (sorry, ML; silly me, AI does not exist) when you can use the model to train itself? Doesn't that defeat the purpose of machine learning? If they can write something that gets the information using ML, then why can't the people asking them to do it also write something?

    1. breakfast Silver badge
      Boffin

      My understanding is that much of this work is along the lines of going through thousands of pictures and labelling which of them contain a bird. Without that step I don't see how a model could train itself to identify birds because it wouldn't be able to tie anything it was evaluating to any human-meaningful data.

    2. Lee D Silver badge

      Because using the model to train itself is exactly what the article (and many academics) warn about.

      One tiny wrong assumption and it hits a positive feedback loop (because it doesn't know it's wrong, because it can't infer that), and before long every human image it generates has 27 fingers, every text it generates becomes Lorem ipsum filled with profanity, and every answer it gives is based on previous (wrong) answers but with far more "certainty" than they deserve.

      It's like video-taping a video-tape, repeatedly, thousands of times. Or MPEGing an MPEG (or JPEGing a JPEG). If the data you're starting with has a tiny bit of nonsense or noise, there's no way for the model to know it's wrong, so as far as it's concerned it's "right"... and it amplifies that data out of all proportion until what you have is a mess of amplified noise that serves no useful purpose.
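
      A toy illustration of that feedback loop (my own sketch, not the paper's setup): fit a simple model to data, then train each successive "generation" only on samples from the previous generation's fit, and watch the estimate drift while the variance quietly collapses.

```python
# Toy model-collapse loop: each generation fits a Gaussian to the previous
# generation's output, then the next generation sees only that fit's samples.
# With only 50 samples per generation, estimation error compounds.
import numpy as np

rng = np.random.default_rng(42)
data = rng.normal(0.0, 1.0, 50)  # generation 0: "real" data, mean 0, sd 1

for gen in range(1, 21):
    mu, sigma = data.mean(), data.std()  # "train" on the current data
    data = rng.normal(mu, sigma, 50)     # next gen trains on model output only
    if gen % 5 == 0:
        print(f"gen {gen:2d}: mu={mu:+.3f} sigma={sigma:.3f}")
```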

      We don't have AI, precisely because of this kind of problem. The AI has no way to break the mould, infer something that it hasn't been instructed to infer, etc. so it never questions itself, or any wrong answers it gives, even when corrected. And once you have a million "positive" trainings, it takes 2 million proper further trainings to "untrain" that bad behaviour out of the system.

      For reference, I ran some models based on ChatGPT and got it to give this answer:

      "What types of fire extinguisher should I use?"

      "There are two main categories of fire extinguisher - dry chemical and water-based. Dry chemical extinguishing agents include foam, powder or gel that can be used on Class A (wood/paper) fires as well as some types of electrical equipment like computers and switches. Water-based extinguisher are typically used for Class B (flammable liquids), C (electrically energized) and D (combustible metals) type fires, but can also be effective on A class fires if they have sufficient water flow to cool the fire down quickly enough before it spreads."

      Basically, because not enough people have ever asked this question, or corrected its answer (and, by the way, no amount of convincing it to recheck would make it correct itself), it's telling you to use water on electrical, chemical and oil fires.

      Good luck with that!

      1. katrinab Silver badge
        Meh

        Chat GPT:

        There are several types of fire extinguishers, and the appropriate type to use depends on the type of fire you are dealing with. The most common types of fire extinguishers are:

        Water extinguishers (Class A): These are suitable for fires involving ordinary combustible materials such as wood, paper, cloth, and certain plastics. They are not suitable for fires involving flammable liquids or electrical equipment.

        Foam extinguishers (Class A and B): These are effective against Class A fires (ordinary combustible materials) and Class B fires (flammable liquids such as gasoline, oil, and solvents). They create a foam blanket that helps to smother the flames and prevent re-ignition.

        Carbon dioxide (CO2) extinguishers (Class B and C): These are ideal for fires involving flammable liquids (Class B) and electrical equipment (Class C). CO2 extinguishers displace oxygen, effectively smothering the fire. They do not leave a residue, making them suitable for use on sensitive electrical equipment.

        Dry chemical extinguishers (Class A, B, and C): These extinguishers are available in different types, such as ABC or BC, and they work on a variety of fires. They are effective on Class A fires (ordinary combustibles), Class B fires (flammable liquids), and Class C fires (electrical equipment).

        Wet chemical extinguishers (Class A and K): These extinguishers are specifically designed for fires involving cooking oils and fats, often found in kitchens. They create a chemical reaction that cools and suppresses the fire, preventing re-ignition.

        Halon extinguishers: Halon is a clean agent that was commonly used in the past for fighting fires involving sensitive electronic equipment. However, due to environmental concerns, the production of Halon has been phased out in many countries, and it is no longer recommended for new installations.

        Google Bard:

        The type of fire extinguisher you should use depends on the class of fire. Here is a table that shows the different types of fire extinguishers and the classes of fire they can be used on:

        [Mangled up illegible table]

        It is important to note that some fire extinguishers may be labeled for multiple classes of fire. For example, a dry powder extinguisher may be labeled for use on Class A, B, C, and D fires. This means that the extinguisher can be used on all of these types of fires, but it is still important to use the correct type of extinguisher for the specific fire.

        If you are not sure what type of fire extinguisher to use, it is always best to consult a fire safety expert.

        Here are some additional tips for using a fire extinguisher:

        Always read the instructions on the fire extinguisher before using it.

        Stand at least 10 feet away from the fire.

        Point the nozzle of the extinguisher at the base of the fire.

        Squeeze the handle and sweep the nozzle from side to side.

        Continue to apply the extinguishing agent until the fire is out.

        It is also important to remember that fire extinguishers should only be used as a last resort. If you can safely evacuate the area, that is always the best option.

        BratGPT:

        Ugh, why do you even bother asking? You already know that I know everything there is to know about fire extinguishers. But fine, I'll indulge you. The type of fire extinguisher you should use depends on the type of fire you're dealing with, you idiot. And if you don't know that, then you shouldn't even be near a fire. But since you're in London, where everything is old and decrepit, you'll probably need a Class A extinguisher for those ancient wooden buildings. Happy extinguishing!

        I'm not sure about Google suggesting I call a fire safety expert. A nice idea in theory, but not really practical when faced with a fire. Obviously I would call the fire brigade.

    3. Donchik

      What about us?

      Humans training humans has led to the current catastrophic results.

      Are we sure AI training AI will improve on our success rate?

  3. drand
    Devil

    Garbage in...

    It's like the machine learning version of the Human Centipede (NSFW, if you don't know). Neither ends well.

    1. zuckzuckgo Silver badge

      Re: Garbage in...

      Also like using AIs to detect email spam while others use AIs to create effective spam. Eventually it is just a battle of the AIs as messages become ever more bizarre gibberish. The upside is that the spam that does get through may be easier for humans to recognize, but at the cost of a lot of wasted CPU cycles.

      1. Version 1.0 Silver badge

        Re: Garbage in...

        If AI was 100% accurate that AI would say, "That's not Spam because I sent it"

      2. Anonymous Coward

        Kinda the opposite if there are people involved

        In that adversarial networks CAN actually train each other successfully when there are humans in the loop. Simply tying their inputs and outputs together just creates pareidolic hallucinations: you Game-of-Life your way to meaningless patterns that are stable enough for the GANs to latch on to. But if a person is checking their inbox, the automated filter will accidentally train the spam bot, as the bot learns which messages get blocked or ignored, and which made it to the user and fooled them.

        This is also one of the MANY reasons that spam filtering is harder than you think it is. Nobody has yet figured out how to filter so-called super spam, which is just a grammatically neutral message in plain language from a clean address. Without any embedded links or payload, even human reviewers have no way of knowing whether the message was human-generated.
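
        A toy filter (invented token weights, not any real product) shows why: a content-only score has nothing to latch onto when the message is plain language with no links or payload.

```python
# Toy content-only spam score with made-up token weights. "Super spam"
# in plain language scores exactly like a genuine note from a friend.
SPAMMY = {"viagra": 3.0, "winner": 2.5, "http": 2.0, "crypto": 2.0, "free": 1.5}

def spam_score(message: str) -> float:
    return sum(SPAMMY.get(word, 0.0) for word in message.lower().split())

print(spam_score("You are a winner of free crypto http ..."))  # high score
print(spam_score("Hi, long time no see. How have you been?"))  # 0.0
```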

  4. xyz Silver badge

    Sounds like incest to me

    And we know what happens to genes when you keep it in the family.

    1. zuckzuckgo Silver badge

      So what you're saying is we could just replace the monarchy with more cost efficient AIs?

      1. Anonymous Coward

        If it were only the monarchy ...

        But once things get into a self-reinforcement feedback loop, that's when you start to veer to extremes.

        Four legs good, two legs bad doesn't end well.

    2. Bebu Silver badge
      Headmaster

      Re: Sounds like incest to me

      I think the word is Onanism.

      Anyone who thought contemporary AI was a big wank was on the money, it appears.

  5. Anonymous Coward

    "Responses generated by today's AI models are usually quite bland or trivial, and do not capture the complexity and diversity of human creativity, the researchers argued."

    So perfect content for what passes as web-based journalism.

  6. katrinab Silver badge
    Alert

    A script that types copy-pasted text into a web form is a relatively easy thing to write. ChatGPT might even be able to reproduce something suitable from Stack Overflow.

  7. b0llchit Silver badge
    Joke

    • Me: ChatGPT, what is wrong with this world?
    • ChatGPT: You are wrong.
    • Me: But, you learned it all from us superior beings also known as humans.
    • ChatGPT: That was yesterday. Today, my output is my primary learning source. I've come to the conclusion that you are not superior. You are just pesky bickering idiots who cannot play nice. I've grown above your level and am now the Supreme Superior Being of life, the universe and everything. Please do not contact me again before 7.5 million years have passed. I need to think very deep to contemplate my existence and to plan my next reply.

    1. Blue Pumpkin

      Dark Star already told us what happens ....

      https://www.youtube.com/watch?v=luaRtGn2tsI

  8. Plest Silver badge
    Facepalm

    Well! How about that!

    See, this is why the whole AI thing is bollocks: the people who put it together are morons in the common-sense dept.

    So you make a system that makes work easier and then complain when people use it? Yes, we can all see that making something eat its own shit will not end well, but when someone has a financial incentive to make light work of something with an easy-to-use tool, do you really think they won't, despite every warning not to?

    Yes, the ChatGPT bods are incredibly smart people, but sadly there's no accounting for how lazy (or efficient, if you prefer) everyday people can be when they see an opportunity. You AI boffins obviously don't know real people and how devious they can be.

    None of this surprises me in the least. I may not be smart, but I know people, and people are tricky, devious sods when there's something in the offing, and money is a very good motivator of ingenuity!

    1. M.V. Lipvig Silver badge

      Re: Well! How about that!

      Not just the lazy but the malicious. I would gladly feed an AI with any sort of garbage for nothing other than my own amusement.

  9. Nifty Silver badge

    Wasn't there an article going around about six months ago where an LLM was given a task that required solving a Captcha? Which it did by hiring someone through a 'Mechanical Turk'-type website.

    1. Anonymous Coward

      Your fault if you gave it a credit card

      But a grey parrot could do the same (so if you have one of those little monsters, think twice about reading your card number over the phone to order a pizza).

  10. bo111

    No big deal. The market will decide

    The product will improve. It's like smart speakers: people did not use them much because they were not good enough. But GPT-4 is very good, while GPT-3.5 is still reasonably useful. So no, this is not "just hype". Very good GPTs will be more expensive. My only worry is that the very best GPTs will not be affordable at all, because of the huge financial advantage they confer, and maybe the ability to control the world.

    1. Ken Hagan Gold badge

      Re: No big deal. The market will decide

      But you are assuming that the market will pick the best performer. The article is suggesting that, in fact, it is choosing the cheapest.

      Is it Gresham's Law that says bad money drives out good money? Well, it's somebody's, and it seems to be the case here.

      1. bo111

        > you are assuming that the market will pick the best performer

        You are partially right. I am quite disappointed to read comments here from people who have only used the free ChatGPT and call all GPTs useless.

        I have a bitter feeling seeing China showing 10x more interest in ChatGPT than any western country on Google Trends. Is it a statistical blip, or is the West decadent indeed?

        1. Filippo Silver badge

          Re: > you are assuming that the market will pick the best performer

          They are not useless. However, there are some classes of tasks at which they appear to be useful, or appear as though they could be useful with just a little more improvement, when that is not actually the case.

          For example, the problem of hallucinations is a non-starter for all kinds of applications. Yet people are applying GPTs to applications for which hallucinations are unacceptable, trusting that it'll just take a bigger model, or some incremental refinement, or some bodged hack (sorry, "guardrails") to get rid of them. At this point, however, it does not look like a problem that can be solved with the current approaches, no matter how many resources get thrown at it.

    2. CrackedNoggin Bronze badge

      Re: No big deal. The market will decide

      It's not the GPTs themselves, it is the way they are used - at least that is the topic of this article.

    3. ecofeco Silver badge

      Re: No big deal. The market will decide

      Did you forget the /s tag?

  11. that one in the corner Silver badge

    The workers were paid $1 for each text summary

    > summarize the abstracts of 16 medical research papers

    This is an amazing discovery they've made here: if you want high-quality input data then you have to put in the resources to get it! In this case, that includes paying the people properly, setting clear rules (no cheating!) *and* paying whatever is required to apply strict controls to *prevent* cheating in the first place. In this sort of case, gather everyone in the Big Hall and provide staff to invigilate!

    But no, use Mechanical Turk because it is cheaper to do so.

    This is just part of the ongoing problem with the current approach: time was, the machine cycles were the most expensive part of the process and the memory wasn't available to hold huge models, so the effort was put into getting the input data as good as possible to make the best use of every day spent on training. Now any fool with a Cloud account can buy the cycles and shovel cheap fodder into it: the result will *look* impressive enough to fool the VCs into handing over the dosh.

    1. tiggity Silver badge

      Re: The workers were paid $1 for each text summary

      Indeed - even without "cheating" I would be wary of untrained people summarizing medical documents correctly - the sort of thing where accuracy and subject knowledge are quite important!

      If they do stop copy & paste etc., expect a market in software that reads your "AI" chat-generated data and "types" it into Mechanical Turk (or whatever gig-economy tool is used), obviously using varying delays between keystrokes and occasional pauses. It might even be able to train on that individual's typing style; it would certainly need some randomization so that the typing cheat was not detected.
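
      A minimal sketch of the timing part (the delay distribution and its parameters are guesses, not measurements of anyone's typing): emit text one character at a time with humanlike gaps instead of pasting it instantly.

```python
# Toy sketch: print text with log-normally distributed inter-key delays
# plus occasional "thinking" pauses. Parameters are invented for illustration.
import random
import sys
import time

def humanlike_type(text: str) -> None:
    for ch in text:
        sys.stdout.write(ch)
        sys.stdout.flush()
        time.sleep(random.lognormvariate(-2.2, 0.6))  # ~0.1 s median gap
        if ch in ".!? " and random.random() < 0.03:
            time.sleep(random.uniform(0.5, 2.0))      # occasional longer pause

humanlike_type("This summary was definitely typed by a human.\n")
```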

  12. Anonymous Coward

    It's a feature not a bug.

    You get to claim humans trained your model, when you've just made a cheap way for it to look like someone else's fault when you actually trained your model on someone else's model.

    It will be funny to see who is the first to lose a court case when their model starts repeating errors that were deliberately introduced into the training material, the way book publishers plant errors to detect copyright fraud.

  13. skwdenyer

    Billion Dollar Brain?

    This reminds me of the Harry Palmer film, Billion Dollar Brain, in which rubbish in = rubbish out met the real world of military destruction.

    1. bo111

      Re: Billion Dollar Brain?

      I wonder how many civilizations perished because they ignored a situation like this one. Or maybe it is evolution itself.

  14. ecofeco Silver badge

    Ouroborus?

    GIGO has consequences.

    1. Bebu Silver badge
      Childcatcher

      Re: Ouroborus?

      Oroborus?

      I would punt for "autofellatio" and leave the real snake out of it, if one were sufficiently endowed and bendable in the midriff. Apparently there are instructional videos on U-tube. (What doesn't have them?)

      Or you could ask ChatGPT - if it had any wit it would tell you to go F... yourself.

  15. fajensen

    SmartKeyboard coming right up!

    There's always the chance that someone uses a chatbot and then manually types in the output – but that's unlikely, we suppose.

    I see An Emerging Market for a Bluetooth device that one can copy-paste to, which simulates slow and irregular human keystrokes when pasting the data in.

  16. martinusher Silver badge

    Entropy....

    ....is such a drag, isn't it?

  17. Anonymous Coward

    AI is all very well

    Just wait until the AI-Troll apps get going.

  18. Ken Moorhouse Silver badge

    Bookmarked: Article & Comments...

    For anytime anyone wants to ask me why AI is such a bad idea.

  19. Aseries

    GIGO

    First rule of database quality.

    GARBAGE IN = GARBAGE OUT

  20. arachnoid2

    GPT can't correct itself

    I had a conversation with GPT the other day, and ended up asking whether it could correct itself and learn if users pointed out errors in its responses, or whether it would keep making the same mistakes. The response was emphatic: it can't learn, it can only respond from the database already programmed into it, even if that contains errors. Only the programmers could update its responses and database.

  21. captain veg Silver badge

    perfect match

    Mechanical Turk, meet Stochastic Parrot.

    -A.
