back to article The AI everything show continues at AWS: Generate SQL from text, vector search, and more

Another day at AWS re:Invent and yet more talk of artifical intelligence dominated, with a senior executive taking to the stage to wax lyrical about the impact of vector databases on the tech and more. Dr Swami Sivasubramanian, AWS VP of Data and AI, gave the official AI keynote at re:Invent in Las Vegas, a day after AWS CEO …

  1. Doctor Syntax Silver badge

    "the ability to generate SQL from text input"

    HR departments will need to avoid any candidates called Robert Tables.

    1. Anonymous Coward
      Anonymous Coward

      That's the least of their worries. We've tried something similar in the recent past, and the model had such a high error rate (generating either syntactically or semantically inaccurate queries) that it never really got anywhere. All language model outputs currently require human validation. Any service that tries to do otherwise is... premature at best.

  2. Herring`

    A little knowledge

    Back in the day, a few management types figured out that if they asked me nicely, I could answer questions like "How many $things of $type happened in $timewindow in $areacode". We only had the prod DB - no reporting replica.

    Once or twice a month, I would see one of the DBAs stand up and walk over to my desk to ask what the fuck I was running cos it was maxing out the CPU. Happy days.

    1. Doctor Syntax Silver badge

      Re: A little knowledge

      You should have told them to add some useful indexes. Either that or run UPDATE STATISTICS.

  3. John H Woods Silver badge

    ChatGPT 3.5 reveals training data...

    ... Seriously. Spitting out PII such as email addresses and telephone numbers, and whole expanses of undigested verbatim training material.

    Guardrails might be much more important... they may be required to stop your LLM regurgitating material it shouldn't.

    Even if you haven't got time for anything else, scroll down to the first embedded video for a laugh out loud moment at the attack used.

    https://not-just-memorization.github.io/extracting-training-data-from-chatgpt.html

    1. Doctor Syntax Silver badge

      Re: ChatGPT 3.5 reveals training data...

      They say training data is only 1% of the outputs they've got. However it raises the question of whether other prompts would extract more and more of the training data. It seems to knock on the head any copyright defence that they're not really storing the training data.

POST COMMENT House rules

Not a member of The Register? Create a new account here.

  • Enter your comment

  • Add an icon

Anonymous cowards cannot choose their icon

Other stories you might like