Re: I give up re:speed on hype cycles
11 months is ages in AI years.
Massive changes can happen in a week.
I decided to go down the rabbit hole and take a look at what is out there, deploy some technologies on my own kit and several things jumped out at me.
1) It is incredibly cheap to roll out.
2) With the cost in mind, the results you can get are staggering.
3) There are new models being released every day (yes, quite a few of them are extensions of base models, but for every crap model, there are a couple of interesting ones).
4) The technologies around AI to enhance AI are moving incredibly quickly. Hypernetworks, Lora, Lycoris, tons of diffusers, networks, GANs etc etc...
5) It's moving so quickly, that by the time you see something hit the media, it's already old news.
The key thing appears to thinking of the likes of ChatGPT, LLAMA etc as sort of "OEM" AI technologies. They are general purpose, tech demos essentially.
Is AI a buzzword? Maybe...is it in the same camp as other buzzwords...e.g. WEB3, DevOps, Cloud etc? Definitely not.
With regards to this particular article..."AI trained on curated and domain-specific data"...there are two obvious ways you can look at this. Firstly, from the perspective of someone that is only seeing the media hype-cycle...it seems like an experimental and weird decision to make with potentially unproven results.
From the perspective of someone that has been actively using, experimenting with and deploying AI...beyond the "lol I made Bing angry" bubble...this doesn't seem like a dumb move...setting up a local model, based on a very good base model, that has been enhanced with some additional training to make it better in specific areas, then giving it access to "domain specific" data seems like a reasonably and straight forward move...especially considering that all you need to get started is a quad core CPU (I'd recommend anything 6th gen and up), 8GB of RAM and a 256GB SSD...you fire up the client you want to use (determine if you want a REST API, a full blown chat client, Web UI etc), pick a model that meets or comes close to your requirements, present the client with a path to your "local domain" information...essentially a folder full of documents that you have picked for the AI to learn form...and away you go. You start prompting it, seeing the results, and fine tuning the data you're giving it to round out the results you're looking for.
You can do that yourself, right now, using GPT4ALL in a few clicks...see the results for yourself and form an opinion based on a real world experiment that you performed, rather than just being sick of the "hype".
You can have this kind of solution up and running in under an hour and I think that is the attraction right now...you can bring an old box out of retirement and turn it into your own local AI. The investment can literally be zero.
It's dirt cheap, pretty easy to setup and you can start seeing results very quickly...and as the models improve in very short periods of time, you can see continual improvement in very short spaces of time.
If you could add a technology to your business that has more benefits than drawbacks for virtually nothing, to see if you can derive some sort of competitive edge from it, why wouldn't you want it?
AI may be a buzzword, but it's a buzzword that doesn't require heavy investment, massive changes to your business and enormous amounts of technical skill to setup...it's only going to get easier and as the models keep improving, it's only going to get better...whether we like it or not, it's going to be incorporated in a typical tech stack, in some shape or form, whether we like it or not.
What we need to do at this stage, is figure out how minimise the reliance on cloud based AI to prevent our idiot clients signing up for the latest shiny and uploading terabytes of company data. That is where I see massive problems in the future...some fuckwit CEO decided on his own that linking the company SharePoint, OneDrive and Outlook on Office 365 to WhamBamThankYouMamAI and now that "fly by night" platform has access to anything and everything and becomes a target for hackers...or worse, proprietary information has now been incorporated in a model that could at some point leak to the internet and be used as a "base" model or is manipulated into spewing loads of private / proprietary information.
There is a lot of dumb shit on the horizon that we're going to have to deal with, unless we stay ahead of the curve and nip in the bud before it happens.
In terms of the quality of AI...well, that all depends on who is training it and with which data...ultimately, it will be people like us training AI models for businesses...unless you're a naysayer, in which case the CEO will hire a third party company that demands full admin rights over *your* network. Shit always follows the path of least resistance...and in this case, it's firing the guy getting in the way. So you can either roll with it, understand it and be in a position to ensure that this "hype" tech gets deployed properly, or you can resist it and head to the job centre.
Finally, not all AI is ChatGPT...most of my clients have been using AI in some shape or form, for quite a while now...for some it might be as simple as an AI based upscaler, for others it might be an AI copilot for programming, some people might be using AI scheduling assistants...because I know a lot of the products, I don't see AI as this looming thing in the distance, it's very much already here...thing is, the genie is already out of the bottle...pretty soon we're going to have an AI infused Office 365...at that point, AI becomes basically unavoidable and will be everywhere, it stops being niche and becomes something that everyone has...and it'll hit you like a truck because suddenly you're going to be balls deep in clients asking you questions about AI, how to use it, how to get the best out of it, what they should and shouldn't use AI for, what should they share with it, can you just hook up the company shared drive real quick? etc etc...at that point you're either going to have sensible, rational reasons to gently let them down that you can bring to their attention or you're going to be the crazy "it's just a buzzword, we don't need that shit" guy...being that guy, never goes down well in the long run.