> The difference between SD and the more commercial offerings is still large.
Absolutely. SD is infinitely configurable, doesn't charge me for using it, it doesn't rely on the goodwill of some corporation to use, people can and do freely adapt it to an enormeous number of use cases, there are hundreds of models, hypernetworks, textual inversions and LoRAs to chose from, and there is absolutely zero chance for some dictatorship to limit use cases based on insecurities of their beloved leaders.
Whereas commercial offerings ... well ... ummm ... yeah. Okay, granted: They don't require me to have the hardware. Okay, I'll give em that.
Oh, what's that? I have that hardware already, because I play videogames, and it sit's idle most of the time anyway? Well, guess I'm not gonna be a customer then :D
> They just work
So does stable diffusion if the hardware supports it. InvokeAI has an installer that's not more complicated than installing any old desktop PC software. And yes, one needs beefy enough hardware, and some things are not supported, and on it works only with a good amount of technical expertise ... same as with videogames.
> the results are a bit disappointing
Sorry, what exactly is the definition of "disappointing" we're talking about here exactly?
> and it's slow enough that you'll have got bored by the time your first image is delivered.
That depends entirely on the hardware. My current setup delivers 1024x1024 images in under 12 seconds, that's with float32 full-precision. Granted, thatrequires a somewhat expensive setup, but that too is the same as with videogames...I cannot expect VR games to run smoothly either if the hardware cannot back it up.