The Register Home Page

* Posts by Doctor Syntax

42030 publicly visible posts • joined 16 Jun 2014

Page:

Life, interrupted: How CrowdStrike's patch failure is messing up the world

Doctor Syntax Silver badge

Re: Microsoft to blame, surely?

"If you cannot code defensively to ensure third party services don't take down your product when they fail, you have no place as a software engineer or a software company."

Likewise if you can't code defensively to ensure your product isn't taken down by your own badly formatted data file.

Doctor Syntax Silver badge

Re: WTF?

If it's embedded then embed it very thoroughly. Nobody gets near it to install viruses so no AV, no AV updates.

Doctor Syntax Silver badge

Re: WTF?

What part of "safe" did you overlook?

Doctor Syntax Silver badge

Re: WTF?

"hiding in the basement."

The one with the sewer release valve in it or the one with the killer robot?

Doctor Syntax Silver badge

Re: WTF?

I’d expect hope an elevator to would go to the next floor and open its doors, whatever happens on the outside.

FTFY

CrowdStrike file update bricks Windows machines around the world

Doctor Syntax Silver badge

Re: Related?

The issue here is worse than that. It wasn't the kernel module itself that was replaced, it was a data file which triggered a bug that had been there all along. There was no good version to roll back to. It was entirely the responsibility of the kernel module or whatever it was to handle the bad data file.

Doctor Syntax Silver badge

As I keep saying, there's never the budget to do things right but there's always the budget to fix it when it goes wrong.

Doctor Syntax Silver badge

Re: Related?

"leaves you system running but with all the security turned off"

It's not a straightforward question. If the system is in some way compromised but apparently undamaged this is not good. However just crashing the system and making it unbootable deprives you of your best tool for diagnosis and repair - the system itself. The best course of action might be to boot into some sort of safe mode. The advised solution is indeed to do this although it doesn't seem ti be automatic. From what I've read here the limitation of this is that if Bootlocker is installed it would normally obtain the keys from the server but this isn't an option when booting to safe mode, they need to be entered manually which depends on there being a more accessible copy. It would obviously have helped if systems were able to obtain the server's copy of the keys when booting to safe mode. Of course if the server is also down because of this...

The issue, as always, is to try to anticipate problems at design time and set a goal of having as few problems as possible that can prevent the system getting into a state where it can at least act as a tool to help recover to the intended production state.

Doctor Syntax Silver badge

Re: Related?

"they always are going to have the ability to crash it if they screw things up."

... and hence an obligation to ensure it fails gracefully if it encounters a dodgy data file.

Doctor Syntax Silver badge

Re: Related?

Would you care to give us a more extensive benefit of your insight?

Doctor Syntax Silver badge

Re: Related?

This comes into the category of "now you have two problems".

Doctor Syntax Silver badge

Re: What I actually like ..

Running a mixed environment would be safer. Still fraught with cost, much less so risk.

Doctor Syntax Silver badge

Re: What I actually like ..

BBC is now asking people to explain the term "bricking"

I hope the explanation doesn't involve camels.

Doctor Syntax Silver badge

Code review? That's a bit of an assumption. So, it would appear, is pre-release testing.

Doctor Syntax Silver badge

Re: S!y News and others borked

So the Fail hasn't!

Doctor Syntax Silver badge

Re: This will be fun for their marketing ..

Their marketing bods will do well out of this. They can point to how successful they were at getting their product into so many big corporations when they go looking for new jobs.

Doctor Syntax Silver badge

Re: Why oh why

"The media do a generally poor job explaining tech."

The customers will generally do a poot job of understanding it anyway.

Doctor Syntax Silver badge

Re: Lessons won't be learned

That remaining 1% can get you into a lot of trouble, however.

Doctor Syntax Silver badge

Not to mention "test".

Doctor Syntax Silver badge

And it's working out so well, isn't it.

Doctor Syntax Silver badge

"Weird that a company like Crowdstrike allows non-spokespersons to put out statements"

How do you know he's a non-spokesperson?

Doctor Syntax Silver badge

In this case the whatever OS was Windows.

Doctor Syntax Silver badge

Re: One assumes

If they're critical they should run something better. Something designed from ground up to be a server, not something adapted from a desktop design.

Doctor Syntax Silver badge

Re: Fun Times......

Nevertheless it's a trade-off of risks - or it should be if you recognise the risks. But if you do recognise the risks you are at least in a position to look at mitigations.

CrowdStrike shares sink as global IT outage savages systems worldwide

Doctor Syntax Silver badge

It's not just the updater/release process that needs to be improved. This was just a data file of some sort and the S/W that read it fell over taking the rest of the system and the reboot process with it. A program with that ability should reject a bad data file without falling over.

Doctor Syntax Silver badge

Re: The fault's with Microsoft

Or in this case DTaaS.

Doctor Syntax Silver badge

Re: There's something familiar about all of this...

There are these old-fashioned things called IT departments. As this seems to be a product aimed at big corporates there's a fair chance their customers still have them. Not guaranteed these days, but a fair chance. The IT department does the test and makes the decision on behalf of its users - and does the roll-out. I suppose they could still roll out something they know will bork all the workstations on the grounds that it will keep out ransomware but at least it becomes a deliberate choice.

Doctor Syntax Silver badge

Re: "CrowdStrike, not Microsoft" - debatable

There's an argument that a vendor program should fail gracefully if it finds a vendor-supplied data file which it can't handle, especially if hte file is so inessential that the workaround is to simply delete the file.

Doctor Syntax Silver badge

The first rule of trust is "trust nobody".

Doctor Syntax Silver badge

Re: The fault's with Microsoft

You're buried a long way down in the comments. What was the suggestion you're querying?

FWIW Linux kernel upgrades usually leave one or more old kernels in place. The user will get a few seconds grace to bypass the default boot into the most recent kernel.

Some distros will just leave the last one in place and delete the one older than that, some will leave all deletion to the user. But the presence of, at minimum, the kernel you were running immediately before the upgrade means that you can go back to what is expected to be a known good kernel.

Also, the manual boot options include booting any of the available kernels into what would be the equivalent of Windows safe mode in which the system is running single user without starting any more than an absolute minimum of services. It still wouldn't defend against a situation where a bad update affected something outside the kernel which was essential to booting single user because either old or new kernel would pick that up. There is also the possibility o manuallyf issuing parameters to the kernel at boot time. All in all, although no OS is fail-proof here is a great deal more defence in depth than Windows has."".

AIUI one of the issues with the present situation wasn't just that the update downloaded a corrupt data file but that CrowdStrike's SW did not simply reject it and carry on* but crashed and crashed in such a way that it then blocked the rest of boot. That's a double failure for which the corrupt file was only a trigger. This goes against everything we were taught years ago - that problems that can be caught and handled should be caught and handled.

And, of course, don't release an update on a Friday.

" it's evident from the recommended "just delete it" that it the file wasn't essential to normal operation

** I should add that my experience is based on SysV usage - systemd based systems may be less or more robust.

Doctor Syntax Silver badge

Re: The fault's with Microsoft

But ran like the clappers every morning?

Doctor Syntax Silver badge

"This is not a security incident or cyberattack."

Just the same bad consequences.

Doctor Syntax Silver badge

Re: I don't mind people blaming Windows..

"Can you name an operating system that is guaranteed not to fall over when someone with system level access changes stuff?"

No, but can you name one that's guaranteed not to fall over when it changes stuff itself without waiting for someone with system level access to do it?

Doctor Syntax Silver badge

Re: The fault's with Microsoft

"corporate IT still has a checkbox to tick that everyone has protection"

The box-ticking culture! How about, instead of ticking boxes, we start out recognising, evaluating and mitigating risks. Is there a possibility of a supply chain attack on the O/S or 3rd party S/W? How do we mitigate that? Could we test on a sacrificial machine? Should we use some sort of threat detection S/W? If so, is there a a threat of a s supply chain attack on it etc.?

Doctor Syntax Silver badge

Re: The fault's with Microsoft

Please explain the difference between an issue with "system reliability" and a "vulnerability".

One is a subset of the other. A vulnerability is a susceptibility to an external - usually malicious occurrence. Other causes of system unreliability could be all sorts of things from inadequate memory provision upwards.

Doctor Syntax Silver badge

"Isn't this always the case?"

It's the default assumption. What does that tell us?

Doctor Syntax Silver badge

Re: things that are running

"Until 2038… when it will be a surprise for everybody"

Not everybody - just you if don't realise that time_t has mostly - if not entirely - been upgraded to 64-bits already.

Doctor Syntax Silver badge

Re: There's something familiar about all of this...

It depends on whether or not you take precautions. One might be to test before deploying, another might be to wait a day to see if any adverse reports roll in. I guess any Cloudstrike customers who adopted either approach won't be rolling it out today.

Angry admins share the CrowdStrike outage experience

Doctor Syntax Silver badge

Re: Beyond me

"Wouldn't it be a really good idea if those critical keys were somewhere where you could access them"

Somewhere like a write-protected medium locked in the safe.

Doctor Syntax Silver badge

Is it too much to hope that when the dust settles legislators will start requiring that major infrastructure failures will, by statute, be followed up by an inquiry to determine what led up to the incident; decisions were made which impacted release of faulty S/W & so on. Bad decisions, especially those undocumented or made to cut costs would then lead to prosecution of those who made them.

Doctor Syntax Silver badge

In reality it'll be manglement demanding their PCs are fixed first. Those using PCs to actually earn revenue will have to wait.

Doctor Syntax Silver badge

Re: Beyond me

With a bit of luck InfoSec should new be looking at the decision and asking if it increases or decreases the risk.

Second NHS IT system confirmed to be affected by CrowdStrike issues

Doctor Syntax Silver badge

"they cause the top-level CS driver to crash as they're invalidly formatted,"

Fail-safe. Defensive programming. All long gone.

Azure VMs ruined by CrowdStrike patchpocalypse? Microsoft has recovery tips

Doctor Syntax Silver badge

Yes. If you have incremental backups you'll be fine. But my point, which I maybe didn't make plain, was that just restoring a pre-1900hrs backup will, on its own, take you back to the state of work then. It will take further action to recover subsequent transactions, whether by restoring incremental backups or by manual re-entry. If, however, data was entered, say from a website, and there was no separate backup of that then it will have been lost. Is there a chance of recovering information from email acknowledgements or did the restore overwrite outbound emails? The system may even start handing out order numbers duplicating those issued between backup and outage.

Restoring a backup image is only the start of getting back.

Doctor Syntax Silver badge

Always.

Doctor Syntax Silver badge

"The multiple and up to 15 times just makes it even crazier."

It makes the disclaimer easier - if it hasn't righted itself yet then you just haven't tried enough times.

Doctor Syntax Silver badge

"First, if you have a backup from before 1900 UTC yesterday, just restore that."

And accept that you lost any work, orders taken, despatches made, whatever, since then.

Dangerous sandwiches delayed hardware installation

Doctor Syntax Silver badge

"Anyone who was active during the 70s and 80s in the UK will remember innumerable bomb scares - the vast majority hoaxes"

That last part depends on which part of the UK you in.

Doctor Syntax Silver badge

Re: Probably in a tiny room somewhere

Back then it might have been CAT-3

Doctor Syntax Silver badge

Re: Try to keep it culturaly correct please

"In the UK a bathroom contains a bath"

And something that could only be used as a foot-bath doesn't count.

Page: