* Posts by Not now John, I’ve gotta get on with this

2 publicly visible posts • joined 7 Feb 2022

Resilience is overrated when it's not advertised

Not now John, I’ve gotta get on with this

I remember the early days of Sun failover systems, failover worked perfectly but the journal file system wasn't really ready for primetime and nobody used it, so the result was: failover takes 2-3 seconds, the resulting FSCK when the backup machine rips the disks away from the primary takes 4-6 hours! It was almost always quicker to fix the hardware issue in the primary server than to allow a failover to occur.

To err is human. To really tmux things up requires an engineer

Not now John, I’ve gotta get on with this

First day in my first job in a bank my new boss demonstrated how easy it was to remotely manage multiple systems from his desktop. He proceeded to shut down the main NIS server for the trade floor systems instead of the dev box he had intended, queue much screaming as hundreds of clients very slowly switched over to the secondary