I remember the early days of Sun failover systems, failover worked perfectly but the journal file system wasn't really ready for primetime and nobody used it, so the result was: failover takes 2-3 seconds, the resulting FSCK when the backup machine rips the disks away from the primary takes 4-6 hours! It was almost always quicker to fix the hardware issue in the primary server than to allow a failover to occur.
Posts by Not now John, Iâve gotta get on with this
2 publicly visible posts • joined 7 Feb 2022
Resilience is overrated when it's not advertised
To err is human. To really tmux things up requires an engineer
Monday 7th February 2022 09:05 GMT
First day in my first job in a bank my new boss demonstrated how easy it was to remotely manage multiple systems from his desktop. He proceeded to shut down the main NIS server for the trade floor systems instead of the dev box he had intended, queue much screaming as hundreds of clients very slowly switched over to the secondary