Re: What were they thinking?
On failures:
Person A: I manage one rack and the rack has never failed
Person B: I manage one rack and it failed catastrophically
Person C: I manage 20 racks and 1 of them failed once
Google: We manage 30,000 racks and see material defects make it into production in 1 of every 1,000 cases.
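To put rough numbers on why those four answers differ, here is a minimal sketch. It assumes (my assumption, not something Google stated) that the 1-in-1,000 figure can be treated as an independent per-rack probability of hitting a defect. The function name is made up for illustration.

```python
def prob_at_least_one_failure(racks: int, p: float = 0.001) -> float:
    """Chance that an operator with `racks` racks sees at least one failure.

    Assumes each rack fails independently with probability p
    (an illustrative simplification, not a measured figure).
    """
    return 1.0 - (1.0 - p) ** racks


if __name__ == "__main__":
    # Fleet sizes taken from the post: one rack, 20 racks, 30,000 racks.
    for racks in (1, 20, 30_000):
        print(f"{racks:>6} racks: {prob_at_least_one_failure(racks):.4f}")
```

Under that assumption, someone with one rack will almost always report "never failed", while someone with 30,000 racks is all but guaranteed to see failures. Same underlying rate, very different stories.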
On availability:
Person A: we never have maintenance or hardware failures so we never have downtime
Person B: we patch ASAP and occasionally have hardware failures
Person C: we have some resilience and redundancy but a power issue took everything down last Monday
Google: we see frequent hardware failures, power issues and network outages but the entire system continues to self-heal and provide high levels of global availability.
There are many ways of looking at a problem, grasshopper...
Which answer do you take?