Telecity refused to comment when The Register phoned them to ask what had happened.
Oh! You managed to get through then?
Telecity has suffered a major outage at one of its London data centres this afternoon, which knocked out a whole host of VoIP firms' services, made Amazon wobble and borked its Direct Connect service. A source told The Register that the outage, which happened at around 2pm, knocked out four floors at Telecity's Sovereign House …
Looks like Amazon AWS Direct Connect customers are also affected:
http://status.aws.amazon.com/
6:47 AM PST We are investigating packet loss between the Direct Connect location at TelecityGroup, London Docklands, and the EU-WEST-1 Region.
7:36 AM PST We can confirm intermittent packet loss between the Direct Connect location at TelecityGroup, London Docklands, and the EU-WEST-1 Region. An external facility providing Direct Connect connectivity to the EU-WEST-1 Region has experienced power loss. We are working with the service provider to mitigate impact and restore power.
Now I can understand an undersea cable being a single point of failure with possibly very widespread implications, but if a power failure on a few floors of a single building can have such widespread consequences, don't we have a deeper problem?
Fortunately for us this only really impacted our test & dev environments and, ironically, our DR capability.
Don't forget the social impacts, which were more serious: I noticed that grumble feeds were intermittent and slow last night, and I'd therefore like to ask Telecity to investigate their backup power provision to prevent this happening again.
Just because you chose to put your production environment in a cloud doesn't mean you get resilience by default, right? Don't you still have to architect your cloud environment the way you would your own physical estate, picking resilient data centres (or regions) for your production workloads?
In Azure, for example, you might tick the geo-redundant option, or make sure your backups go to something that isn't in Azure for DR purposes. If a business jumps on the cloud train and works on the basis that it all "just works" when things go tits up, surely it needs to think again?
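For what it's worth, the "backups to something not in Azure" bit can be as simple as a scheduled job that copies the nightly backup out to a second provider. A rough sketch in Python, where the connection string, container, bucket and blob names are all made up rather than anyone's real setup:

```python
# Minimal sketch: copy a backup blob out of Azure Blob Storage to an S3 bucket
# at a different provider, so DR doesn't hang off a single cloud.
# Names and environment variables below are hypothetical.
import os

import boto3
from azure.storage.blob import BlobServiceClient


def replicate_backup(blob_name: str) -> None:
    # Pull the backup blob down from Azure...
    azure = BlobServiceClient.from_connection_string(os.environ["AZURE_STORAGE_CONN"])
    data = azure.get_blob_client(container="backups", blob=blob_name).download_blob().readall()

    # ...and push a copy somewhere that isn't Azure.
    s3 = boto3.client("s3")
    s3.put_object(Bucket="offsite-dr-backups", Key=blob_name, Body=data)


if __name__ == "__main__":
    replicate_backup("db-nightly.bak")
```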
Both UPS channels went offline in a cascade failure due to loading. Then the transfer to mains was disruptive and the transfer back to UPS failed.
And the second attempt to switch back has also failed and that involved switching it off and on again so it was a proper IT fix, not some bodge.
Currently it's running on utility power. It's not the first time the UPS systems at Sovereign House have gone out like this either...
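That "cascade failure due to loading" pattern is easy to show with back-of-the-envelope numbers: if two UPS channels are each carrying more than half of what a single channel can support on its own, losing one overloads the other. A toy model in Python, with invented ratings rather than anything from Sovereign House:

```python
# Toy model of a 2N UPS pair cascading under load. Numbers are made up
# for illustration; they are not Telecity's.
CHANNEL_RATING_KVA = 500          # what one UPS channel can carry on its own
total_load_kva = 800              # total IT load, normally split across both

channels = {"A": total_load_kva / 2, "B": total_load_kva / 2}  # 400 kVA each


def trip(name: str) -> None:
    """Channel drops out; its load lands on whatever is left."""
    load = channels.pop(name)
    for other in channels:
        channels[other] += load


print(f"Normal running: {channels}")     # both comfortably under 500 kVA
trip("A")                                # one channel faults or is taken offline
print(f"After losing A: {channels}")     # survivor now carrying the full 800 kVA

for name, load in list(channels.items()):
    if load > CHANNEL_RATING_KVA:
        print(f"Channel {name} overloaded ({load} kVA > {CHANNEL_RATING_KVA}), trips too")
        trip(name)

print(f"Surviving channels: {channels or 'none - straight to raw mains'}")
```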
This is one of those areas where you'd think they would be running tests on the system on a regular basis, with N+1 redundancy. If you don't test, you don't know.
But I've run into stuff like this before, where we had a dodgy transfer switch: if you let it sit for a month or two, one of the phases would stick and not flip over the next time you had an outage. But once you tested it... it would be happy as a clam and would switch back and forth no problem. It took monthly tests and metering the panel to finally find and prove the problem. Took over a year to find and solve this issue.
Ever since then... I test and check the voltages on the transfer switch.
Now in this case, if the loads are too high... then I suspect someone goofed and overloaded a single phase or something, so that too much power is being pulled from one leg at a time, which doesn't allow things to come up cleanly. Not a fun situation if you haven't planned for it and don't know how to shed load (i.e. turn off crap...) as you bring things up, letting disks spin up one by one instead of a huge thundering herd.
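The "shed load and bring things up one by one" bit is basically just sequencing against a power budget. A rough sketch, where spin_up() and the wattage figures are placeholders for whatever your kit actually draws, not a real recovery runbook:

```python
# Sketch of a staged power-up: bring disks/hosts back one at a time within an
# inrush budget, instead of letting everything spin up at once.
# spin_up() and the wattage numbers are hypothetical placeholders.
import time

INRUSH_BUDGET_WATTS = 2000        # extra draw we allow at any instant
SETTLE_SECONDS = 10               # time for a device to settle to idle draw

devices = [
    {"name": "disk-shelf-1", "inrush_watts": 900},
    {"name": "disk-shelf-2", "inrush_watts": 900},
    {"name": "db-host-1",    "inrush_watts": 1200},
]


def spin_up(device: dict) -> None:
    # Placeholder for the IPMI/PDU call that actually powers the device on.
    print(f"powering on {device['name']} ({device['inrush_watts']} W inrush)")


def staged_power_up(devices: list) -> None:
    in_flight = 0
    for device in devices:
        # If starting this device would blow the budget, wait for the earlier
        # ones to settle before continuing (i.e. shed the transient load).
        if in_flight + device["inrush_watts"] > INRUSH_BUDGET_WATTS:
            time.sleep(SETTLE_SECONDS)
            in_flight = 0
        spin_up(device)
        in_flight += device["inrush_watts"]


staged_power_up(devices)
```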