VMware Causes Second Outage While Recovering From First

Posted by Soulskill on Monday May 02, 2011 @07:55PM from the third-time's-a-charm dept.

jbrodkin writes "VMware's new Cloud Foundry service was online for just two weeks when it suffered its first outage, caused by a power failure. Things got really interesting the next day, when a VMware employee accidentally caused a second, more serious outage while a VMware team was writing up a plan of action to recover from future power loss incidents. An inadvertent press of a key on a keyboard led to 'a full outage of the network infrastructure [that] took out all load balancers, routers, and firewalls... and resulted in a complete external loss of connectivity to Cloud Foundry.' Clearly, human error is still a major factor in cloud networks."

Re:This is very bad design (Score:4, Informative)

by X0563511 ( 793323 ) writes: on Monday May 02, 2011 @09:19PM (#36006576) Homepage Journal

... which is why you should always use the shift key to wake a display, and never enter. Unless it's a serial link, in which case you have to hit enter and pray the guy before you isn't a sadist.

Re:VMware shows its PR colors. (Score:4, Informative)

by drooling-dog ( 189103 ) writes: on Monday May 02, 2011 @10:41PM (#36007026)

To me it sounds like someone (non-technical) high up in the chain wanted to focus blame on an inadvertent act by one of the engineers. Inadvertent, of course, so no one needs to get fired and file a lawsuit, and an engineer so that no one in upper management appears culpable. The downside is that they dramatically underscore the fragility of their cloud, thereby undermining its acceptance in the market. Not a good tradeoff, if that's the case.

Re:UR DOING IT WRONG! (Score:3, Informative)

by larry bagina ( 561269 ) writes: on Monday May 02, 2011 @10:43PM (#36007034) Journal

Remember how your uncle used to touch you in your naughty place? It was like that.

Re:VMware shows its PR colors. (Score:5, Informative)

by rsborg ( 111459 ) writes: on Monday May 02, 2011 @11:49PM (#36007230) Homepage

VMware's explanation of events is troubling to me. The company as a whole is responsible for any of its failures. Internally the company could blame an individual but to shareholders and other vested entities an individual employee's failure is not something they care about. A better PR response would be to say that "we" made an unscheduled change or simply an unscheduled change was made to our infrastructure that caused X.
"Transparency is bad" +4 Insightful
What the... ?
You know, I'd prefer my vendor/partner (i.e., VMware) doesn't throw their employees under the bus when bad stuff happens. If this happened at Apple or Google the group (leadership taking responsibility) would announce they messed up... not "one of the peons pushed a magic button".
Transparency is only useful as a way to diagnose and improve. This "explanation" from VMware hides all explanation (...touched the keyboard. This resulted in a full outage of the network infrastructure...) while torching a single employee.
