





How Google Routes Around Outages 105
1sockchuck writes "Making changes to Google's search infrastructure is akin to 'changing the tires on a car while you're going at 60 down the freeway,' according to Urs Holzle, who oversees the company's massive data center operations. In a Q-and-A with Data Center Knowledge, Holzle discusses Google's infrastructure, how it has engineered its system to route around hardware failures, and how it responds when something goes awry. These updates usually go unnoticed, but during system maintenance last month a software bug triggered an outage for Gmail."
Just me? (Score:5, Funny)
Was it just me or did anyone else spend a few minutes contemplating how you actually could make a car that did allow you to change a flat while moving?
Re:Just me? (Score:5, Funny)
Re:Just me? (Score:5, Funny)
Just you. I kept thinking about how I could use a car metaphor to describe how google...oh wait.
I kept thinking about derailing a car, before I realized I was on the wrong track.
Re:Just me? (Score:5, Funny)
I was going to reply about mixing metaphors, but then I lost my train of thought.
Re: (Score:1)
oh come on guys, i know posting here is like watching a train wreck, but.. let's stop hopping onto the bandwagon, shall we?
Re: (Score:2)
oh come on guys, i know posting here is like watching a train wreck, but.. let's stop hopping onto the bandwagon, shall we?
Better not keep going down this way. I think the light at the end of the tunnel is a train.
Re: (Score:2)
TSCHOOO TSCHOOOOOOOOOOOOO *SPLAT* OOOOOOOOOOOOO...........
___
filter error error fixer: Lorem ipsum dolor sit amet, consectetuer sadipscing elitr, sed diam nonumy eirmod tempor invidunt ut labore et dolore magna aliquyam erat, sed diam voluptua. At vero eos et accusam et justo duo dolores et ea rebum. Stet clita kasd gubergren, no sea takimata sanctus est Lorem ipsum dolor sit amet.
It's easy (with a little help from google images) (Score:1, Offtopic)
I kept thinking about derailing a car, before I realized I was on the wrong track.
It's easy. (With a little help from Google Images...)
Car [westminstercollege.edu]
Derailer [mcn.org]
Re: (Score:2)
All that aside, having tires is not absolutely critical to driving a car. One time I saw a Honda Civic going just fine down the street with two rims sans tires on the driver's side. Many people have no respect for their cars, but in spite of this being an older car, it was quite disturbing. The guy probably did something to shred his (bald?) tires, didn't have a spare or enough money/patience for a tow/new tires. As usual, no police were in sight.
I think the Internet is better maintained than poor people's
Re: (Score:1)
Re:Just me? (Score:5, Insightful)
I thought about it for approximately 30 seconds. Then I realized that it is a bad analogy. A Google car would have hundreds of redundant wheels, changing one is easy.
Re:Just me? (Score:5, Insightful)
Basically, all this means is Google designs like Mack while everyone else designs like Chrysler...
Re: (Score:2)
Unless it is one of the two front tires.
Actually, the load would still be supported if one or even both the front tires blew. It may not be under control, but that's another story....
Re: (Score:2)
That took you thirty seconds?
Re: (Score:1)
Perhaps you're thinking of a movie from the '70s? (Score:2)
Like maybe this one? [imdb.com]
Re:Just me? (Score:5, Interesting)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
It's not how well the bear dances. It's the fact that it dances at all.
Re:Just me? (Score:4, Informative)
Re: (Score:2)
wow brilliant!
Why was it banned? Is the mousse carcinogenic or something??
Re: (Score:3, Interesting)
Re: (Score:1)
Re: (Score:2)
Was it just me or did anyone else spend a few minutes contemplating how you actually could make a car that did allow you to change a flat while moving?
Retractable axles, like they use on dump trucks. With the wheels off the pavement, changing it becomes trivial. You need at least two sets of drive wheels though for that to work. But that's a closed-system solution. An adaptive solution would be a semi-truck that would deploy a ramp out the back. Drive the car onto the ramp and then effect the fix on the semi-truck. Then return the car to the flow of traffic. It would be dangerous though if it was one of the front tires; At 60 MPH, a normal car tire that h
Re: (Score:2)
Yes, I can do it in just 4 steps:
1. Stop car
2. Jack car up
3. Remove flat
4. Install spare
Can I get a job at Google now?
Re: (Score:1)
Now figure out how to do that without step #1...
Re: (Score:3, Funny)
What are you talking about? I'm in America, and we need to find someone to blame for the flat first....
Then, maybe we can fix it..Got any nails and a hammer?
Re: (Score:2)
I remember something from a Popular Science|Mechnics magazine from the 1950's. This guy had heavily modified a car to keep going, it could be refueled, and change tires while moving. There were small wheels at each corner that could be jacked down and lift that wheel up, and an extended running board. Of course it didn't happen at highway speeds, just a few miles per hr. I think the goal was to cross the country without stopping for some record. Tires and other stuff were stored in a trailer towed behind t
Re: (Score:1)
Re: (Score:1)
You need to contemplate no more, since the answer is in easily digested film format: http://en.wikipedia.org/wiki/The_Big_Bus [wikipedia.org]
I did too... (Score:1)
While technically possible to make such a car, I don't see any practical use for such a system when it's just safer and more efficient to stop the car and replace the wheel.
At first I thought it might be a useful system for an armored car like the presidential limo, but the added weight of another motorized system would
Re: (Score:1)
Re: (Score:2)
Wow, you guys really haven't seen the Big Bus?? Does all that, power by nuclear reactor, and comes with a pool and piano bar.
Re: (Score:2)
Took me about 15 seconds to think about it. Knight-Rider style, load the car (into/onto/via grapple) a specialized truck/platform that exists to fix flats in transit or simply en-route. The car analogy is only bad because it doesn't really define the constraints based on the nature of the data, imo.
Re: (Score:1)
Re: (Score:1)
I know! I know! Pick Me!!! (Score:5, Funny)
Re: (Score:1, Funny)
It treats the routes as damage and censors them.
Re:I know! I know! Pick Me!!! (Score:5, Funny)
It treats the censors as routes and damages them?
Re: (Score:1)
Re: (Score:2)
It goes around the damages and routes the censors.
Re: (Score:1)
It goes around the damages and routes the censors.
It sensors the damage, and rounds the route?
...
Hey, we've almost returned full circle to TFS...
Re: (Score:1)
Google File System Paper (Score:5, Informative)
To those looking for a more in-depth description, check out the technical paper on the google file system:
http://labs.google.com/papers/gfs.html
Had to read it for a search engines course in college, it's pretty darn spiffy.
Re:Google File System Paper (Score:5, Funny)
Re: (Score:1)
If you want to show off
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
any large organization (Score:2)
Re: (Score:2)
People with exotic problems and lots of resources make interesting fantasy material(hence the interest in reading about experimental fusion widgets and submarines and satellites and stuff).
People with common problems and few resources are mostly human interest/commiseration fodder("and that is how Mr. Bitter Q. Sysadmin keeps the department running for under $500/quarter, using nothing more than terrible coff
Re: (Score:1)
"a pile of paperclips running Gentoo Linux"
Can I get one of those? Does it move around when you are compiling something? At least my wife would think that would be worth staring at instead of a bunch lines of text when doing my daily 'emerge --sync && emerge -uDNav world'.
Re: (Score:2)
Re: (Score:3, Funny)
Easy (Score:2, Flamebait)
Google treats outages like damage and routs around it.
Video of the car analogy (Score:5, Funny)
Excellent use of the car analogy, especially since it is possible to change a tire while driving a car. Youtube video [youtube.com] at 1:48.
Slightly..ahem... OT so posting anon.
Re: (Score:2)
Excellent use of the car analogy, especially since it is possible to change a tire while driving a car. Youtube video [youtube.com] at 1:48.
Slightly..ahem... OT so posting anon.
lol, all I can think of those drivers saying after they read that quote is... "ahahhaha... too easy! too easy!"
Article doesn't really say anything. (Score:5, Interesting)
You know, the article read like a press release. Hasn't slashdot whored itself out enough lately on these kinds of things? Google is so ultra-reliable, blah blah, 24x7, blah blah, commitment, blah blah, premier service partner, blah blah... I get that kind of talk enough in staff meetings. Where's the meat already!?
Why not write an article with some nice graphics saying what happens to my request from the time I hit "Search" to the time I click a result. List off all the servers it goes through, their roles, how they're monitored, etc. Give examples of failure and show the mode decisions the software makes (and where this software is running) -- show the latencies and other performance impacts as my request bounces over failure after failure. That's what I expect when I pull up an article entitled "How Google Routes Around Outages". Something useful, professionally enriching, intellectually stimulating, etc. In short, tell me why I (should) never see a "500 Internal Server Error" from Google, but I do from just about every other major website I've used.
Re: (Score:2, Insightful)
Re: (Score:3, Interesting)
YMBNH.
This has been happening since as long as I've been lurking slashdot (2000?), and didn't go away once I set up an account (2002? maybe 2003). And from the YMBNH posts I saw when I began lurking, this has apparently been an issue since the beginning (or shortly thereafter).
At any rate, complaining about it won't do much good. There's a saying maybe it might help you to repeat:
Re: (Score:2)
Is that an Ignatian prayer?
Re: (Score:2)
I have to admit I was disappointed too. We recently had our colocation facility fall down on the job (turns out they have no alternate way to contact everyone should their internet fail) and I was hoping to get some insight into setting up hot sites, how a site should determine whether it can see the internet or not (clearly I can't just ping google anymore), and other things that would be useful from a technical perspective.
Instead I get "white box" and "black box" monitoring, and I have yet to figure out
Re: (Score:2)
Re: (Score:1)
Simple, really... (Score:5, Informative)
The key point:
When they get an outage, they check how it was caught and if it wasn't caught automatically, they figure out how to next time. Simple rule: They learn from their mistakes and don't put all their eggs in one basket.
Re:Simple, really... (Score:5, Funny)
Re: (Score:1, Flamebait)
Only because your mistake was NOT drugging the bitch.
Re: (Score:2)
I agree, if you keep making the same mistake over and over again, you learn to deal with it better, it is less of a surprise, and you are more prepared for it. You can become a real expert at that mistake, which keeps on happening. If you stop making this mistake, you might start making new mistakes....well, who knows how to fix those?
Re: (Score:2)
But what we have here is, many baskets but with only one egg, and the problem is to make sure the egg going into the right basket "AND" getting back out from that basket while number of baskets grows!
What it sounds to me is that Google intelligence is not redundancy but rather granular task assignment with each task in mind that level of fault tolerance in result should be greater than risk of failure in result as it grows/scales.
For example (simplistic), raid 10 vs. raid 6 comes to mind. Both tolerant to
Re: (Score:1)
Simple rule: They learn from their mistakes and don't put all their eggs in one basket.
Yes they do. Its called the Earth.
Changing tires (Score:3, Interesting)
akin to 'changing the tires on a car while you're going at 60 down the freeway,'
This is not so hard. Just design the car with 4 axles instead of 2 and lift one off the road at a time. Helps if it can swivel for easy access to the lugnuts.
Re: (Score:1)
akin to 'changing the tires on a car while you're going at 60 down the freeway,'
This is not so hard. Just design the car with 4 axles instead of 2 and lift one off the road at a time. Helps if it can swivel for easy access to the lugnuts.
I tried that once, but before I had even produced isometric drawings my tire had shredded, and my rim was trashed...
Inspiration taken from the same thing it runs on! (Score:5, Insightful)
Replacing a wheel on a car going 60mph (Score:2, Informative)
Watch from 1:55 to 2:35:
Youtube video of guys replacing a wheel on a car while it is moving.. [youtube.com]
Re: (Score:1)
That only works on the non-driving wheels of two-wheel drive vehicles though. To offer a quick analogy, if Google were a car, then Google would only be able to replace those servers that are not responsible for helping drive the search engine forward on the Internet superhighway.
Recursive googling (Score:1)
I'm sure they just do exactly what I do when I'm at work and have a problem: they google for an answer to the problem at hand.
Oh, wait.
The above sort of leads into explaining my fear of asking google "is google alive" and the ensuing apocalypse.
Urs Holzle??? (Score:2)
Re: (Score:2)