Google Outlines the Role of Its Human Evaluators 62
An anonymous reader writes "For many years, Google, on its Explanation of Our Search Results page, claimed that 'a site's ranking in Google's search results is automatically determined by computer algorithms using thousands of factors to calculate a page's relevance to a given query.' Then in May of 2007, that statement changed: 'A site's ranking in Google's search results relies heavily on computer algorithms using thousands of factors to calculate a page's relevance to a given query.' What happened? Google's core search team explain."
Google is PEOPLE (Score:5, Insightful)
Re:Google is PEOPLE (Score:4, Informative)
In reality this is why search engines like Wolfram Alpha without the broad research and knowledge of Google in the industry don't stand much of a chance unless Google drops the ball.
Yeah - but before Google was people, Yahoo was people. Google gets an advantage based on what they're doing. But it doesn't make them invulnerable. Look at the tech industry for the past several decades to see this theme played out again and again.
Re: (Score:2, Funny)
I thought Soylent Green is people.
Re: (Score:3, Interesting)
Re: (Score:2)
Soylent Green is people
Google is people
Therefore Google is Soylent Green
Re: (Score:1)
It's not clear what point, if any, you are trying to make here.
TFA is about Google using humans to improve its results, in a few ways.
Wolfram Alpha derives all of its results from a database that is curated by humans.
There are major differences in their approaches (as indeed there are major differences in what they are trying to accomplish), but the general notion of involving human beings to improve your results is the same.
Re:Google is PEOPLE (Score:4, Insightful)
Re:Google is PEOPLE (Score:4, Insightful)
Wolfram Alpha isn't a search engine.
Saying Wolfram Alpha isn't a search engine is like saying that Linux should be called GNU/Linux. It might be more technically correct (emphasis on might), but it won't change the public's perception of it.
Re: (Score:3, Informative)
Of course they're both search engines, but most people (in my experience and I work in a library) know the difference and don't have any trouble differentiating the two.
Google = search for websites. Wolfram = search for data.
Re: (Score:2)
k' tx
Re: (Score:1)
Yes, I can imagine how that would be true if Wolfram Alpha were COMPETING WITH GOOGLE. Except it's not. Not as a search engine at least. Jesus. Why don't you try a few typical searches on WA before you say stupid crap that ill-informed modmins will mod up?
Try "cheap plane tickets", or "what are pennies made of" in WA. Look at the results. There are none. That's because WA does not do what Google Search does. It wasn't meant to. You know what else it doesn't do? What ebay does. That's right, you can't buy an
Summary, missing from TFS (Score:5, Informative)
Because the summary wasn't kind enough to give you the answer to the question, here it is.
Human evaluators (mostly college students) are trained in the art of validating a search engine result. They examine the results of their searches, and determine which ones are the most highly relevant. For example, searching for the Olympics should yield information about the 2008 Olympics (or any current one) instead of the 1996 Olympics. The reviewers frequently work on the same query results, that way they can see how consistently the reviewers are rating websites.
The vast upshot of this, is that it helps weed out those websites that are cheating the system, and trying to get their website as the #1 google hit, so they can show you ads. So the large part of what they are doing is tracking spam websites, not real ones.
Re:Summary, missing from TFS (Score:5, Insightful)
Re:Summary, missing from TFS (Score:5, Funny)
It's a lot like Slashdot's mod system.
Oh, God, I hope not! Searching for jokes, you'll wind up with completely humorless results. Completely irrelevant pages will end up getting to the top of search results because they were "insightful" or "informative", despite being completely offtopic. All anti-Microsoft, anti-Apple and anti-Linux pages will be completely buried in the search results. And searching for "Apple" or "Linux" will result and you'll wind up with in a bunch of fanboy pages....wait a minute (searches Google)....ARRRGGHHHH! The Slashdot Mods have TAKEN OVER!!!! Ummm....ahem....
I, for one, welcome our new humorless Apple, Linux and Microsoft fanboy overlords!
Re: (Score:1, Funny)
You are most gracious. Now if you'll excuse me, I have Google searches to fix.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2, Insightful)
No matter how limited human review is, I am sure that the notion that real people evaluating a page for relevance is a good thing for all concerned.
In reality I think that most of this review activity will be directed at the 'to good
Re: (Score:2, Informative)
The vast upshot of this, is that it helps weed out those websites that are cheating the system, and trying to get their website as the #1 google hit, so they can show you ads. So the large part of what they are doing is tracking spam websites, not real ones.
Actually, it calls for further explanation, because manual tweaking of results produces bias and legal concerns. As guy from Google said,
We don't use any of the data we gather in that way. I mean, it is conceivable you could. But the evaluation site ratings that we gather never directly affect the search results that we return. We never go back and say, 'Oh, we learned from a rater that this result isnâ(TM)t as good as that one, so letâ(TM)s put them in a different order.' Doing something like that would skew the whole evaluation by-and-large. So we never touch it.
Mankind's knowledge stands on the shoulders of Google, so they can't just hire, say, a thousand students and use this evaluation as an significant weighting factor. It's rather a evaluation of algorithms for the sake of further improvement done fully by algorithms.
College students? Hmmmm... (Score:5, Funny)
Google, for example, employs a vast team of human search "Quality Raters" ... Spread out around the world, these evaluators, mostly college students, review search returns ....
Well, THAT explains a lot of what happens when you set 'safe search' to 'off'...
Pigeon Rank (Score:2, Funny)
How do the human evaluators compare to the pigeons they used before?
Re: (Score:2)
But for how long? (Score:5, Interesting)
Re: (Score:1)
Well, I think the law of diminishing marginal utility applies here: assuming the 'quality raters' are improving the search, assuming the algorithms are being improved with each change, it would probably become economical to drop the quality raters because the search is good enough.
That is, unless they choose to make drastic changes. Personally, I think a more "raw" search like google claims to have is better than a "directed" one, like bing claims to have. Of course, I use the word "claim" in each case...
Re: (Score:1)
There is another possibility: automatic search (attempting to find relevant pages for everyone) has reached a plateau in terms of performance and if you want to do better, you will need to employ raters. Clearly Google would like to find "the next best thing" in Search, but that sounds quite uncertain. Employing lots of people is a much surer way to improve results.
Re: (Score:2)
Re: (Score:1)
+1 insightful
Re: (Score:1)
I imagine that personalized search results were a direct result of realizing that a certain number of good or bad ratings were a strong indicator; I would be surprised if they were not aggregating the clicks from personalized search and feeding it into the generalized results.
Re: (Score:2, Insightful)
Nope, sorry. This might be true if google operated in a static environment, but they are competing. Both against direct competitors, and against people trying to game the system. If they ever came up with a perfect "algorithm" and let it rest, then the SEOs would reverse engineer it, make their useless pages beat every useful page, and then the perfect algorithm would be shit.
Re: (Score:2)
Re: (Score:2)
Maybe the answer is to have several algorithms and change them at random intervals? Kind of like HIV?
Re: (Score:2)
Humans don't scale, so you need automation to make anything like a general search engine. However, you have to verify the algorithms are doing the right thing, even though external factors are evolving. Human evaluators allow more stable experiments to be run that tell you how you are doing, and in turn help you improve your algorithms.
This really isn't that new, other than realizing clever ways to apply traditional manufacturing quality control to natural language / information retrieval algorithm develo
human brain pwns computational intelligence (Score:1)
Don't read this if you're not in a bad mood (Score:1, Offtopic)
Article is somewhat inventive (Score:5, Insightful)
JP: So are these raters college students or random folks responding to a job post? What are the requirements?
SH: It's a pretty wide range of folks. The job requirements are not super-specific. Essentially, we require a basic level of education, mainly because we need them to be able to communicate back and forth with us, give us comments and things like that in writing.
Funny how the introduction restates the interviewer's preconception even though the actual interview implies otherwise.
some personal observations on the program (Score:3, Informative)
-as anything modern in IT, people sign Non-Disclosure Agreements (NDAs) so not a lot can be said from within the circle without breaking its terms. Having read the interview, I see the chief has also pretty much kept it this way, let alone only for the terms that are already publicly disclosed -google operates through 3rd party outsourcers and pretty much all non-essential communication is through them and not google directly, that's why the guy can't tell ya exact number about his posse. the big numbers are probably very correct, but I'm not sure about now. there seemed to be a very big wave of cut-offs and discontinued access for raters about a year ago, a lot of people got the boot and I'm not sure why - my bet is just a sweep of the axe. some were gone for a good reason, others very randomly. -the raters have a few spaces and forums to discuss their work, open to public and with minimal chance for an NDA break. -the raters have mods, too, but I haven't seen activity on that from for a while. -the specifics of the most cases have drawn me to a conclusion that for each surveyed example, there are at least 6 or 7 people working and giving opinions about, before a final decision is drawn, so there is your internal balance and weeding out bad judgement. lemme say it again you cannot single-handedly change Google's opinion about a particular site and particular search term. -about natural language processing - this is the scary part. you cannot imagine how good are these guys, especially their algorithms. from time to time they let us sneak peek at it and let me say we had a look at some betas (or alfa-s) of correct grammar processing and translation MONTHS ahead of their official announcement to the world. you could tell it was machine-made translation, but it was good, scary good. And I'm NOT talking English only, no,no. -the pay -it gets delayed about 6 weeks after month's end but is regular and usually not enough for a living, mainly due to the lack of work. first year it was good, very good, but in 2008 it started getting less and less, which is a shame, since it is a nice way to browse the net and get really paid to do it !
Re: (Score:2, Funny)
Imagine... (Score:2)
Just imagine ... you change a sentence on your companies website and get interviewed why you did it over and over again and people write pages about it.
Search results that don't include search terms (Score:2, Informative)
Re: (Score:2)
What has started pissing me off is the text at the bottom that basically says "you recently searched for 'foo' and are now searching for 'bar', so we combined the two, click here to get your real results for only 'bar'"
Perhaps the reason for my second search was that I wasn't satisfied with my original search terms, so fuck you for adding them on covertly. This was done while not signed into any Google sites...
Fuzzy logic is killing Google (Score:5, Interesting)
Slightly off-topic: Am I the only one who finds Google web search less and less useful? There's no way to really force literal search anymore. Everything I enter gets auto-"corrected". Plus signs, quotation marks or that misleading field "this exact wording or phrase" in Advanced Search used to help, but that stopped working a while ago. Everything is fuzzified now. Is there an alternative or some trick I haven't heard of?
Re: (Score:3, Interesting)
Everything is fuzzified now
First off, I don't think fuzzy logic has anything to do with Google's innards. What you're referring to is more generally called word sense disambiguation, and as far as I know, you can force a literal search by putting it in quotes. You'll also get "fuzzy" results, but if you don't see your literal at the top of the results, it probably doesn't exist and I'm guessing the Goog is trying to be helpful by breaking up the terms. Of course, you could Bing it to verify... :)
Re:Fuzzy logic is killing Google (Score:4, Informative)
Plus signs should still be treated as true literals. Quotation marks don't indicate literality -- they indicate that you really, really care about things like word order and so on within the quotes. It used to be true that quotation marks implied a plus on everything inside them, but that wasn't an intentional feature. The advanced search check box was, AFAIK, just equivalent to sticking everything in quotes.
If you're still seeing fuzzification with a plus sign, something may be a bit screwy, and you should file a bug with a specific broken query. (Of course, if you run the query +wombats and see the word "wombat" highlighted in the snippet, that isn't the same thing -- +wombats was treated literally, so this document really truly matched the word "wombats," it might just also have matched the word "wombat" and the snippet highlighter decided that it made sense, for this particular query, to highlight the term. A bug would be if you found a truly irrelevant document coming up.)
Re: (Score:3, Insightful)
Am I the only one who finds Google web search less and less useful? There's no way to really force literal search anymore.
So true and frustrating! I can't tell you how many times recently I've tried searching for something "SPECIFIC" and not been able to at all. :-(
I would love to know of a useful alternative that searches for what *I* want, rather than what some non-intelligence presumes I might want (and just wastes my time and their resources).
Re: (Score:2)
Am I the only one who finds Google web search less and less useful? There's no way to really force literal search anymore.
So true and frustrating! I can't tell you how many times recently I've tried searching for something "SPECIFIC" and not been able to at all. :-(
I would love to know of a useful alternative that searches for what *I* want, rather than what some non-intelligence presumes I might want (and just wastes my time and their resources).
Yahoo almost always does a better job there, but they don't play very nice with literal searching either. I wish that the search technology commonly used in the mid 90s on, for instance, Lexis Nexis would finally percolate up to mainstream web search engines and replace the primitive grunting, pointing, and shrugging engines used now.
Re: (Score:2)
Re: (Score:2)
Slightly off-topic: Am I the only one who finds Google web search less and less useful? There's no way to really force literal search anymore. Everything I enter gets auto-"corrected". Plus signs, quotation marks or that misleading field "this exact wording or phrase" in Advanced Search used to help, but that stopped working a while ago. Everything is fuzzified now. Is there an alternative or some trick I haven't heard of?
Try bing. Seriously, it seems like it's going to give Google a run for its money.
I, on the other hand, am completely happy with Google.
Oh my god! (Score:3, Funny)
GoogleNet (Score:1)