Slashdot Log In
Google Opens Up (Some) Search Algorithms
Posted by
CmdrTaco
on Saturday May 24, @08:51AM
from the nobody-else-has-a-million-servers-anyway dept.
from the nobody-else-has-a-million-servers-anyway dept.
overmars writes "After years of closely guarding the formula for its search algorithms, Google is opening up a little.
The search engine company has kept its search formula a closely guarded secret for two reasons: competition and to prevent abuse, said Udi Manber, Google's vice president of engineering, search quality, in a post on the corporate blog. Manber said the blog post is the first part of a renewed effort at the company 'to open up a bit more than we have in the past.'
Manber said the most famous part of Google's ranking algorithm is PageRank, an algorithm developed by Google cofounders Larry Page and Sergey Brin. While PageRank is still in use, it is a 'part of a much larger system,' he said.
'Other parts include language models (the ability to handle phrases, synonyms, diacritics, spelling mistakes, and so on), query models (it's not just the language, it's how people use it today), time models (some queries are best answered with a 30-minutes old page, and some are better answered with a page that stood the test of time), and personalized models (not all people want the same thing),' he said."
Related Stories
Firehose:Google opens up on its search algorithms by Anonymous Coward
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.

Dont do it Google! (Score:1, Interesting)
Re: (Score:3, Insightful)
Re: (Score:2, Interesting)
and
The search engine company has kept its search formula a closely guarded secret for two reasons: competition and to prevent abuse
Secur
Re:Dont do it Google! (Score:5, Insightful)
I took one course in Information Retrieval, and I could come up with most of these things with an evening or two of brainstorming, at least on a general level like this. Ideas like PageRank gave Google the edge in the early days, but now, their advantage lies in other areas. The have a stunning amount of capital tied up in hardware, giving them amazing speed, and amazing amounts of data. They have code optimized to handle those amounts of data in reasonable time. They have the experience to take simple probability models like the ones described in the article, and make them work with those amounts of data.
This is why it's impossible to beat Google at search and other data-based markets. It's not one simple patented idea anymore. If it was just that, Google would've disappeared years ago. The only way to beat the points described above, is to have the capital to buy the hardware, and knowledge to match Google. Microsoft can do that, but Google has one other thing that Microsoft doesn't. They understand their developers. They understand that if you give these kinds of scientist/developers an interesting problem, a fantastic dataset and the freedom to attack it in their own way, you barely even have to pay them anymore. The interest will take over and completely fuel the project. They will work overtime, and come in on the weekends, without being asked.
That will bring energy to a project and a company, that you can never get through any tactic that Microsoft is likely to employ. I admit I don't precisely know what Microsoft is like on the inside, but I simply cannot conceive of them as a company that understands the joy of programming, or the joy of science (which is a huge big part of information retrieval). In any case, one blog post with some sketchy details isn't going to tell Microsoft anything they don't know already.
Reply to This
Parent
Re: (Score:2, Interesting)
Re:Dont do it Google! (Score:4, Funny)
I took one course in Information Retrieval, and I could come up with most of these things with an evening or two of brainstorming, at least on a general level like this.
Reply to This
Parent
Re: (Score:2)
I'll be joining Microsoft full-time this summer; if that says anything.
What exactly is open? (Score:5, Insightful)
What, exactly, has Google opened up? As far as I can see fron TFA all that is explained is on a very general level, with no detail what so ever. I can't see Google's competion gaining any significant benefit from this.
Reply to This
Re: (Score:2, Interesting)
While Rob Enderle puts the matter trollishly, I agree with th
To be fair, he's a VP (Score:2)
Re: (Score:3, Informative)
No, they _used to be_ engineers (Score:2)
Re: (Score:2)
From TFA...
Obviously, this usage of the word "open" is not related to open source software. It's more like he is willing to talk about it at all.
Deus Ex, anyone? (Score:1)
User: What else do you know about me?
Pagerank: Everything that can be known.
User: How about a report on yourself?
Pagerank:I was a prototype for Echelon IV. My instructions are to amuse visitors with
inform
License? (Score:2, Interesting)
At l
Re: (Score:2, Informative)
Re: (Score:3, Informative)
The secret ingredient... (Score:5, Funny)
// Google search algorithm
for (int i=0; i <= numResults; i++)
{
if (results[i].good)
{
show(results[i]);
}
}
//
Reply to This
Re: (Score:2, Funny)
Re: (Score:2)
consider the Pagerank important (Score:2, Interesting)
Mystified by 'the google" (Score:3, Interesting)
Reply to This
Much can be determined by using google (Score:2)
Re: (Score:2)
Re: (Score:2)