The Math Behind PageRank
anaesthetica writes "The American Mathematical Society is featuring an article with an in-depth explanation of the type of mathematical operations that power PageRank. Because about 95% of the text on the 25 billion pages indexed by Google consists of the same 10,000 words, determining relevance requires an extremely sophisticated set of methods. And because the links constituting the web are constantly changing and updating, the relevance of pages needs to be recalculated on a continuous basis."
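For anyone who wants to poke at the idea, here is a minimal sketch of the commonly published PageRank formulation (power iteration with a damping factor), not whatever Google actually runs in production. The function name `pagerank`, the damping value 0.85, the tolerance, and the four-page `toy_web` graph are all assumptions made up for illustration.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Power iteration over a dict mapping each page to the pages it links to."""
    # Collect every page that appears as a source or as a link target.
    pages = sorted(set(links) | {q for targets in links.values() for q in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}  # start from a uniform distribution
    for _ in range(max_iter):
        # Rank held by dangling pages (no outlinks) is spread evenly over all pages.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        new = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        # Each page passes its damped rank on to its targets in equal shares.
        for p in pages:
            targets = links.get(p, [])
            for q in targets:
                new[q] += damping * rank[p] / len(targets)
        delta = sum(abs(new[p] - rank[p]) for p in pages)
        rank = new
        if delta < tol:  # stop once the ranks have settled
            break
    return rank


# Hypothetical four-page web: D links out, but nothing links to D.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(toy_web)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 4))
```

When links change, the ranks have to be recomputed; starting the iteration from the previous ranks instead of the uniform vector is one plausible way to make that cheaper, though the sketch above does not do it.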
Pagerank is cool (Score:0, Interesting)
Bad summary (Score:5, Interesting)
I joke a lot on Slashdot, but serious question (Score:3, Interesting)
Does PageRank count? (Score:2, Interesting)
Re:PageRank doesn't seem to be based on keywords (Score:3, Interesting)
Re:I joke a lot on Slashdot, but serious question (Score:3, Interesting)
I notice many sites that do that and don't get slapped down, especially subscription sites. And it seems Google doesn't cache those, so it's probably collusion.
You see the keywords and paragraphs in the search results, but when you click through you get a login page.
They should have to pay a special rate or be marked differently from the other search results. It's a waste of time otherwise.
Re:Pagerank is cool (Score:5, Interesting)
Of course, yahoo has its own opinion. [yahoo.com]
Although, altavista seems to almost agree. [altavista.com] Check the second non-advertised result.
I do find this [google.com] amusing though. Third place, how humble.
I didn't expect such interesting results. The site with the search term in its url was tops for av and yahoo, but not google. Yahoo ranked the wiki entry above google, but av reversed that decision; google, of course, thought itself more important than the wiki. Google's own reference site was number one in its own search and near the top in the other two, but pagerank.net wasn't even in the top 10 for google's search. I'm not sure what conclusions can be drawn from all that, but it is definitely food for thought.
Re:I joke a lot on Slashdot, but serious question (Score:5, Interesting)
I wonder, if I changed my useragent to be whatever the googlebot reports itself to be - would I get by the registration screen on websites like the NYTimes??
Re:The two that matter (Score:1, Interesting)
About 1.2 billion pages, and surprise surprise, Acrobat Reader tops the list, followed by a who's who of internet applications and plugins. But around result #30 it gets a bit more interesting, and when you're a few dozen pages in, "new patterns begin to emerge."
And to explain why not to use "click here", I found this [w3.org] buried on page 45. Thanks for the proof pudding guys, it's delicious.
Re:I joke a lot on Slashdot, but serious question (Score:3, Interesting)
Pages that don't exist anymore (Score:2, Interesting)