Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
Google Businesses The Internet Math

The Math Behind PageRank 131

anaesthetica writes "The American Mathematical Society is featuring an article with an in-depth explanation of the type of mathematical operations that power PageRank. Because about 95% of the text on the 25 billion pages indexed by Google consist of the same 10,000 words, determining relevance requires an extremely sophisticated set of methods. And because the links constituting the web are constantly changing and updating, the relevance of pages needs to be recalculated on a continuous basis."
This discussion has been archived. No new comments can be posted.

The Math Behind PageRank

Comments Filter:
  • Pagerank is cool (Score:0, Interesting)

    by pap3rw8 ( 962737 ) on Wednesday December 06, 2006 @08:01PM (#17139252) Homepage
    Whatever google's doing with PageRank, it seems to be doing it right. At least from my experience.
  • Bad summary (Score:5, Interesting)

    by Knights who say 'INT ( 708612 ) on Wednesday December 06, 2006 @08:06PM (#17139318) Journal
    The article specifically says the PageRank eigenvector is only recalculated once a month, approximately. Even though Google uses some clever numerics to calculate the eigenvectors to a 25 billion by 25 billion matrix by iteration, it still takes several hours to finish.
  • by CrazyJim1 ( 809850 ) on Wednesday December 06, 2006 @08:10PM (#17139396) Journal
    I skimmed the article and didn't find what I wanted to find. If you make a webpage that you want ranked high, what do you do? Do you make 100 geocities accounts and provide links to your main website, or what? I'm just wondering this out of curiosity, not out of need.
  • Does PageRank count? (Score:2, Interesting)

    by matr0x_x ( 919985 ) on Wednesday December 06, 2006 @08:22PM (#17139570) Homepage
    As a self proclaimed SEO expert - I honestly don't believe PageRank counts nearly as much as it did a few years ago! You'll find lots of PR5 sites ahead in the SERPS of PR9 sites!
  • by Anonymous Coward on Wednesday December 06, 2006 @08:37PM (#17139744)
    It's not secret [google.com].
  • by TheLink ( 130905 ) on Wednesday December 06, 2006 @08:58PM (#17139938) Journal
    "if you break Google's rules about displaying the same content to bots as to humans"

    I notice many sites that do that and don't get slapped down - esp subscription sites. And seems Google doesn't cache those, so its probably collusion.

    You see the keywords and paragraphs in the search, but click on it you get a login page.

    They should have to pay a special rate be marked differently from the other search results. It's a waste of time otherwise.
  • Re:Pagerank is cool (Score:5, Interesting)

    by silentounce ( 1004459 ) on Wednesday December 06, 2006 @09:02PM (#17139982) Homepage
    Interestingly enough, google thinks so, too. [google.com]

    Of course, yahoo has its own opinion. [yahoo.com]
     
    Although, altavista seems to almost agree. [altavista.com] Check the second non-advertised result.
     
    I do find this [google.com] amusing though. Third place, how humble.
     
    I didn't expect such interesting results. The site with the search term in its url was tops for av and yahoo, but not google. Yahoo ranked the wiki entry above google, but av reversed that decision, google of course thought itself was more important than the wiki. Google's own reference site was number one in its own search and near the top in the other two, but pagerank.net wasn't even in the top 10 for google's search. I'm not sure what conclusions can be drawn from all that, but it is definitely food for thought.
  • by oni ( 41625 ) on Wednesday December 06, 2006 @09:39PM (#17140342) Homepage
    I notice many sites that do that and don't get slapped down - esp subscription sites.

    I wonder, if I changed my useragent to be whatever the googlebot reports itself to be - would I get by the registration screen on websites like the NYTimes??
  • by Anonymous Coward on Wednesday December 06, 2006 @10:24PM (#17140680)
    There's only two that really reflect the power of Pagerank: Click here. [google.com]
    About 1.2 billion pages, and surprise surprise, Acrobat Reader tops the list, followed by a who's who of internet applications and plugins. But around result #30 it gets a bit more interesting, and when you're a few dozen pages in, "new patterns begin to emerge."

    And to explain why not to use "click here", I found this [w3.org] buried on page 45. Thanks for the proof pudding guys, it's delicious.
  • by suggsjc ( 726146 ) on Thursday December 07, 2006 @10:52AM (#17145688) Homepage
    Here is an email with associated response I received from Google on roughly this topic.

    This is a very general question. I'm creating a website. It is going to be a blogging platform. Obviouslly, the content of the site(s) is the most important thing. I've already started making the content of my site dynamic in the sense that I tailor it to the requesting agent (via the user-agent header). My intention for doing this is to make sure that the content renders correctly for *any* browser that accesses the site. I've built the site modularly, so tailoring the content to the requesting agent isn't a big deal. However that leads me to my question(s) and the reason I am emailing you? FYI, I have no ulterior motives for being able to tailor my content, other than making sure that the user get the most usefull information.

    That said. When a "bot" (ie your crawler) accesses my site. I'm going to treat it like I would a mobile browser. I'm going to give the minimal markup and the css will be very simple. I'm going to make sure that my content comes before my navigation, advertisements, etc in the source.

    My real question is does the fact that I'm presenting you the content of my site differently from other browsers make a difference? If so, (then again my reasoning is to make sure that my users get the correct content) how do I prevent this from hurting me in your rankings? If not, then how do you protect yourself from other sites taking advantage of this "hole"? Meaning I could make my site appear legit when I knew your bots were crawling me, but give "alternate" content when real users were visiting.

    Last question. Do you have any idea when your ads will work correctly with xhtml?
    Hope you weren't expecting a straightforward answer (like I was), because here is what I got back

    Hello Jonathon,

    Thanks for your email about the website you're creating.

    First, since you asked when our program will support XHTML, I wanted to
    let you know that we're unable to say if we'll support XHTML pages in the
    future.

    While the AdSense team isn't able to answer your questions about your
    site's ranking in the Google search index, I'd recommend visiting
    http://www.google.com/support/webmasters . I also wanted to let you know
    that our advertising programs are independent of our search results.
    Participation in AdWords and AdSense doesn't affect inclusion or ranking
    in the Google search index.

    I've also included answers to some of the most common questions AdSense
    publishers have asked.

    How can I improve my site's ranking?
    Answer:
    http://www.google.com/support/w ebmasters/bin/answer.py?answer=34432&hl=en_US

    H ow do I add my site to Google's search results?
    Answer:
    http://www.google.com/support/w ebmasters/bin/answer.py?answer=34397&hl=en_US

    M y site is no longer included in the search results. What happened?
    Answer:
    http://www.google.com/support/ webmasters/bin/answer.py?answer=34443&hl=en_US

    Why doesn't my site show up for a specific keyword?
    Answer:
    http://www.google.com/support/w ebmasters/bin/answer.py?answer=34434&hl=en_US

    F or additional questions, I'd encourage you to visit the AdSense Help
    Center (http://www.google.com/adsense_help), our complete resource center
    for all AdSense topics. Alternatively, feel free to post your question on
    the forum just for AdSense publishers: the AdSense Help Group
    (http://groups.google.com/group/adsense-hel p).

    Sincerely,

    Jake
    The Google AdSense Team
  • by namco ( 685026 ) on Friday December 08, 2006 @11:24AM (#17162422)
    I've seen links on google searches that don't exist anymore but were ranked highly when they DID exist and still exist in the top 10 of the query. What happens to those? Do they stay at their ranking till they get overtaken by other more popular pages on the same search? Get their ranking slowly reduced because they don't exist?

This file will self-destruct in five minutes.

Working...