Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Google to use TrustRank for News, Possibly More

Posted by Zonk on Fri Apr 29, 2005 01:56 PM
from the no-more-trusting-the-weekly-world-news dept.
mike slaven writes "In a follow-up to Tuesday's post about Google registering a trademark on the term TrustRank, an article on NewScientist explains how Google plans to track the credibility of news sources. The article also mentions that the patent on TrustRank is not limited to ranking just news stories: 'The patent also reveals that the same system could be roped in to rank other search results, not simply news. So sales and services could in the future be listed on the basis of price and the reputation of the company involved.'"
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • From what I have read about trust rank, the basic premise is that they pick 200 or so "trusted sites". The trust rank for any page is then basically the number of link hops to the page from a trusted site.

    The problem I have with this is that there are many problems with identifying trusted sites and maintaining the trustworthiness of such sites after they have been chosen.

    From Google's point of view, a trusted site would have to have strict editorial standards and link to a lot of sites. I can think of a lot of sites with strict editorial content, but they generally do not link to a lot of sites. The open directory projects seems to me to be a candidate for a trusted site [dmoz.org]. It has editorial controls and links to a heck of a lot of sites.

    The first question to ask is: "After the trusted sites is chosen, how much would it cost to buy one?". I suppose dmoz itself would be hard to buy outright, but how much would it take to buy one of the editors, or to buy an editorial position? Probably not much. Dmoz alread has a lot of editorial fraud and it would make the problem worse. I'm not sure that its fair to expect trusted sites not to degrade to some extent.

    The second question to ask is: "How hard is it to buy links from trusted sites?". The answer has to be that it is pretty easy. Forget about corrupting the people as I discussed in the last point. Any trusted site that links to lots of pages is going to have a huge link management problem. Every day hundreds of domains that it links to may expire. You can snap those up and buy trust.

    All this doesn't even include folks who make sites look trustworthy with the sole intention of turning them to the dark side later. All of this happens currently with pagerank, but it will be much worse once the trust power is put into the hands of a few.

    --
    Exchange Rate Currency Calculator [ostermiller.org]

    • Interesting points, but I find it hard to believe that Google, (or any other competent company implementing something like this,) hasn't had the forethought to avoid these pitfalls. We still don't know entirely how trust rank works, mind you.
      • I find it hard to believe that Google, (or any other competent company implementing something like this,) hasn't had the forethought to avoid these pitfalls.

        I don't know, it seems like making an assumption like that is not really a good idea. I think a company like Google certainly has shown competent decision making in the past, but we should never accept something at face value just because of an apparent pattern of competence. Question everything!
    • The open directory projects seems to me to be a candidate for a trusted site.

      You know, that's exactly what I thought, too. In fact I can't think of a better site to use. Being an open directory editor myself, I'm pretty familiar with the rules and guidelines for editors, and, let me tell you, the open directory people take what they do very seriously. Of course, you'd really want to have multiple directories like this to really utilize Trust Rank.
    • Simple solution, don't publish who you trust, just publish the results.
    • by twbecker (315312) on Friday April 29 2005, @02:21PM (#12386716)
      From what I have read about trust rank, the basic premise is that they pick 200 or so "trusted sites". The trust rank for any page is then basically the number of link hops to the page from a trusted site.

      I was gonna mod you down, but I'll post instead. If you'd have RTFA, you'd have seen that the above statement is totally false. There are lots of criteria they are using to generate the number, of which internet traffic is only 1. To quote TFA:
      he database will be built by continually monitoring the number of stories from all news sources, along with average story length, number with bylines, and number of the bureaux cited, along with how long they have been in business. Google's database will also keep track of the number of staff a news source employs, the volume of internet traffic to its website and the number of countries accessing the site.

      Google will take all these parameters, weight them according to formulae it is constructing, and distil them down to create a single value. This number will then be used to rank the results of any news search.
      • Which is even worse that just picking 200 or so 'trusted sites'.

        Time and again, important news stories put out through blogs are ignored by the big companies with lots of 'reporters' and lots of stories regurgitated from the AP newswire.

        This is nothing more than an idiotic push for more 'corporate' news in an attempt to drown out the smaller people.
  • Trustrank? (Score:5, Funny)

    by awhelan (781773) on Friday April 29 2005, @01:57PM (#12386443) Homepage
    Can we know this for sure? Has google confirmed it? What is the trustrank of newscientist.com ?
  • by Anonymous Coward on Friday April 29 2005, @01:58PM (#12386454)
    1. Google
    2. Apple
    3. Firefox
    4. Your Rights are being stolen!
    5. Microsoft sucks.

    Rinse. Repeat.

  • by lildogie (54998) on Friday April 29 2005, @01:59PM (#12386461)
    I'll be fascinated to see how Slashdot articles get ranked.
  • Good. (Score:5, Interesting)

    by Telastyn (206146) on Friday April 29 2005, @01:59PM (#12386465)
    Now for the gamespy implimentation to see what servers are filled with lamers... Now that'd be useful!
  • Some points (Score:5, Informative)

    by Sv-Manowar (772313) on Friday April 29 2005, @02:00PM (#12386480) Homepage Journal
    The basic ideas and principles of TrustRank:
    http://www.seobook.com/archives/000661.shtml [seobook.com]

    I can see dangers in this becoming an elitist thing though, i.e Trusted Sites only linking to other trusted sites, or creating perceptions in surfer's minds. It will be interested to see how Google develop this one, that's for sure.
    • If the sites selected as parents are hand monitored well enough then google should be able to pick up on them, for example, selling off links for profit. Seeing as there shouldn't be too many, this could be entirely possible if spread across the workforce.
  • Basically what they patented (from what I gather) is the idea of taking many factoids about a news company and putting it into a balanced formula to create a "trust" number.

    However, they are not patenting the formula itself, but just the idea of using such a formula, it seems...

    This seems like another case of taking an obvious idea and trying to block the competition by patenting it.
  • If you just get one link from a "trusted" site to some really un-trusted site, then POW! Down goes Frasier! I guess we'll know when that happens, because the top Google news stories will be teenage sluts, casinos, and cialis.
  • Astroturf (Score:5, Insightful)

    by Doc Ruby (173196) on Friday April 29 2005, @02:07PM (#12386540) Homepage Journal
    "Trust, but verify." - Ronnie "Rayguns" Reagan
  • The idea isn't exactly new...refereed and peer-reviewed professional journals have been doing something along these lines for decades. Google, as I read it, is attempting to apply some basic scholarly principles to their listings.

    Now the bad news...I don't think it will work as described. Counting "number of bureaux cited" or hops from originator isn't exactly going to keep faked articles off their listings; as someone else already pointed out, it may be a matter of shucking out the cash for a domain that suddenly gets listed as "high trust level." Then that domain can be used to post pretty much whatever they want to make people think is accurate journalism.

    Sorry...I like the concept of automating the tedious task of fact-checking, but until Google can get their clusters' AI well beyond the Turing-test phase, it's not going to happen. Humans will still need to examine articles and references and make decisions as to whether a particular submission is, in fact, valid and accurate.

    Just my two cents' worth...save up the change for a root beer or something...

  • by Anonymous Coward on Friday April 29 2005, @02:08PM (#12386565)
    foxnews.com just disappeared entirely from the Internet.
  • Do you really need Google for this? Or is Google validation going to substitute for your own common sense?
    • by lucabrasi999 (585141) on Friday April 29 2005, @02:18PM (#12386686) Journal
      your own common sense?

      Too much work. It's just easier to do what Google tells me to do.

    • by 2short (466733) on Friday April 29 2005, @03:59PM (#12387741)

      I don't intend to substitute Googles validation for my common sense. But when I go to Google for news, and click on one of the top few stories on a particular topic, I want to have some expectation that I'm probably getting a reasonably trustworthy news source.

      Currently I do this by scanning for one I've heard of. Washington Post, New York Times, Wall Street Journal, BBC News, hell, even Al Jazeera and CNN: All of these may have biases, but I have some idea what they are, and they can all be expected to try for accuracy about the basic facts.

      For more obscure stories, this doesn't necessarily work. So I'm perfectly glad to hear Google will be making some attempt to weed out those "news" sources which, to put it kindly, do not try for accuracy about the basic facts.
  • ... didn't Al Franken write that book already?
  • by lbmouse (473316) on Friday April 29 2005, @02:17PM (#12386678) Homepage
    Did they copyright "TrustRank" or "TrustRank Beta"?
  • Well, this is (half) a shameless plug but my newsbot [memigo.com] has been ranking news sources and referring meta-news sources according to "trust" for over 3 yrs now. Findory [findory.com] (no affiliation) does something similar by ranking each individual news story.
  • How will this affect all those litigious bastards or miserable failures out there?
  • Well, the New York Times is one of the most trusted publications in the country. Oh, wait it isn't any more.
  • by kin_korn_karn (466864) on Friday April 29 2005, @02:25PM (#12386764) Homepage
    So Google wants to be the web's authoritative source on what sites can be trusted? I don't think I like the sound of that. No one entity should have that job.

    Ideally everyone would use common sense, but so few people have that anymore that it's almost quaint to suggest it.
  • by G4from128k (686170) on Friday April 29 2005, @02:26PM (#12386773)
    Given that ALL news sources or sites have some bias, isn't trust in the eye of the beholder in many cases? Although the objective facts of some situation may be undisputed (usually these are disputed, too), the interpretation of those facts is subjective and highly dependent on the viewpoint, world model, etc.

    Perhaps Google will need to introduce right-wing and left-wing versions of TrustRank. If it does not, then it will be an example of tyranny of the majority when Google asserts than the majority's bias is trustworthy.
  • Trusted by whom (Score:4, Insightful)

    by hey (83763) on Friday April 29 2005, @02:27PM (#12386776) Journal
    Reblicans trust The Free Republic [freerepublic.com] but distrust The Nation [thenation.com].
    Democrates are exactly the opposite. What should TrustRank do about that? I don't see any way to reconcile stuff like that.
  • Algorithm for Trust (Score:4, Interesting)

    by wronski (821189) on Friday April 29 2005, @02:31PM (#12386822)
    There are two main ways of ranking news (as well as websites and whatnot): Popularity or Quality. Popularity can be easily quantified, but with news you will probably end up with big and well known media source dominating the ranking. Regardless of what one thinks about CNN or BBC, they are not what I look for in googlenews (or I would just type www.cnn.com). I want news sources that I was anware of, to give me a fresh perspective.

    Quality on the other hand is very hard to measure, and any definition will surely be controversial. The metrics they are proposing will also benefit large well knwown news sources.

    What I would really like though is a rank that gave the widest possible perspective. Some algorithm that would take a news event and define some broad categories of news sources (say, on news on Iraq, conservative american media, liberal american, iraqi, arab, etc.). Then one or two representatives of each category would be displayed in the ranks, choosen among the cotegory by the metrics described in the article. The trick of course is to define the categories, but I think one could do that by looking at how different sources 'cluster' together. Sources in one country link overwhelmingly among themselves rather than abroad. Conservative news sources tend to cite and link to other conservatives, ditto for liberals or any other category. This is even more true for blogs, which wouldn't be much cited in the rank itself, but are a sort of glue that binds ideological and national categories together, and thus provide useful information to help classify the news source.

    I hope I'm making sense here. Just my 2cents...
  • by GPLDAN (732269) on Friday April 29 2005, @02:41PM (#12386918)
    IF author = 'Enderle' THEN TRUST=ZILCH AND BONEHEAD = 1.

    IF author = 'DiDio' THEN TRUST=NADA AND MICROSOFT_SHILL = 1

    IF topic = 'SCO' THEN RELEVENCE TO ANYONE = RELEVANCE TO ANYONE - 1000

    IF quoted_source = 'Marc Andressen' THEN WHO_FUCKING_CARES = 1

    IF news_organization = 'FOX' then BULLSHIT_IS_AFOOT = 1.
  • by John Hasler (414242) on Friday April 29 2005, @02:46PM (#12386983)
    From the article I conclude that the old Soviet Union's news services would have ranked very high.

    Oh, well. Google News was nice while it lasted.
  • by Qwavel (733416) on Friday April 29 2005, @02:51PM (#12387045)
    The internet provides us with such unlimited info that the problem is no longer finding information, it is deciding who to trust.

    Everyone probably has some internal list of sources they trust, but maintaining such a list is very hard.

    If TrustRank could be done and done right (???) then it would be a wonderful service.

    But, can any corporation be trusted to do this? Surely they would eventually start to 'sell' higher trust ratings? I 'trust' google, but it is still a corporation and it's job is to make money.
  • by Heraklit (29346) on Friday April 29 2005, @03:08PM (#12387225) Homepage Journal
    There will be political implications of this new ranking system. I bet nobody has really thought this through...

    What "trust rank" would you give Al Jazeera, for example?

    It is a state-owned, but journalistically largely independent big satellite network with a lot of staff and a huge audience. Just from an area of the world that might have different views.
    • MOD PARENT UP. (Score:4, Insightful)

      by HishamMuhammad (553916) on Friday April 29 2005, @04:08PM (#12387838) Homepage Journal
      This will be a very interesting thing to watch.

      I always paid attention to which news sites does the Google algorithm chooses to use as a primary source to a news item, and they are, more often than not, American sites. Anyway, news.google.com has been an useful source, since you can click on "all 1,777 related" and browse through the headlines and see the varying oppinions around the world. Example: this one was picked as a headline:

      Bush Promotes New Plan for Social Security as Benefit to Young

      In the other headlines, you see other views:
      • Bush Speech Fuels Capitol Debate Over Social Security's Future
      • Bush on offensive as ratings hit floor
      • Bush Plan Would Cut Benefits
      • Bush pitches plan to fix Social Security finances
      • For first time, Bush backs benefit cuts to help bolster Social ...
      • Bush vs. The Press
      And from here you can read articles from the various dissenting views. To me, that's the main utility of news.google.com. I wonder if TrustRank is going to start favoring sites such as CNN... after all, they're "America's most trusted news source" (according to themselves, at least).

      In the end, nothing beats reading the news from a number of different news sources. Unfortunately, most people don't do that.
  • Sites that I trust (Score:5, Interesting)

    by sapped (208174) <mstore1.yahoo@com> on Friday April 29 2005, @03:27PM (#12387403)
    How about over time Google learns which sites I trust and modifies the ranking according to that. We could give each site a thumbs up or down rating like you do on a TiVO.
    • Re:Patent? (Score:2, Funny)

      by Anonymous Coward
      algorithm (v, tr) - a sequence of operations to perform a task (modern, from "Al Gore, Rhythm", Albert Gore Jr (1946-?) being a popular politician and American Vice President (1992-2000) reknowned for his command of logic, co-joined with "Rhythm" to infer repetition.)
    • Google is important because millions of people use it multiple times per day. Many people don't even type URLs into their browsers directly anymore... they just enter website names into the google toolbar or homepage.

      So in short, knowing about what makes a #1 result a #1 result is critically important. We are moving from a phase where relevance & the work of hucksters looking for higher rankings drove search ranks to a new era, where "credibility" and political considerations will drive the results.