Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Google Reacts to Splogs

Posted by ScuttleMonkey on Sat Aug 20, 2005 05:26 PM
from the spam-is-everywhere dept.
labnol writes "Recently, Mark Cuban of Icerocket made the accusation that Blogger is by far the worst offender when it comes to Spam Blogs. Now Google Blogger is introducing Word Verification for user comments to prevent comment spam and another feature called Flag As Objectionable where users can report blogs with questionable content. Google appears to be listening."
+ -
story
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • Finally (Score:4, Interesting)

    by ImaLamer (260199) <john.lamar@gmail . c om> on Saturday August 20 2005, @05:29PM (#13363760) Homepage Journal
    I've been writing to Blogger/Google about a lot of fake blogs for a while and I'm glad to see Flag as Objectionable come into play. After a while I just got tired of doing it and stopped.

    Up until now there was nothing they or the surfer could do - good work Google.
  • by Julian Morrison (5575) on Saturday August 20 2005, @05:30PM (#13363765)
    Please neologize without sounding like you're spitting on the floor.
      • For people like me with a week vocabulary:

        That should be weak, my friend.

        A week is seven days. This post, correcting your misuse of the word, "week," is weak.

        = )
      • Both of those words you mentioned apply to phenomena that never existed before. You could call them "serial public editorial" and "prerecorded automatically downloadable digital radio" but your jaw would get tired. In fact, the best circumlocution would still miss corner cases which are definitely "blogs" and "podcasts". That's what I take as proof positive that a new word was necessary.

        On the other hand, a lot of neologizing, particularly around "spam", seems to delight in sounding scatological. I wish peo
  • by Anonymous Coward on Saturday August 20 2005, @05:31PM (#13363770)
    It doesn't get much more spammish or objectionable than that!
  • by Reaperducer (871695) on Saturday August 20 2005, @05:33PM (#13363779) Homepage
    Lock the barn. Hide you farm animals. The pigs are nervous.
    This could lead to more cases like this one [scruffydan.com].
  • Slashsplogs (Score:5, Funny)

    by mattjb0010 (724744) on Saturday August 20 2005, @05:39PM (#13363804) Homepage
    Mark Cuban of Icerocket made the accusation that Blogger is by far the worst offender when it comes to Spam Blogs.

    Mark Cuban of Icerocket, allow me to introduce you to Roland Piquepaille of Slashdot...
  • I can see it now.. (Score:5, Insightful)

    by pickyouupatnine (901260) on Saturday August 20 2005, @05:40PM (#13363809) Homepage
    .. Blogger getting bombarded by all sorts of "Questionable Content" flags from all sorts of extremely left / right / PC people ... soon they won't be able to keep up w/ the flags and will just turn off the feature.. :-/
    • by techno-vampire (666512) on Saturday August 20 2005, @06:22PM (#13363948) Homepage
      There's a way to keep people from using Mark as Objectionable to censor political opinions. You have to be logged in to mark, and the blog keeps track of who it is making the mark. Whoever owns the blog can go over the marks, remove any inappropriate ones and, if somebody's abusing the privilege, eject them. Of course, there has to be a way to keep them from simply signing up again, such as checking for a banned email address. Yes, I know how easy it is to create throw-away addresses, but it might slow them down a little.
      • it might slow them down a little

        No. It won't

        They will keep coming and coming and coming until you give up, go home, cry uncle, take Prozac, get a regular day job to replace the one you quit when being an anti-spammer became your full-time job.
        Blog spam will never die [diveintomark.org]
    • by svkal (904988) on Saturday August 20 2005, @06:41PM (#13364034)
      Actually, this probably won't be too much of a problem if they base their system on ratios rather than individual complaints, which I assume they will given the huge number of individual flaggings that will necessarily take place.

      The ratio of flaggings to unique visitors in a given timeframe will generally be higher for spam than controversial opinions. This is because people are more likely to report sites that will actually be deleted, instead of pointless political demonstrations to a one-man audience of some random Blogger employee, and because there is no significant number of unique visitors to a spam site that "agree" with the site's content(as there generally is for a political blog).

      So, for normal circumstances, having an employee periodically go through the sites with the highest flagging ratios will give pretty good results.

      Now, one could also expect campaigning, i.e. higher-traffic sites directing their audience to report lower-traffic sites with "undesirable opinions", but this could only be done for a manageable number of sites. These, after being inspected to make sure that they actually are not spam, could be flagged with an 'innocent' flag by the employee, exempting them from further inspection(after all, political blogs aren't likely to suddenly turn into spam blogs).
  • by SuperBanana (662181) on Saturday August 20 2005, @05:47PM (#13363829)
    Google Blogger is introducing Word Verification for user comments to prevent comment spam

    I once spoke with the VP of a company that was merging with the company I was doing contract work for (both companies were very small, so we had a lunchroom chat).

    He revealed that there were a number of "email blast" (ie email spam outsourcers) that were happy to have dozens of Indian employees on staff ready to do the image-word verification and reply-to-this-email-to-be-whitelisted emails many think-they're-super-smart people had set up.

    Why does anyone think the "illegitimate" spammers don't do exactly the same thing? Especially when, at $5/hr (about what US min wage is, I think) 5 seconds of effort (an overestimate, most likely, after you've been doing it for an hour) works out to about 2/3rds of a CENT...and that has the potential to reach hundreds of people before someone flags it? ONE worker could do 720 an hour...

    • What you say may be true but it certainly isn't an argument against adding this kind of verification. If you make it more costly to do, it will happen less.

      What they really ought to do is use a Bayesian classifier to tell them which blogs are spam and which aren't.
      • What they really ought to do is use a Bayesian classifier to tell them which blogs are spam and which aren't.
        Just about the last thing Google should be doing is full-text statistical analysis. The leap from statistical analysis of blog content to statistical analysis of web content in search of terrorists or dissenting voices or whatever conveniently classified scapegoats is as large a leap as any electron has to make.
  • Oh yeah (Score:5, Informative)

    by dedazo (737510) on Saturday August 20 2005, @05:59PM (#13363874) Journal
    Blogspot is overflowing with these. Take a look at this this [blogspot.com] for example(don't want to go there with IE, BTW), or this one [blogspot.com] or this one [blogspot.com].

    If you use the 'next blog' randomizing feature on blogs you'll see that roughly one out of five 'blogs' are nothing but link farms, worm repositories and bullshit like that.

    And this has been going on for quite a while. We all know that Google has a fondness for indexing Blogger content rather quickly, and so do the spammers. It's about time the company did something about it.

    • Take a look at this this for example(don't want to go there with IE, BTW)

      I have never wanted to go to a website so much after I saw that warning.

      After I submit this post, I'm restarting Mozilla, as that site has caused it to memory leak worse than the Exxon Valdez.
      • Re:Oh yeah (Score:3, Informative)

        It has embedded images in the HTML - it uses the 'cid' format, whatever that is. I think it's a Microsoft thing.

        Firefox kept asking me if I wanted to "launch the application".

    • Thanks for posting those, I flagged all three. After I noticed this feature yesterday, I went through some random blogs and marked the spam ones. I consider it community service :).

      I don't know about Google indexing Blogger sites quickly, though. I created a blog there back in like February. I post their somewhat reliably, and linked to it from my relatively high PageRank website, yet it barely is accessible in Google. I don't think Google has spidered it once (since it only shows the URL when searching, no
      • Re:Oh yeah (Score:3, Interesting)

        I'm not entirely sure how it works, but I'll tell you how I think it works...

        Basically if you're using Blogger as a service and publishing through on your own server, it will index it every time you re-publish the index. That's what I think happens from observation and monitoring the Google cache. I might be wrong.

        If you use the Blogspot service then you need at least *one* incoming external link to be indexed. A friend of mine created a blog there, and I linked to it from my blog published to my own se

  • by John Jorsett (171560) on Saturday August 20 2005, @06:01PM (#13363879)
    That's like saying convenience stores are the worst offenders in armed robbery. Surely the offender is the perpetrator, not the victim.
  • by gregorio (520049) on Saturday August 20 2005, @06:06PM (#13363895)
    Does anyone remember that? Does Google remembers that? Why not 'nofollow' instead of annyoing distorted text confirmation procedures?
    • Because we are talking SPLOGS!

      No, really we are talking about blogs that are spam themselves - no way to 'nofollow' the entire Blogspot domain without ruining the idea of a blog itself. In fact that would ruin Google considering they count on a lot of what their 'Bloggers' are talking about.

      If they decided to 'nofollow' every link I posted in my blog posts I'd jump ship quick.
      • Word Verification can be enabled or disabled by the blog author.

        But anybody who turns it on is likely to run afoul of the Rehabilitation Act, the Americans with Disabilities Act, and/or foreign counterparts.

  • I can't really see a good reason to why Google has that new word verification feature off by default, and like an option...
    Why would a blogger not want to know it's a human behind a keyboard... by default?
    • Well, some bloggers may not like the changes. Look at all the Karma: Normals and ACs here that were pissed about the captchas here when Taco rolled them out...

      I'm going to enable it on mine, though. I've never had comment spam, but seeing as mine is gaining PageRank, I wouldn't be surprised if I DO get some soon...
    • I can't really see a good reason to why Google has that new word verification feature off by default

      Probably something to do with laws requiring companies to make their products accessible to people whose disabilities prevent them from seeing images. (Read More... [w3.org]) Turning on accessibility (that is, turning off word verification) by default means that liability for inaccessible blogs lies with the blog administrator, not with Google.

  • ...when it comes to google groups...
  • by Khyber (864651) <khyberkitsune@gmail.com> on Saturday August 20 2005, @06:14PM (#13363921) Journal
    Ahhh, but will Google solve the problem that LiveJournal has? The problem I'm talking about is the LiveJournal_Abuse team, which has always been made up of volunteers and will ban anyone on any whim for any reason, reasonable or not. I made a community called "DIERIAA" and the purpose of the community was to point people to cool free music. Within twenty minutes of having the community made it was shut down for "promoting the illegal piracy of music." And not one single post had even been made in the community. Will Google be able to solve a problem like that?
    • by Anonymous Coward
      Why would Google address a problem with a blog service they don't own?

      If you don't like LJ's policies, take your business elsewhere.

      Judging by most of the people who have LJ's, I'd say there isn't much you'd be leaving behind.
      • If you don't like LJ's policies, take your business elsewhere.

        This sort of comment comes up repeatedly- "don't warn other people about a bad service, just shut up and find some other service"- how is that supposed to work?

        Capitalism works by consumers having necessary information about a product or service before they buy it or invest time in it, not after. The way to have reliable information about a product is to hear from people who did actually use it and are satisfied or unsatisfied- so publicly decla
    • After reading DIERIAA, I assumed this was going to be a rant about how they assumed it was a misspelling of diarrhea rather than Die RIAA.
  • I hope this isn't part of their Chinese firewall partnership, so they can remove dissenting blogs - one of the last bastions of effective political change.
  • by FuturePastNow (836765) on Saturday August 20 2005, @06:20PM (#13363940)
    What happens when (I didn't say if) affiliates of _________ political party start "flagging as objectionable" blogs written by those they disagree with? What happens when religious wackos flag sex blogs as objectionable? TFA says Blogger tracks the number of times a blog is flagged objectionable and base their action on that, not that they review whether something is actually bad. This could be trouble.
  • Upon reading some of his comments verbatim it is shocking how inarticulate and rambling he is. Seems reasonable to me to label him as radio SPAM - he certainly has the figure for it.
      • People are going to start reporting blogs with which they disagree as spam in an attempt to have it shut down.

        That's not a far cry from some of the moderation I've seen here on Slashdot. Disagree with someone's opinion? Mod them down! In general human beings do not like to face things that make them uncomfortable, and coming face to face with opinions that are diametrically opposed to your own really freaks people out.

        When I have mod points, I try to take care to only mod people down when I feel that they are engaging in personal attacks or other socially disagreeable behavior. I admit that it is difficult for me to mod up comments that are in opposition to my opinions, but if someone has argued a point well and isn't resorting to ad hominem attacks or perversions of fact, I can sometimes get past my biases and up-mod a post. The less important the issue being discussed, of course, the easier it is for me to up-mod an opinion with which I disagree.

        I strongly believe that maintenance of a community that values diversity of opinion is important, both here on Slashdot and in the "real world." Unfortunately it requires effort to maintain community, and much of the communications technology we use today is making it easier and easier for us all to filter out that which we do not want to hear. Perhaps it's not an accident that political discourse in the United States has sunk to such a morass, devoid of any real substance.

  • by SilentReallySilentUs (908879) on Saturday August 20 2005, @06:44PM (#13364055) Homepage
    Is it really that difficult for Google? In addition to the website caches, they have the complete Deja archive at their disposal to train any kind of learning software. Plus, this problem is already solved in Gmail. I agree it hurts when you just spent a few hours writing a blog and the first message you get is "Wow that is really nice! I will read it again. Please see my mortgate site here ..."
  • By who's definition?

    One persons 'objectional material' is another persons religion.. ( for example )

    Yes, i know that its Googles' servers and they get to control content .. bla bla bla bla. Just because its legal doesnt make it the right thing to do.

    Now, controlling spam.. more power to them...
  • by iphayd (170761) on Saturday August 20 2005, @09:13PM (#13364581) Homepage Journal
    This should have been posted by Roland Piquepaille.
  • by Sundroid (777083) on Sunday August 21 2005, @01:11AM (#13365283) Homepage
    Blogger has addressed the "Flag" abuse issue. From their own internal "Blogger Buzz" blog (http://buzz.blogger.com/ [blogger.com]), it says: "From a technical standpoint, we are able to detect when multiple votes come from the same source. We prevent against ballot box stuffing. But most importantly, we're not automatically removing content based on the flags. We're using the feedback from Blog*Spot readers to help assess what the community has noted as potentially objectionable. In the cases where objectionable content has been identified, the most common action is for the support team to 'delist' the blog. This simply means that the blog is not promoted in areas of blogger.com like Recently Updated - but it's still viewable on the web. The content is not blocked or removed in anyway when the blog is delisted."

    So for those who are concerned that their "enemies" might use the "Flag" feature to attack their blogs, relax!
    • Re:good for google (Score:5, Informative)

      by croddy (659025) on Saturday August 20 2005, @05:33PM (#13363778)
      No, this is not about reducing spam in the comments on blogs. This is about reducing the number of blogs whose authors post only spam. The number of such blogs is enormous -- most counts put it between half and 2/3.
      • by Anonymous Coward on Saturday August 20 2005, @05:40PM (#13363811)
        The number of such blogs is enormous -- most counts put it between half and 2/3.

        I'm not sure I'd call any number less than one "enormous".
      • Re:good for google (Score:5, Interesting)

        by ImaLamer (260199) <john.lamar@gmail . c om> on Saturday August 20 2005, @05:41PM (#13363814) Homepage Journal
        This is about reducing the number of blogs whose authors post only spam. The number of such blogs is enormous -- most counts put it between half and 2/3.

        Good thing it is being done too - I'd hate to be excluded from the other search engines because I've got a few blogs with Blogspot/Blogger. Gets rid of that whole "guilt by association" thing.

        BTW: The 'flag as objectional' button hasn't shown up yet on any blogs I post to.
      • Off the above link "We've just introduced the option to require word verification for comments. This option (off by default) gives bloggers a tool to prevent the automatic creation of comments by nefarious ne'er-do-wells (e.g. spammers). Features like comment captcha and flag as objectionable are not complete solutions to the problem of spam. But they are additional tools that can help address it." I may just be missing something but what is word verification about then if not to stop spammer (bots) from
      • No, this is not about reducing spam in the comments on blogs. This is about reducing the number of blogs whose authors post only spam.

        Sheesh, I know it's de rigeur here to not read the article, but at least read the freakin' summary.

        Now Google Blogger is introducing Word Verification for user comments to prevent comment spam

        What part of that don't you understand?

        And I say it's about time too. I have a (very unpopular, sporadically updated) blog on Blogger/Blogspot - linked above - and every single time
    • I have similar qualms about it- however, if Google makes it clear that "Flag as objectionable" is only to be used for spam or flooding, and that they will *not*, under any circumstance, censor anything but spam, then it isn't a big deal.

      Blogger has certain terms of service, which include not being a spamming retard. They don't want to pay to host spam. Do *you* want to pay to host spam? Fine, then; do it on your own servers. Spammers don't have to use Blogger to screw over the internet; they can buy their o
      • As I read it, it was somewhat more nefarious. It was describing something to remove from the public listing all blogs that had content that the public considered objectionable.

        That said, Google claims that they'll do nothing to the blog itself, except possibly put a warning that it's got objectionable content, if it's reported as objectionable enough times. They WILL remove it from the public listing, though.
    • by Anonymous Coward
      Consider that each relatively new (as most Splogs are) blog takes up on average 3 KB of space on a hard drive (size of average e-mail text spam, which is then uplaoded to Splog),

      and based on the fact that Blogger.com uses Maxtor MX830HA hard drives with an average surface capacity of 300 MB per platter (x20 = 60 GB),

      each platter being 212 tracks,

      so each track holds 1.4 MB, and take the radius of 3 inches times 2pi to get circumference of approx 19 inches, which is 48 cm, or about 500 mm.

      So you have 500 mm h