Google Explores Re-Ranking Search Results Using +1 Button Data 126
tekgoblin writes "Google plans to use data from its +1 button to re-order search results and keep spammers at bay. While this would bring Google’s search engine into the social networking era, it would also create a new avenue for blackhats to manipulate search results. From the article: '"Google will study the clicks on +1 buttons as a signal that influences the ranking and appearance of websites in search results," a spokesman wrote. "The purpose of any ranking signal is to improve overall search quality. For +1's and other social ranking signals, as with any new ranking signal, we'll be starting carefully and learning how those signals are related to quality."'"
This is a really bad idea (Score:2, Insightful)
Re:This is a really bad idea (Score:4, Interesting)
This is about the +1 button on search results, not G+. G+ has nothing to do with this.
There is a risk of spamming, but it depends also on how they end up using it. If, say, it ends up promoting things similar to those I've +1'd, but only for my own searches, then the risk of spamming is quite low. If similarity is determined by the +1 from people that tend to +1 similar sites to myself then you can't really spam it at large scale.
A user that +1s every site in sight will not have a similar profile to anybody else but other spammers. If they +1 their own sites and a coherent set of other sites then they'll only target the specific people interested in those kind of sites; too much work for too little effect for it to be an effective manipulation method.
Re: (Score:2)
I would hope the engine would realize that the profiles of those "users" who rampantly promote sites like that are so vastly different from my profile as to not seriously affect my results.
Re: (Score:2)
No, it's actually about G+ too [webpronews.com]. Note that only +1 from G+ profiles will count.
Re: (Score:3)
Which may be why Google is pushing so strongly (rightly or wrongly) for verified identities on G+.
Re: (Score:2)
Re: (Score:2)
I didn't say it was a good reason.
Re: (Score:3)
A user that +1s every site in sight will not have a similar profile to anybody else but other spammers. If they +1 their own sites and a coherent set of other sites then they'll only target the specific people interested in those kind of sites; too much work for too little effect for it to be an effective manipulation method.
Just out of curiosity, what prevents spammers from +1ing a coherent set of sites plus the one(s) they're trying to push? For example, what if you have a spambot advertising dating sites that also +1s a bunch of tech sites (i.e. Slashdot)? Wouldn't that effectively create targeted advertising?
Re: (Score:2)
Re: (Score:2)
Google is very good at detecting this sort of insincere promotion, and compensating for it. That's probably the the hardest problem, and the one most core to its business, that Google addresses. For someone to actually game the system, they end up having to work pretty hard to make good content and appear sincere, and in the end, they might actually be making
Re: (Score:2)
Nothing other than the need to create multiple user accounts.
But even so, there are defenses against that - weighting by age, history, and activity on the account for example. You can weight by coherency as well - the +1 (you mention) can be weighted less because it's an outlier. Etc... etc...
Re: (Score:2)
As I said, that doesn't scale well. You'd need multiple bots doing this for this to have any real effect, and you'd only target those people that read largely the same subset of tech sites.
And remember, they'd only push their own dating sites, or Viagra sites or whatever, over their competition. You need to search for a dating site to begin with for their manipulation to have any effect at all.
Re: (Score:1)
Re: (Score:3)
Re: (Score:3)
...if we're lucky.
Re: (Score:2)
These sorts of systems only work if you can not game it.
All systems can be gamed. The question is not, "Does the system 'work'?" It's, "How well does it work compared to its closest competition and how can it be improved?"
Re: (Score:2)
I think this is why they're insisting on real names for Google+.
When every user is a person, it is more difficult for spam CEO hackers to skew Google results. I suppose they could still try to harvest millions of Google accounts to use as +1 slaves but that's a lot harder than setting up a content farm. Spammers will have to create fake personas or steal real ones in the millions to be able to cheat now.
Re: (Score:2)
I think they're taking care of most of the spam problem by not having a "private message" option on G+ ... at least not one I can find.
Re: (Score:1)
oops, typo: I meant SEO, not CEO.
Re: (Score:2)
combine this with crowdsourcing. KABOOOOOOOOOM.
it doesn't need to be a million hits on one. all that's needed is that the top 40 results for searches are bought.
google should add "don't show any hits from this site to me ever since google is tracking me anyways" button though. that way you could tune down the amount of clone-site hits(seriously, many programming queries now feed you from google with the same thread ON FUCKING TWENTY SITES NONE OF WHICH ARE THE ANSWER).
One signal out of hundreds (Score:2)
Google uses hundreds of signals to determine search rankings. They're researching adding one more. I'm pretty sure they have a good concept of signal-to-noise and spammers gaming the system by now. It's what they do for a living.
Re: (Score:2)
Yes, because Google is so well known for bad search ideas and have no experience dealing with search spammers and SEO. /sarcasm
The more signals a service can plug into, the better. Just look at the Real Time twitter feed that they had. Twitter is full of spammers, but it turned out to be an incredible service (good signal far outweighed the bad). I do think G+ will figure into ranking of trusted +1 sources. With their real name position, those with thousands of followers will most likely be verified. A
Well, it's a good idea (Score:4, Insightful)
However, it's just one more service Google Apps customers ( you know, us paying folks ) can't use.
Only any use if ... (Score:3)
... they also provide a -1 too. I'd like to mod a page down as a method of balancing.
Re: (Score:2)
... they also provide a -1 too. I'd like to mod a page down as a method of balancing.
I thought the same thing when I first saw the silly icon.
The first thing that hit me after I wished for a -1 was, "Oh... Then we'll have wars between spam-happy pushers and corporations. It (the +1, -1 system) will be used so much there will be DoS arising from it no time flat." :)
Re: (Score:2)
This so much.
I want to obliterate certain sites from search results:
Stuff like this, for example: http://tapix/ [tapix] . ru/mobile/t64340 (it is inscredible how many sites which are basically list of sequential numbers there are.)
Various newsgroup, forum and mailing list scrappers.
javadoc and opensource code mirrors.
goddamn www.roseindia . net/java/
etc... There is just too much of redundant clutter.
Re: (Score:2)
I want to obliterate certain sites from search results
You can :)
google.com | gear icon | search settings | Manage blocked sites
Re:Only any use if ... (Score:4, Interesting)
Say 4 of 5 people hold a majority view here. Say there are 400 posts representing the majority view, and 100 posts representing the minority view. Say on average there is 1 randomly selected moderator per 10 posters, and the moderators' views have the same distribution. And say that Slashdot only allowed positive mods.
There are 40 mods giving +1 to 400 majority-view posts, for an average of 40/400 = +0.1 per post.
There are 10 mods giving +1 to the 100 minority-view posts, also for an average of +0.1 per post.
Now toss in negative mods. Say one in ten mods gives a -1 to an opposing viewpoint rather than a +1 to their favored viewpoint.
The majority view gets 400 posts, 36 +1 mods, and 1 -1 mod, for an overall average score of 35/400 = +0.0875 per post.
The minority view gets 100 posts, 9 +1 mods, and 4 -1 mods, for an overall average score of 5/100 = +0.05 per post.
The situation gets worse the more people tend to use negative mods. When the ratio of negative to positive mods matches that of the distribution of views (i.e. 1 negative mod for ever 4 positive mods in my example), the negative mods from the majority completely cancel out the positive mods from the minority and the minority view ends up with a 0 ranking average. If the ratio of negative to positive mods is greater than the ratio of minority to majority views, the posts representing the minority view end up with an average negative ranking. Algebraically:
p = % of positive moderations
n = % of negative moderations
A = majority population
B = minority population
Average majority view ranking = Ap - Bn
Average minority view ranking = Bp - An
It's pretty easy to see that if A > B, this skews the majority rankings to be higher than the minority rankings. And if A >> B, B basically has no say in the rankings, and the rankings are almost entirely determined by A's opinions.
So basically negative mods act as a force multiplier, allowing the majority to influence rankings beyond their actual numbers. That is, negative mods tend to produce rankings which reflect the majority view, rather than a utilitarian view. If you want rankings which reflect how useful a site is to the people who want the info on the site you use a +1-only mod system. If you want rankings to reflect the majority's opinion of a site even if it contains nothing they wanted to find, then you use a +/- mod system.
If Google were to allow -1 mods, expect pages for niche topics like Linux to get pounded into the negative, while pages for larger market-share products like Windows are barely affected. Basically, hundreds of millions of people would do searches on Windows topics, and -1 the occasional Linux page which got into their search results. But those "occasional" -1s against Linux pages would likely far outnumber the +1s given to Linux pages by the few million people doing searches for Linux.
Google tricks (Score:5, Insightful)
I'm truly impressed at Google's ability to makes us work for them without most people even noticing it. This is yet another example of that. Then I hear non-tech people in awe of how smart Google is, and then say to myself, that is one hell of a business model.
Re: (Score:2)
Google's been using us to improve its index from the beginning - the whole idea of PageRank is that the linking patterns of web sites given an indication as to their importance and value.
Re: (Score:2)
That's what he said.
This is just a head shake (in both ways) of it happening again. :)
Re: (Score:2)
Yep, Google is just one big mechanical turk.
Re: (Score:2)
Re: (Score:2)
Heh.
See http://dictionary.reference.com/browse/tool [reference.com]
Scroll to "World English Dictionary", definition 5.
Re: (Score:2)
Unpaid Employees (Score:1)
Yeah, sure, we're just unpaid employees - until their copyrights expire!
Re: (Score:2)
Not all payments are monetary. the use of their service is payment enough for me.
Re: (Score:1)
We are free employees, we use their system for free, and the system itself is an aggregate of all of our previous clicks, so when you search for "test tube" (for example), Google is just listing the most commonly clicked items that other people have selected. We are all sharing our experiences to make your search easier, Google is just compiling that info, and making money off people who want to cheat just a little and pay to be in the sponsored area. Pretty worth while trade for me as user, I would say.
Re: (Score:2)
Ain't that the truth.
My comment I posted, before refreshing and re-reading the comment tree, could use what you said here as its title :)
Comment: http://tech.slashdot.org/comments.pl?sid=2404564&cid=37252488 [slashdot.org]
Re: (Score:2)
BigData - SimpleAlgorthim. Isn't that their motto or something?
Re: (Score:2)
You think Google is bad, trying going to the grocery store sometime. Back in MY day, they used to check you out, bag your groceries, take them out to your car, and help you load them in. Now you're lucky if they don't spit on you when your checking yourself out.
Of course, my father used to tell me that in HIS day, grocery stores would actually deliver to your house. But clearly that old bastard didn't realize that civilization peaked when *I* was young, not when his lame generation was young. Like all gener
Re: (Score:1)
That's curious. Just 2 days ago, I bought groceries. The staff rang up my items, bagged my groceries, and offered to help me take them out.
Maybe you should find a better grocery store.
Re: (Score:2)
Yeah, they still pump your gas for you in New Jersey. That doesn't make it typical.
Re: (Score:1)
Re: (Score:3)
Yeah, they still pump your gas for you in New Jersey.
It's actually required by state law. Personally, I prefer self-service.
Re: (Score:1)
Re: (Score:1)
Of course, my father used to tell me that in HIS day, grocery stores would actually deliver to your house.
Tesco delivered 2 weeks of groceries etc. to my house this morning.
Re: (Score:2)
*Every* grocery store delivered. And my dad lived in a rural town too.
Re: (Score:2)
I couldn't agree more, unless I marked you +1 Insightful.
Oh well, back to meta-moderating.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
But when it comes to get better search results you feel offended because
Re: (Score:1)
"Proposed", not "plans to" (Score:3)
It's a research project into how useful this metric would be. The article makes it clear that they haven't made any plans to actually implement it until they know how well it works. At this stage, blackhat activity would lead that project to a negative conclusion, and the feature wouldn't be implemented, and it couldn't be manipulated.
Re: (Score:2)
Unless the blackhats were smart, which most are. In which case they will use this testing stage as a way to elicit false confidence in this new avenue while gathering information they will need to make the new changes work in their favour. Black hats almost never do what you'd expect them to do. They often do the exact opposite until just the very proper
Re: (Score:1)
Well if it's trivial for SEO people to exploit, then it's trivial for Google to identify SEO firms and put them in the great ethereal void.
Re: (Score:2)
My exact sentiment. You would either "play nice" or not play at all until it goes live. Even then ... you might wait for the PUBLIC to gain confidence, then launch what ever human/computer methodology to pump up your desired links.
Re: (Score:1)
What this all comes down to is that we have a distinct need present that is not being met. The need we all have is to have a site that reviews specific content types and vets it against objective truth; the best results are measured by scientists, and the result is delivered.
Google used to do this somehow. But since the SEO people basically managed to figure out how to circumnavigate the search engine's security or algorithms to prevent corruption of results, we now see more and more search results that sim
Re: (Score:2)
At this stage, blackhat activity would lead that project to a negative conclusion
What if they narrowed their project to an admittedly database killing subset of only counting +1s from people in your circles?
I don't have any blackhats in my circles. Err, let me rephrase that, I don't have any SEO blackhats in my circles, so far as I know.
If they tried that, aside from vaporizing their DBMS, the blackhats can +1 the entire world of spam for all I care, no one will ever see them.
They could alternatively only pay attention to +1s from "celebrities". I read posts on G+ from Linus all the t
System Law of Change (Score:1)
Changes to any system will elicit unpredictable changes by forces acting upon said system.
+1 data is a good thing if you accept that it is delivered by people who have been vetted by Google. What's to stop Google from then only listening to those that represent Google's interests? From a fiduciary perspective they would be foolish not to listen to those they revere the most.
The +1 data is useful, but is it useful enough? Not without a -1 button, imho. But in the big picture, +/- is just a perspective from a
we don't need a +1 we need a flag spam button (Score:1)
Wouldn't it be a lot easier to have web pages flagged as spam and ranks lowered rather than trying to crazy +1 thing that most people won't use, even if they liked the link?
Re: (Score:1)
They have one. You just have to log into your gmail account, click the "Feedback" button on the bottom of a page of search results, select a radio button option for what you're reporting, then explain what/why you're reporting something (I think it gives you a text box). I don't think it lets you choose a specific result, though. It just lets you report that a "page" of results has spam.
Yes, it's too many steps. But I don't know of another solution. Having a big button next to each result wouldn't be a bett
Re: (Score:2)
I'm surprised at how few /.ers trust Google not to mess up what is, essentially, everything Google stands for - search.
The idea that people might abuse the +1 feature is so obvious that it goes without saying. I'm pretty sure Google realise this. Whatever they do, they won't jeopardise their leadership in the search market and even if they do, well you've always got Bing.
Social ranking (Score:1)
You know this will result in any site with cats being the top ranked search result no matter the topic.
Re: (Score:1)
Re: (Score:2)
I don't have a Google+ account (only a regular Google Account), but I see the +1 buttons in the search results too. Just not being able to use them without getting a Google Profile (which I won't at the moment for obvious privacy reasons).
Leveraging search to get identity entries (Score:3)
Better than that - they're actively leveraging the search engine to collect identity profiles [webpronews.com]. Join G+ or your page falls down the search.
They're actually willing to compromise the search to collect profiles.
Re: (Score:2)
They're talking about possibly using the credibility afforded to authors by other authors as a search signal. And they're currently using G+ profiles as author identifiers.
It reads to me more like "join G+ or your author rank has no effect".
Re: (Score:2)
I agree with that there's probably a reason to be a little paranoid about what they're doing, but I think you've got it absolutely backwards. They're leveraging G+ to improve their search engine. They earn their money from advertising, and the biggest driver for that is search. It's free labor, and it can greatly improve the quality of search results. That's pretty close to free money.
Think about it, they have had the browsing profiles of most people for a long time already, and they could (and most likely
Ummmm.... a given? (Score:2)
I don't know about anyone else, but I sort of (logically) assumed (no ass jokes, please) that this was the case.
In other words, imagine Google and what "it" does. Imagine the amount of money they take in on a daily, monthly, yearly basis. Now tear it down into its components and don't just assume the logo is what makes it all happen; where does all of this money come from. Data - data that's sold.
What happens when a company (extended context: a person, a country, anything that involves a Human) do when i
Re: (Score:2)
What you're talking about isn't a hypothetical. As a high-profile example Microsoft uses telemetry from Windows users to drive UI design, although by all accounts the process of turning user info into better products is, uh, inefficient. Whenever something crashes on your iPhone, it phones home a bug report.
Re: (Score:2)
Linking +1 with their normal antispam measures (Score:2)
One way to mitigate the influence of the SEO asshats is to purge +1's that are associated with accounts and/or IPs that are seen to offer a lot of +1's for known spam/linkfarm pages.
Multi-Algorithm? (Score:2)
Re: (Score:1)
HotBot! (Score:2)
HotBot did this back when... you know back when people still knew about a search engine called HotBot. They had a ranking system where you could add a point to the sites that you felt most resembled what you were looking for.
They need vigilantes (Score:2)
Re: (Score:2)
or just give the option to do that per user. that would be a nice webbrowser plugin nowadays.. that let you add shit like adding ad-referral codes to downrank search results(just for you). that would make mining less of a chore.
Re: (Score:1)
They try to get the computers do do that automatically. They DO have human search quality reviewers who manually do this kind of ranking in an effort to verify how well site rank performs and highlight areas for improvement.
Not all bad .. (Score:2)
The +1 idea is not bad, it is bad only if you don't have any controls. +1 doesn't have to be a static value, it could just be a place to start. For a person that "likes" everything, their +1 doesn't mean as much as someone like me who only +1 on rare occasions. Likewise, a +1 from a "new account" is probably worth less than someone like me who's had their account for years.
All this means is that the +1 is weighted by several factors that prevent skewing results by link farmers and blackhats..
Economics (Score:2)
Re: (Score:1)
We need a truly micro-transaction oriented paid content system, not the current "who can fake more SEO" to get a pay-check for content. Let the customer payments _directly_ decide who is worthy, not the stupid web of lies driven by advertisements.
Re: (Score:2)
obvious (Score:1)
Letting people choose the best needles in a haystack based on their likes is just too invaluable for any search engine with a huge haystack. Clearly, G+ and the +1 buttons on the search page (and on other sites) play their part in curating Google's data. As far as abuse goes, people have tried to abuse every variable in the page rank formula, and succeeded at times. But, it's
No problem ... (Score:2)
fail (Score:1)
every time I click it I get asked to sign up for yet another google account. it already constantly fuck up the 2 I have why would I want yet another for +
should have done -1 instead (Score:2)
If I search for something and find what I want, I'm satisfied, and I'm not going to bother pressing +1. On the other hand, I have seen some bad matches turn up; I would rate them -1 but there wasn't an option for that.
What about intentions? (Score:2)
If I google "Rick Parry" the first two hits are correctly about Colbert. If I google with bing (hehe), Microsoft's not-too-much-information-engine, the first Colbert result is a third of the page down. Instead they talk about a guy I never heard about "Rick Perry"with an e. But someone else who is friends with that Perry guy might find that result to be the better one.
If I type in "cups" I probably look for the web-page of my printing system, and not for sippy cups. But with the +1 button all the young mom
Surprised (Score:1)
Crowds can be sourced (Score:2)
"Crowdsourcing" search is a very bad idea, because crowds can be sourced.
Google shot themselves in the foot this way last October. They used to count reviews on Yelp and Citysearch in determining placement in Google Places, which, until last October, only affected the search engine for Google Maps. Since few people used the search engine for Google Maps to look for businesses by category, it wasn't spammed much.
Then, in October 2010, Google merged Places results into web search. Within two months, Plac [sitetruth.com]
Doubtful (Score:1)
Make the -1 button (Score:1)
As Expected (Score:1)
This is exactly what they said +1 wouldn't do.
When they rolled out +1, it was...
See +1s and get more relevant results/reviews/etc.!
But I don't want to see other people's +1s, and I don't want them to see mine.
You'll only see your friends's +1s, and you can choose not to share your own.
But I don't have any friends on Google, or a Google profile.
No problem, when there aren't enough +1s from your circles, sometimes we'll display +1s from people outside of your circles.
I don't want that shit at all.
Don't worry,
the whole point (Score:2)
The financial stakes of a high ranking are large enough for SEO experts to make search results increasingly meaningless by propelling their irrelevant page high up into the ranks by whatever means they can possibly think of. It is an arms race and I'm sure Google must have a lot of resources devoted to keep this problem under control.
For me it was immediately obvious [slashdot.org] that improving the ranking algorithm was th