Slashdot Log In
To Search Smarter, Find a Person?
Posted by
Zonk
on Tuesday March 25, @12:43PM
from the when-the-man's-right-the-man's-right dept.
from the when-the-man's-right-the-man's-right dept.
Svonkie writes "Brendan Koerner reports in Wired Magazine that a growing number of ventures are using people, rather than algorithms, to filter the Internet's wealth of information. These ventures have a common goal: to enhance the Web with the kind of critical thinking that's alien to software but that comes naturally to humans. 'The vogue for human curation reflects the growing frustration Net users have with the limits of algorithms. Unhelpful detritus often clutters search results, thanks to online publishers who have learned how to game the system.'"
Related Stories
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading ... Please wait.

Will Google... (Score:5, Funny)
But isn't AI and metadata just around the corner? (Score:2)
Re:But isn't AI and metadata just around the corne (Score:3, Insightful)
If I can guarantee anything I
Re:But isn't AI and metadata just around the corne (Score:4, Insightful)
In fact, it's a basic theorem that given sufficient time, human-level intelligence can always beat any system with less than human-level intelligence (aside from trivial cases like a complete firewall). This is because the human's theory of mind can fully encompass the lesser system (so you can understand how it works), while the reverse is not true. Computers can only beat humans at chess when the match is played with a time control.
This doesn't mean that a computer system can never be good enough to solve this problem. However, it does mean that if you could build a computer system that could solve it, then it would insist on being paid.
It also doesn't mean that using human-level intelligence will always solve this problem. Humans can still be beaten, they just start on a level playing field. Hence it's pretty much inevitable that some people will still find ways to game the system.
just around the corner (Score:3, Interesting)
Expect to lose your job soon after the paperless office arrives. It's always just around the corner but something human gets in the way every time. AI will be much the same.
Re: (Score:3)
Re: (Score:3, Insightful)
Re:Old news (Score:4, Funny)
Algorithms are written by people (Score:4, Insightful)
Unless we are talking about Skynet.
Generation Gap (Score:5, Insightful)
Re:Generation Gap (Score:5, Insightful)
Critical thinking comes naturally? (Score:4, Insightful)
New Ingenious Filtering System! (Score:5, Interesting)
Tag article "activelyavoid" and move along.
Interestingly enough, this whole thing sounds like an idea Rob Malda thought up about 10 years ago, except Brijit lacks a discussion and moderation system where experts and opinionated thinkers can vie to share their collective wisdom to enhance the content of the original article.
Everything Old is New Again (Score:5, Insightful)
In the absence of the mythical, impossible strong-AI, there will always be an important role for experts -- you know, thinking meat, sitting there pushing charges through neurons, having opinions about stuff -- and those experts will probably use a lot of mechanized search tools to improve the breadth of their knowledge, their awareness of knowledge, and the accessibility of information. Technology and people work together!
But you're an idiot if you take out the wetware-based BS filter.
It's coordinating all that expert opinion, and filtering out the drivel, that poses the great organizational challenge of our collective information future. Wiki-based approaches are a good first step; maybe a "trusted-wiki" like Citizendium [citizendium.org] will be the next step; it's definitely going to keep evolving. But it's long been recognized by the reasonable that if you want an informed opinion, rather than a pattern match, you ask the librarian. We've known that since Alexandria -- nay, Ur -- and it's a shame we keep forgetting.
Finally (Score:5, Funny)
Like the original Yahoo (Score:4, Insightful)
Economics, Wisdom of Crowds, and Experts (Score:5, Insightful)
A somewhat more interesting thing, in my opinion, is all the "wisdom of crowds" stuff we see so much hype about. It's interesting because it works very well in certain cases - basically the case where the popular thing is the right thing. The main problem with this is that any search engine that shows you 10 results and then counts which ones you click, well, it's not getting your input on result #11, or 23, etc. So before anyone votes, items that happen to be near the top almost certainly stay at the top. Many good items that the algorithm ranked medium might never get voted on!
One way around this is to randomly select some less good results, so that viewers get a chance to vote for the underdogs and bring them to the top of the pile. But this pollutes results for each user, essentially making them pay a "moderation tax" by requiring them to see things that the algorithm has no reason to believe are better results.
All-in-all, social information finding features seem to be much better suited for finding things you didn't even *know* you wanted - StumbleUpon being a great example of a tool for doing that. I would imagine that this could be very useful even in the corporate sector, as many business strategies and engineering techniques have variants or cousins that are similar in function, but may be more obscure. Having the ability to see that "people who searched for X ended up wanting to know about Y too" might save me a lot of time...
Lack of machine intuition is a feature, not a bug (Score:3)
But what if the system being "gamed" is a human-based search engine? Since the publisher must fool humans anyway, the "unhelpful detritus" in the end users' results will blend in. Even if there are fewer false positives, those that remain will be harder to eliminate.
Webrings writ large (Score:3, Interesting)
While I have not RTFA (this is Slashdot, after all), the summary makes it sound like the combination of Webrings and "Top X" lists, both of which are used much less now and don't carry as much weight but still require user interaction on a grand scale.
I'd be interested to see how this kind of search engine turns out- however, you also have the problem of "majority think", so searching for, say, evolution might have a first result for a page "debunking" it. But then I browse at +4, so I shouldn't complain.
Either you pay the editors, or it's crap (Score:4, Insightful)
Wikia shows the problem with this approach. Coverage of Star [Wars|Trek|Gate|Craft] is extensive. Coverage of, say, bank regulation is nonexistent. If you want to find out how we got into the subprime mortgage mess or what to do about it, Wikia search is totally useless. That's what you get from volunteer editors. Wikipedia does better, but most of the good contributions were made years ago.
Today, you pay the editors, or you get fancruft.
It's amusing that the author of the article feels overwhelmed by The Economist. That's a very well written magazine with good reporters; they had the only reporter in Lhasa when the Chinese clamped down, and they have a good analysis this week of the issues surrounding derivatives. If this guy can't handle The Economist, his organization's answers will probably be dumbed down to the level of, say, "People". That level of crap one can get for free, from many existing sources.
Remember Google Answers? Nobody really cared, and Google shut it down.
There's a whole industry of expensive, small-circulation specialist newsletters, but those are niche operations run by specialists in narrow fields.
Applicable (Score:3, Insightful)
Where is the knowledge we have lost in information?"
--T.S. Eliot
Wow, perhaps it's just me, but.... (Score:3, Interesting)
It's not really difficult, many of those sufferers know how to use a library, which is the real world equivalent of searching on the Internet. (not that the Internet is not real world) Most people were taught how to use a library in their school days and that usage has not changed much with time. The usage of Internet searching does change, and there are multiple ways of doing it. People who are not interested in learning new ways will always just say it is too difficult.
Using boolean modifiers or advanced search is always there, people just don't use it. They also don't fix their own lawnmowers or other things. They just replace them or pay someone else to do the 'hard' stuff. There is enough information on the Internet to allow anyone to learn to protect their home computer from infections and malware, yet it still is a problem.
The human problem of search engines will NOT go away, it can only be made to look less with smarter UIs. A tag cloud system of bookmarking could be used to refine search results but would not work in all cases. The URL history with timestamps might help, but not in all cases. Analysis of search results and those pages actually visited might help narrow the criteria to personal bias but not in all cases. That is why the operator has to be smart enough to know what they want and don't. The Internet does not come with your very own personal cruise director to make sure all goes well. People just believe that it is supposed to be easy because they want to do the cool things that they hear about on television and from their friends etc.
Perhaps one day the interface will be fast enough to be considered good when our brains can be plugged into the computer itself, something like The Matrix, reducing click delays and reading to milliseconds. Until then, teaching people how to use complex search strings will help reduce the angst and pain.
"cars +toyota -hummer 2005" aobut 2.98M hits
is better than
"cars 2005" about 19 million hits
but you have to teach people that those extra characters really REALLY do help.
If people don't know how to use a soldering gun, please don't give them one... or something like that. Oh yeah, car analogy: you apparently can't drive on the streets of the USA legally without a license, which you cannot obtain without demonstrating proficient control of the vehicle.
Yahoo! (Score:3, Interesting)
It's not that hard to get rid of the crap (Score:5, Interesting)
We're back to the Yahoo! model because people have figured out how to game the system, namely Google, without adding content that's important to the searcher.
It's not hard to throw out most of the bottom-feeders. [sitetruth.com] We do it. The crowd at Search Engine Watch (which, despite the name, is all about advertising, not search quality) is writing me angry messages for doing that. Now that we've demonstrated that 36% of Google AdSense advertisers are bottom-feeders, they know they're being watched. Some feel they're being targeted.
Bear in mind that most search requests are really, really dumb. [google.com] That's what Google has to answer. In fact, most Google search requests don't hit the search engine at all; there's a cache of common queries and answers in all the front end machines, and a sizable fraction of requests are answered from cache.
Re:Really? (Score:4, Insightful)
There has to be some kind of intelligent filtering. If it's not done for me, it's done by me, when I choose which result to click. The biggest problem with paying someone to do that sorting for you is the simple fact that it's too expensive. Yahoo might have stayed a human-sorted list forever, except that it would have taken an army of "surfers" to do it. The web just got too big to be done that way all the time.
Google results used to be a lot more relevant than they are now. Far too often, I'm interested in X, and search for "X" on Google, I find millions of people who want to sell me X. But I'm not even sure if I want to buy it. I'm looking for information about X. That is getting harder and harder to find. The quote in the summary is correct - people have learned how to "game" the system.
How often do you "google" something, and then just go to the Wikipedia link? I do all the time. That way, I can be sure to get actual information about the subject, rather than a link to its Amazon page. In many ways, because of the search engine optimizers, Wikipedia is already replacing Google as the default source of information.