Slashdot Log In
Next-Generation CAPTCHA Exploits the Semantic Gap
Posted by
kdawson
on Wed Apr 23, 2008 08:03 AM
from the stand-and-identify dept.
from the stand-and-identify dept.
captcha_fun writes "Researchers at Penn State have developed a patent-pending image-based CAPTCHA technology for next-generation computer authentication. A user is asked to pass two tests: (1) click the geometric center of an image within a composite image, and (2) annotate an image using a word selected from a list. These images shown to the users have fake colors, textures, and edges, based on a sequence of randomly-generated parameters. Computer vision and recognition algorithms, such as alipr, rely on original colors, textures, and shapes in order to interpret the semantic content of an image. Because of the endowed power of imagination, even without the correct color, texture, and shape information, humans can still pass the tests with ease. Until computers can 'imagine' what is missing from an image, robotic programs will be unable to pass these tests. The system is called IMAGINATION and you can try it out." This sounds promising given how broken current CAPTCHA technology is.
Related Stories
[+]
Windows Live Hotmail CAPTCHA Cracked, Exploited 362 comments
eldavojohn passes along what may be the last nail in the coffin for CAPTCHA technology. Coming on the heels of credible accounts of the downfall of first Yahoo's and then Gmail's CAPTCHA, Ars Technica is reporting on Websense Security Labs' deconstruction of the cracking and tuning / exploitation of the Live Hotmail CAPTCHA. Ars calculates that a single zombie computer can sign up over 1400 Live Hotmail accounts in a day, and alternate account creation with spamming. Time to dust off Kitten Auth?
Firehose:Next-generation CAPTCHA Exploits the Semantic Gap by Anonymous Coward
[+]
reCAPTCHA Hard At Work, Rescuing Fading Texts 31 comments
sciencehabit writes "Computer scientists have developed a program, called reCAPTCHA, which is being used in lieu of CAPTCHA by several sites, to help digitize old books and newspapers. The reCAPTCHA takes entries from old and faded texts that optical scanners and digital-text readers have trouble with. So every time you solve that string of crooked letters, you may actually be helping historians digitally reconstruct a page from the 1908 New York Times." The Science Now story links to the longer and more informative article at Ars Technica. (We last mentioned this program last year — and now it's good to get some sense of how well it's working.)
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.

Too hard. (Score:5, Insightful)
This Captcha suffers from the same old problem. As Captchas get harder more humans will fail them.
*or annotate... or centre
Blind people? (Score:5, Insightful)
Parent
Don't forget users of lynx (Score:5, Interesting)
Lynx is the geek slacker's greatest tool, when run in an ssh session from your home server, not only is the traffic unloggable (except for "he's calling home a bit") but it even looks like work to the uninitiated.
Parent
Re:Blind people? (Score:5, Insightful)
Parent
Re:Blind people? (Score:5, Funny)
Parent
Re:Blind people? (Score:5, Insightful)
The Internet is becoming much too important to leave a significant amount of the population (pardon the pun) in the dark. We have the technology to help the blind navigate web sites independently. Unfortunately, CAPTCHAs are hindering much of that progress.
Parent
Re:Blind people? (Score:5, Insightful)
No, spammers are. The root problem of this "solution" is the spammers, who do not care our personal feelings of privacy. They don't care that their messages cause everyone else's costs to rise.
Without CAPTHA technology, none of the web mailers would be usable, as they would all be blocked by every known blacklist.
For this reason, I think the penalties for convicted spammers should be far higher than what they are now. Their actions are subverting the ease of use for a very large group of people.
Parent
Re:Blind people? (Score:5, Insightful)
Others are using letters / numbers that after distortion could be a,d,9,g for example.
Personal, I give a site two tries before I give up and dump them.
Parent
Re:Blind people? (Score:5, Informative)
1. Strip links from messages. The spammers are trying to game Google's (and other search engine's) page ranking, and they can't do this if you don't allow them to post links. The incentive to spam your site has now gone.
2. Insert some primitive captcha. In my case this was just a question asking the user to add 2 small numbers together. The reason this step was necessary was because despite implementing step 1, I was still getting a huge amount of automated spam from spam bots which didn't realise there was no point in spamming my site. Once a human spammer realises you've added captcha he'll come and have a look to see how easy it is to circumvent (very easy in my case). However after running a test personally he'll see there's no point and (hopefully) remove you from his list of sites to spam.
Hope that helps anyone reading this...
Parent
Re:Blind people? (Score:5, Informative)
Parent
Re:Blind people? (Score:5, Interesting)
{SNIPPED}
What's the cost of a system that allows a blind person to access text stored electronically on a computer? Pretty-much negligible.
First, creating content is not negligible in cost.
Second, creating an interface to deliver the content is not Negligable in cost.
Third, Actually delivering the content to the masses isn't negligible in cost either.
Fourth, as has been pointed out in other comments and in the article, the problem involves the creation of a technology that will allow your audience to access the content/service you are providing, while simultaneously preventing the use of automated systems to exploit your services by appearing to be your audience (i.e. a Human), because the failure to do so means that you may lose the entire technology, or at the very least render it substantially less useful and more expensive. Email, for example, is only being used 5% of the time as intended, the other 95% being spam (As seen on
If you use HTML as it was designed to be used, there is no additional cost in making it accessible.
if you are using HTML only, the whole captcha debate is meaningless for you. HTML is designed for PUBLISHING information, captcha applies to web based applications that HTML is only a SMALL part of. After all, the only interactive part of HTML are the form elements. Since YOU aren't actually doing anything with the posted form information, YOU have no need for security and little to no need to verify that the entity on the other end of that pipe is a human, spyder, or spambot.
However, some of us do create applications that need to know this, because we want to provide services for actual humans, but do not want to provide another place for spambots to send out their crap.
Parent
Re:Too hard. (Score:5, Interesting)
This Captcha suffers from the same old problem. As Captchas get harder more humans will fail them.
*or annotate... or centre
Parent
Re:Too hard. (Score:5, Insightful)
To be optimistic, I actually like to think of it the other way around:
CAPTCHAs are providing a valuable evolutionary pressure on machine vision/artificial intelligence development!
=Smidge=
Parent
worthless (Score:5, Insightful)
Re:worthless (Score:5, Funny)
Parent
Lyrical Response Mechanism (Score:5, Funny)
Re:Lyrical Response Mechanism (Score:5, Funny)
"Never gonna give you up"...
Parent
It's still trivially crackable. (Score:5, Insightful)
Then there's also the option of paying Warcraft gold farmers to solve captchas and take a break from the game.
Re:It's still trivially crackable. (Score:5, Interesting)
But really, as long as you have an authentication method which is significantly hard/unique, you'll be safe. Spamming is a "low hanging fruit" operation. Quantity over qualify, 90% of the time. In fact, the answer to killing off spambots might very well be everyone designing their own authentication. Right now, there are a half-dozen major ones. Crack one, and you have access to millions of places. If instead there were thousands, the time required to break one would not necessarily be worth the money you could get from doing it.
Our forums are not worth programming the automated bots to crack, so we're 100% spam free now, for the first time in a few years. It's not a hard authentication - just different from 99.9% of the rest of them. Hell, most people could answer "what color is this page", even if they had to look at the raw html and google the color hex. But for one page, it's not worth programming a bot to do. Unique authentication methods will kill spambots.
Parent
Alternative... (Score:5, Informative)
Stupid Captcha (Score:5, Insightful)
At least a part is Ineffective (Score:5, Insightful)
This is no better, and may be worse, than what we have now, for two reasons.
1) If you fill in the gaps programmatically, and then make the image grayscale, you probably have something you can use for image matching.
2) Much more severely: The interface reduces the number of possible answers by multiple orders of magnitude. For the one I saw I think there were 10 or 15 answers. Even if you kick image recognition to the curb and randomly choose an answer, you'll be right 1/15 times. It'd be trivial to write a program to harvest hundreds of accounts in a day by just picking random answers. Hand that off to a botnet or similar, and this becomes a minor speedbump.
~D
Solution: unproven users = limited access (Score:5, Insightful)
If free mail servers put restrictions on what new accounts could do, with an override to anyone who is willing to go to a lot of trouble to prove they are human, it would short-circuit the spammer problem.
If Yahoo, Gmail, etc. all limited you to 10 outgoing mail recipients a day until you had both 1) had the service for 1 day and replied to 10 messages, AND limited you to 100 outgoing mail recipients a day until you signed up to be a "high volume sender," it would cut most spammers off at the knees. Depending on the service, being a "high volume sender" may involve turning over a credit card number and may not be free. Some services may give "loyalty awards" to long-term customers by removing this restriction for people who have had their accounts for 6 months and show a heavy non-spammy ad-revenue-generating usage pattern.
hotcaptcha (Score:5, Interesting)
http://www.hotcaptcha.com/ [hotcaptcha.com]
Re:Twofo Ghey Niggers (Score:5, Funny)
Parent