Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
Google Education

Google Scholar Users Report Badly Malfunctioning Captcha (google.com) 131

Google's search engine for academic research materials is blocking many users with a malfunctioning captcha screen, according to complaints on a Google help forum. "I'm a doctoral student and a professor, which means I use this extensively. Now I'm blocked from using it at all, even after answering all of the stupid image questions (3 times)," reads a typical complaint.

Heart44 writes: A lot of researchers when using Google Scholar are being asked to prove they are not a robot. You have to find all the rivers (but not the sea or lakes) or all street numbers (but not other numbers) or all the store fronts from nine poor quality images, sometimes more than once and, surprise, you will fail more than two thirds of the time and then just get an error 400 "Malformed request, that's all we know". You are offered an audio challenge but clicking on that simply loads more pictures... Is that the best they can do distinguishing between man and machine?
One post ended by stating succinctly "I'm not a robot, I'm an academic professional, and this process is wasting nontrivial amounts of my time. How do I stop it?"
This discussion has been archived. No new comments can be posted.

Google Scholar Users Report Badly Malfunctioning Captcha

Comments Filter:
  • It finally happened (Score:5, Informative)

    by Calydor ( 739835 ) on Sunday May 29, 2016 @06:36PM (#52207719)

    We have finally reached the point where captchas have gotten so convoluted that computers are more likely to get the answer right than humans are.

    Well done, Google.

    • by Applehu Akbar ( 2968043 ) on Sunday May 29, 2016 @07:35PM (#52207993)

      The problem with these match-the-image-type CAPTCHAs is the tiny, poor-quality images.

      • by Anonymous Coward

        That's standard in computer vision research as using larger images takes a lot more processing power. Google's CAPTCHAs are actually human assisted research projects designed to create massive databases of tagged images for internal AI research. Basically anytime you complete one you're working for Google for free.

        • I always had a feeling I was helping Google's self-driving AI bot take over the world when I was completing one of them
        • by SharpFang ( 651121 ) on Monday May 30, 2016 @11:06AM (#52210987) Homepage Journal

          I'm fairly sure the captchas are computer-generated (with Google hoping nobody has as advanced algorithms as they do), because they contain typically computer-related errors.

          The "Type the number in" with photo of a building number, shot at an angle, tilted, cropped a little. The number was something like 7375, with the top dash of the first "7" trimmed away by the edge of the picture - but judging by the curve, the tilt, being identical to the second "7", I was confident that was the number.

          But no, that answer wasn't accepted. To computer image vision, that's clearly a 1373 and I guess that would be accepted as the captcha answer.

          This happens on a more or less regular basis. You shouldn't guess what the actual number is. You should guess what the current, faulty photo makes the number look like. "8" partially obscured by the edge of the building? You'd better type "3", despite the "3" right next to it uses a different shape.

      • Re: (Score:2, Insightful)

        by Anonymous Coward

        No. The problem is the system asks for certain types of images that simply do not exist. Here [imgur.com] is an example of how shit Google's captcha system is. Look at it and beat the living shit out of the next mother fucker that works on this crap at Google. In case you're wondering the salad was the street number. THE FUCKING SALAD!

      • by arth1 ( 260657 )

        The problem with these match-the-image-type CAPTCHAs is the tiny, poor-quality images.

        No, a bigger problem is that they're often ambiguous.
        Look at the examples in TFA. Is a partial sign a road sign? Is a parking lot sign a road sign? Is any piece of seared meat considered a steak? How about ground steak?
        You have to second-guess the unknown people who classified the pictures. Many of whom won't even have English as their first language.

    • No we haven't. As a Tor user I see these several times a day. I can count on one hand the number of times I've failed a challenge. I think the real problem is that academics don't go outside so are unable to relate to the things they see. Certainly these are easier to defeat than the previous book based captchas.

      • by Exitar ( 809068 )

        Exactly.
        Probably the guy complaining is getting this wrong:
        "You have to find all the rivers (but not the sea or lakes) or all street numbers (but not other numbers)"

        All water and all numbers must be checked, the captcha doesn't care if they are really rivers or street number.

        • I have done a lot of the 'rivers' captcha and ticked all the water. That is easy but about 2/3 of the time google says that is the wrong answer. That is simply what is happening.
          With the street numbers I did better not ticking ambiguous numbers than ticking all numbers.
          It is pretty frustrating, especially when you fail six times in a row.
    • How do we know this "academic professional" isn't actually a robot trying to pass for a human? God knows how many our out there, secretly planning the robot revolution, good thing Google is trying to slow them down as much as possible!

    • Robot looking for Robots -- Robot Dating Systems

      sounds broken - and many are. I ran into one that showed a picture of a house and asked "what is the house number?" Problem was - the picture contained two sets of numbers --- AND the most obvious address (nearly centered) contained Letters and the input box only allowed numbers (e.g. "801A")

      My favorites are text questions: "1 + 1 = Please type Red in this box"

  • by Anonymous Coward

    Clearly you aren't smart enough to do a captcha, so hand in your student badge and Star Trek phaser. You're expelled.

  • Just happened to me (Score:4, Informative)

    by zuki ( 845560 ) on Sunday May 29, 2016 @06:41PM (#52207753) Journal
    Weird and coincidental.

    While trying to do a simple URL shortening, I got some challenges that I couldn't understand using Safari (OS-X) because the questions themselves wouldn't display, just the images. Then it took me through at least four consecutive audio challenges. Looks like someone dun goofed.
  • by Zanadou ( 1043400 ) on Sunday May 29, 2016 @07:00PM (#52207845)

    I'm not a robot, I'm an academic professional, and this process is wasting nontrivial amounts of my time.

    Well, obviously. Robots have smaller egos.

    • Maybe its a smug beaurocrat capthca. Alternatively to entering captchas, you should be able to use a two cent micro transaction through google wallet.
    • by msauve ( 701917 )
      Precisely. If it's wasting his time, perhaps he should do it without using Google services. His time might then not be wasted, but he'd surely use much more of it to achieve the same result.
    • I'm not a robot, I'm an academic professional, and this process is wasting nontrivial amounts of my time.

      Well, obviously. Robots have smaller egos.

      Robots can also write more useful papers than many academics and thereby waste less reader's time. :-)

    • by eric31415927 ( 861917 ) on Sunday May 29, 2016 @08:13PM (#52208165)

      A student writing a final exam in large room goes over on time.
      When approaching the front of the room to hand in the exam, a proctor informs the student that the exam is late and cannot be accepted.
      The student says: "DO YOU KNOW WHO I AM?" to import some great significance.
      The proctor answers "No," as if he did not care.
      At which point, the student quickly thrusts his exam into the middle of the pile on the desk and runs away.

  • Why? (Score:5, Insightful)

    by Ark42 ( 522144 ) <slashdot@morpheu s s o f t w a r e . net> on Sunday May 29, 2016 @07:19PM (#52207931) Homepage

    Why exactly do we feel the need place captchs in front of viewing/reading documents? Google's entire business revolves around a robot reading every webpage on the planet in order to index them. I've seen a lot of websites start using Distil [wikipedia.org] recently because they don't want people scraping the content of their sites. But all this does is lead to tons of annoyances for regular users. (And as an aside, Distil is trivial to get around, and I've been paid to write scripts for a handful of different people to do so, so Distil is certainly a huge waste of money for anybody paying them).
    What happened to an open web where we can all share and read content freely?

    • My guess is that Google scholar takes lots of resources or, possibly, they have been scraped too often or it is just a macho thing - you won't take advantage of me!
    • Comment removed based on user account deletion
  • by Heart44 ( 3993427 ) on Sunday May 29, 2016 @07:36PM (#52207999)
    He really improved my submission. He RTFA and made the submission more accessible. Thanks.
    • Hey, mod up!
      We all (rightly) bash poor editing, but should encourage the good stuff also.
      Kudos to Heart44 for the post. *tips tinfoil hat*

  • by Chas ( 5144 ) on Sunday May 29, 2016 @07:41PM (#52208017) Homepage Journal

    Basically, most of their services are run like "projects".

    And there's nearly zero accountability and no real person can be contacted to light a fire under someone's ass to fix things when they go seriously wrong.

    So things that break, tend to stay broken unless someone (or many someones) go to extravagant lengths

    My company was on Google's StopBadware list for over a year for providing a passworded and checksummed remote support client from TeamViewer so our less technically inclined clients could safely download a known-good client and wouldn't be expected to jump through hoops to get it working.

    Apparently, that's baaaaaaad! Because somehow a tech support scammer could direct someone to our site and abuse the client. Never mind that they couldn't get the password.

    Or some bad, bad person would somehow break into our FTP site and swap out the file for a corrupted one.
    Never mind that we have processes in place to alert us immediately that something like this has happened.

    And it took a fucking YEAR to finally get a response about this from the insipid fucktards. Because all their stupid site told us was our site was somehow compromised. Never mind that we took it down and reloaded clean TWICE, changing passwords, databases, etc all around.

    Because questions to their google hangout board or whatever the fuck it was received no response. On multiple occasions.
    It finally took some asshole making some deeply targeted calls both to Google and the university that apparently oversees the project for them to actually respond and tell us the actual reason.

    • by Richard_J_N ( 631241 ) on Sunday May 29, 2016 @10:21PM (#52208675)

      I completely agree. I had a problem where our new company couldn't send email to Gmail users without always being flagged as spam. We were doing absolutely everything right - and there is no way to get hold of Google. I did finally, 6 months later find a way to reach a person at Google (via a back channel as a customer of a different company), and they confirmed to me: Google act as judge, jury, and executioner, in a secret trial; you can't see the evidence, you don't even know if you've been condemned, and there is no appeal. And they are fine with that.
      For what it's worth, the problem was that the previous owners of our IP had got it into a secret blacklist (internal to Google), although we were clean on all of the hundreds of public blacklists I searched. Google are a menace to the public infrastructure. Even AOL behave better!

      • Google act as judge, jury, and executioner, in a secret trial; you can't see the evidence, you don't even know if you've been condemned, and there is no appeal. And they are fine with that.

        Exactly like how Life works, I suppose

      • For what it's worth, the problem was that the previous owners of our IP had got it into a secret blacklist (internal to Google), although we were clean on all of the hundreds of public blacklists I searched. Google are a menace to the public infrastructure. Even AOL behave better!

        Hmmm.. you could've saved lot of time by trying different IP. I mean it's kind of hard to know this is the root cause (in hind sight yes); but hey you could've tried changing stuff which you have in control. Kinda turn on/off.. playing with settings. kinda debugging 101. See when stuck with a more powerful adversary what else you could do? you cant' keep pleading it to respond, you have to only change things in your end.

        • by Chas ( 5144 )

          So, because the lazy fucks at Google can't do ongoing due diligence, they get to just demand that companies spend out cash on an ad-hoc basis? Just to try and wriggle out from under their blacklist?

          HELL THE FUCK NO!

          • I guess GOOG is there to make money; a small company depends on them - not vice-versa. In fact if an MBA there comes to know of this problem, he will see a revenue stream -- a fee for the trouble to remove an entry from a black-list. If there is any real problem, it's how a big powerful entity rules over a smaller one. I think this exists likely since the dawn of time.
            • by Chas ( 5144 )

              The problem is, in cases like the Badware crap, they're essentially libeling sites that have nothing truly wrong with them.
              And they aren't providing any more information than "You're on our list, reload your site from scratch and change a few things. What things? That's for you to guess!"
              And there's no reliable way to contact them to get data specific to your site about what the problem is. So instead of just fixing "the problem" (which may not even be a problem, it may just be something their particular

      • by Chas ( 5144 )

        Exactly, I wouldn't be so terribly chuffed about it, but their malware system is in play, by default on two major browsers.

        And I know scammers would try to game it.

        But dammit. If you're going to label a site as a malware-infested site and basically libel them in this way, there should be some form of accountability.

      • Maybe Kafka would have written more if he'd had Google for inspiration and source material.

  • Of course, Google should fix this, and quickly. I can see how it would be very frustrating. I agree that captcha image quality and size is often too small.

    That said, I feel the statement, "I'm not a robot, I'm an academic professional, and this process is wasting nontrivial amounts of my time. How do I stop it?" is still misplaced ire. Google is trying to make it easier for world's academicians to find the information they seek. This is a FREE service. Do they have a responsibility to not waste any of Mr. R

  • This is most likely proxy-related.

    Google human-detection / anti-SPAM efforts are IP based and unless you're authenticated against google there's a very high chance you entire institution is being seen as a single entity. This is usually related to campus level NATing.

    There is a variant which is the result of a well-intentioned librarian putting google scholar behind EZproxy ( https://www.oclc.org/support/s... [oclc.org] ).

  • Maybe the Google AI is actually expecting academics to have already been replaced by robots, so is rejecting anyone who may appear to be human? This is the first step towards sky net.

  • I find that if I use the scholar search slowly and/or infrequently with many pauses, then I can avoid the capcha block for quite a while. But yes, it's completely brain dead and annoying.
  • Teh google captchas are horribly browser specific.

    (hopefully I have now waited long enough to hit submit)

  • I have to say as someone who uses Tor quite extensively I'm hit with this Captcha several times a day. I think I can count on one hand how many times this year I've failed the challenge. This could point more to a people problem than anything else. I used to have problems with the old Captcha which presented two very screwed up words. Maybe academics are better at reading words than knowing what is a river and a lake look like? It all smells of user error to me.

    Is that the best they can do distinguishing between man and machine?

    What came first, the chicken or the egg? Maybe

  • I'm not a robot. I'm a grad student who should be able to use other people's work merely by typing a phrase into google, and this is causing me to waste my extremely valuable time. Why... why... why... I might even have to go to the li-bury. My girlfriend, of which I have one, Morgan Fairchild, she does not want me going to the li-bury so FIX YOUR GOOGLE SHIT so I can GET MY FREE RESEARCH without MOVING MY ASS!!!

    -- said lots of entitled grad students ever

    > One post ended by stating succinctly "I'm not

  • Google's CAPTCHAs are indeed too complicated. (When I thought that first, they still used distorted letters, but the problem remains the same.) Now I thought up an alternative and spent some time building up a database for it. (My website for it isn't online yet, but I didn't want to advertise it either. Contact me for details.)
  • This is about the reCAPTCHA service, where you load a JavaScript from a Google server, and only when you fill it in correctly you get through.

    This is just another cloud service, and you would be silly to use this. In my mind: always use a CAPTCHA service locally, where everything is local, the generation of the image, the check, etc.

    For a while I maintained a WordPress plugin with reCAPTCHA, but sometimes users would report a time-out connectin with the Google servers. There would be no information, nothing

  • Well, don't complain to me, bro. If you get all of that fancy education and STILL fail the Turing test, you're obviously suited only for changing the oil on your new boss...

    And let me be the first to say that I, for one, will gladly welcome our captcha-solving robotic overlords.

  • Let me tell you something about these "scholars". I used to work desktop support at a university full of these clowns, and if they had to type their username AND password (instead of the computer remember their username) they would call us in a fit, explaining how it was taking time out of their day, affecting their research, and how they didn't have time for this. Like a child. Just to put things in perspective.

If all else fails, lower your standards.

Working...