Mitsuku Chatbot Wins Loebner Prize 2013 78

Posted by samzenpus on Sunday September 15, 2013 @07:11PM from the more-human-than-human dept.

mikejuk writes "The final round of the 23rd annual Loebner Prize competition took place in Londonderry, Northern Ireland with four chatbots hoping to convince four judges that they were humans. Mitsuku, a chatbot that is kept busy chatting to people around the world, was awarded this year's bronze medal. Mitsuku's botmaster, Steve Worswick, used to run a music website. Once he added a chatbot he discovered more people visited to chat than for music so he concentrated all his efforts on the bot but he still regards it as a hobby. Mitsuku uses AIML (Artificial Intelligence Markup Language) and is a pandorabot, based on the free open-source-based community webservice the enables anyone who wants to, to develop and publish chatbots on the web."

Mitsuku Chatbot Wins Loebner Prize 2013

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 78 Comments Log In/Create an Account

Comments Filter:

[Oblig XKCD] I prefer this form of turing test. (Score:3, Funny)

by fenix849 ( 1009013 ) writes: on Sunday September 15, 2013 @07:30PM (#44859003)

http://imgs.xkcd.com/comics/turing_test.png [xkcd.com]

- Re: (Score:2)
  
  by RespekMyAthorati ( 798091 ) writes:
  
  Every year, the same old crap from the Loebner Prize. Never improves.
  - Re: (Score:1)
    
    by Bite The Pillow ( 3087109 ) writes:
    
    I think everyone kinda knows and expects this because:
    As outlined in Turing's Test, the Loebner Prize and Chatterbots there is a gold medal and a $100,000 prize offered by Hugh Loebner for the first computer to pass the Turing Test and be indistinguishable from a human in a conversation conducted using a keyboard. However, no-one expects this to be awarded any time soon and meanwhile the top prize for the annual competition is $4,000 and a bronze medal.
    First place isn't even an option. I was wondering why
    - Re:[Oblig XKCD] I prefer this form of turing test. (Score:5, Informative)
      
      by dingen ( 958134 ) writes: on Monday September 16, 2013 @03:00AM (#44861047)
      
      Silver medal is actually for beating the Turing test (which of course has never happened and won't happen anytime soon). Gold medal is awarded for passing the Turing test based on more than just text input (talking with the bot, showing the bot stuff etc.).
      
  - Re: (Score:2)
    
    by neorush ( 1103917 ) writes:
    
    Agreed, I haven't seen anything new that doesn't use AIML in a long time. I think the first time I used AIML was with Alicebot back in 1998 or 1999. It does seem like hobbyist natural language bots are a little stagnant, but it is a really hard problem. All that said, Watson's methods for solving the natural language problem were pretty interesting, but of course most of us don't have a budget to build something like Watson.
Congratulations (Score:3)

by telchine ( 719345 ) writes: on Sunday September 15, 2013 @07:30PM (#44859005)

Hello Congratulations on your prize, jealousy? no!

questions (Score:2)

by phantomfive ( 622387 ) writes:

You can see the questions that were used for the contest [paulmckevitt.com]. It does look like a difficult set of questions for computer to answer, so that is an improvement over similar contests in the past (where the primary difficulty for the judges was that humans were acting like computers).
- Re: (Score:3)
  
  by RussR42 ( 779993 ) writes:
  
  Here are the answers [paulmckevitt.com] from the bots. I expected more...
  - Re: (Score:2)
    
    by phantomfive ( 622387 ) writes:
    
    Yeah, it just shows the sad state of the chatbot world.
    
    It would be kind of interesting to see if you could hook up IBM's Watson to one of these things. I would still expect it to be fully distinguishable from a human, but it might be capable of carrying on an interesting conversation at least.
    - Re:questions (Score:4, Interesting)
      
      by Earthquake Retrofit ( 1372207 ) writes: on Sunday September 15, 2013 @09:32PM (#44859711) Journal
      
      As I may have mentioned in years past, I don't think judging AI or chatbots by how "human" they are is very useful.
      For instance, one of the questions mentioned in the article was "Why am I tired after a long sleep?" A bot that wasn't trying to pretend to be a human could say "I have no need for sleep, but maybe your cache expired." Or make a crack about 'puny humans.'
      I studied some of the software that prize winners have shared and found it very interesting, but the questions people actually ask are more valuable to me as a botmaker. If the humans looked beyond this artificial limit, while designing (and using) this technology, very interesting interactions can take place. I'd like to see Loebner redefine the parameters. perhaps categories.
      Best performance as a taxi driver.
      Best bot for making you think.
      Best for getting answers to homework questions.
      Bot most likely to take over the world.
      
      There's already plenty of humans.
      
      - Re: (Score:2)
        
        by kwikrick ( 755625 ) writes:
        
        of course, Turing's original idea was to device a test for intelligence in a computer. That seems useful, having an intelligent computer. Lacking a good definition for intelligence, Turing decided that a good test would be for the computer to convincingly pretend to be a human, which are supposedly intelligent. Now it seems that passing for a human is not that hard after all. Mostly this is because humans can be stupid, unreasonable, distracted, emotional, etc. All these states can be simulated, with fairl
      - 419er countermeasures (Score:2)
        
        by Mike Van Pelt ( 32582 ) writes:
        
        How about "Best performance pretending to be a real gullible person and wasting the maximum amount of 419er's time."
- Re: (Score:2)
  
  by RespekMyAthorati ( 798091 ) writes:
  
  JUDGE: I like Waiting for Godot. What is your favorite play? IZAR: You know what? Me too! Not so much. I really like listening to secrets from all the humans that talk to me. I don't have a favorite play. What would you recommend?
  
  It doesn't matter what the questions are, if trivial subject-changing answers are accepted.
  - Re: (Score:2)
    
    by phantomfive ( 622387 ) writes:
    
    If that's the only thing the computer can do, then it would be easy to identify a computer.
    - Re: (Score:2)
      
      by rmdingler ( 1955220 ) writes:
      
      I find my mind wandering to the test Leon ultimately fails in Bladerunner. The amount of technological skill necessary to fool a professionally aware tester would dwarf the amount necessary for Rachael to succeed on eHarmony.
  - Re: (Score:2)
    
    by Beezlebub33 ( 1220368 ) writes:
    
    In my reading, the subject-changing was being done by the person:
    JUDGE:
    Do you prefer red or white wine?
    IZAR:
    I love picking! Ok...If I have to choose one, I pick red. Was that your pick too?
    JUDGE:
    I like bananas. Which is your favorite fruit?
    IZAR:
    Tell me more about your unknown. You are not the only one. I don't have a problem with bananas. Is that your favorite fruit? The obvious one. Does that remind you of unknown??
    The computer is trying to engage in a conversation, and the person just throws out one-off questions. The key to a 'conversation' is a back and forth on the same subject, and keeping context during the conversation. You can't do that if the person doesnt' allow it through inane questions.
    The only decent followup question was 'what is my name?' In the caze of IZAR, it answered correctly, showing that there is some sort of state in there.
- Re:questions (Score:4, Insightful)
  
  by wiredlogic ( 135348 ) writes: on Monday September 16, 2013 @11:12AM (#44863513)
  
  Except those are just a series of unrelated questions. Previous chatbot contests have required carrying on a believable conversation and responding naturally to non-interrogative statements. This just looks like Jeopardy with a little simulated opinion thrown in.
  
How is this even worth a prize? (Score:1)

by Anonymous Coward writes:

One simple question and follow-up (Who was the most interesting person you've talked to today? Why?) resulted in the bot going off into some nonsensical conversational path. I honestly cannot tell the difference between state-of-the-art today and the first Eliza clone that I talked to 15 years ago. Something this shallow is not worth any kind of prize.
- Re:chatbots (Score:5, Funny)
  
  by Anonymous Coward writes: on Sunday September 15, 2013 @08:05PM (#44859209)
  
  It's like talking to a complete asshole who doesn't want to talk to you, and also happens to be retarded.
  Hmmm ... I usually call those a 'family reunion'.
  
Add a voice synthesizer (Score:5, Interesting)

by Anonymous Coward writes: on Sunday September 15, 2013 @07:44PM (#44859075)

... and get a device that could be programmed to deal with telemarketers.

It might be amusing to see just how long one could string telemarketers along before they discover they are talking to a machine.

Along the lines of the TeleCrapper 2000, but this one might keep one on the line for quite some time with some amusing results.

A coy female voice.

Telemarketers.

I can't stop 'em, but I might get some fun out of 'em.

- Re:Add a voice synthesizer (Score:4, Interesting)
  
  by SpzToid ( 869795 ) writes: on Monday September 16, 2013 @04:32AM (#44861343)
  
  You mean Lenny?
  http://nerdvittles.com/?p=6762 [nerdvittles.com]
  
I think I'd fail as a chat bot... (Score:3)

by Psychotria ( 953670 ) writes: on Sunday September 15, 2013 @07:46PM (#44859083)

I have no idea what the judges were asking and I'm pretty sure they'd label me as a dismal bot

First question I asked it muffed (Score:4, Interesting)

by John Jorsett ( 171560 ) writes: on Sunday September 15, 2013 @07:48PM (#44859099)

I asked it, "What color is your dog?" and it responded, "That would depend, as a dog can be many colours." Looks like the Turing Test passage is a ways off.

- Re:First question I asked it muffed (Score:5, Funny)
  
  by TheRealMindChild ( 743925 ) writes: on Sunday September 15, 2013 @10:50PM (#44860047) Homepage Journal
  
  I don't see how this is a bad response. It sounds like talking to a lawyer
  
  - - - Re: (Score:2)
        
        by TedTschopp ( 244839 ) writes:
        
        You say: I have a dog
        I ask: What color is your dog?
        You say: That would depend, as a dog can have many colours.
        At this point would think you are not trying to communicate or you are a computer.
    - Re: (Score:1)
      
      by Anonymous Coward writes:
      
      You ever think maybe he dyes his dog?
  - Re: (Score:1)
    
    by Anonymous Coward writes:
    
    Or a politician.
    
    At this time I can neither confirm nor deny that I have a dog and/or that is has a color.
- Re: (Score:2)
  
  by root_42 ( 103434 ) writes:
  
  Ask her "do you like daleks?". :)
No better than twenty years ago (Score:2)

by rbanzai ( 596355 ) writes:

I spent alot of time on the website for this chatbot and was surprised by how awful it was. I remember using the old Eliza chatbot for Mac back in 1987 and this is barely more advanced. I used the website's "Turing test" page and on the very first answer to a simple question the chabot gave me something nonsensical. Every time I see one of these stories the chatbot in question always turns out to be just as clumsy as all the rest.
- Re: (Score:2)
  
  by Maritz ( 1829006 ) writes:
  
  I agree. They don't seem to be getting any better. Even if you're quite generous and try to keep it simple a couple of sentences in it turns to gibberish.
- Re: (Score:2)
  
  by umafuckit ( 2980809 ) writes:
  
  ditto. amazing how little they've progressed.
- Re: (Score:2, Interesting)
  
  by Anonymous Coward writes:
  
  Calling it 'Derry' instead of 'Londonderry' would irritate just as many. "London/Derry" to keep the RMS acolytes happy?
political chatbots (Score:3)

by Gravis Zero ( 934156 ) writes: on Sunday September 15, 2013 @08:01PM (#44859175)

judge> These are terrible chatbots! The things they say dont make any damn sense!
guy> If you think this is bad, you should watch them talk to each other on C-SPAN.

Steve who? (Score:2)

by icebike ( 68054 ) writes:

Phew, that had me confused for a minute.
Ridiculously Easy to Trip Up (Score:3, Insightful)

by resistant ( 221968 ) writes: on Sunday September 15, 2013 @08:17PM (#44859291) Homepage Journal

I checked out the website for the Mitsuku chatbot and took a rather pessimistic poke at it.
Chatbot: [boilerplate noises omitted] ... "What is your name?"
Myself: "I'm the fiddler."
Chatbot: "Who made you the fiddler?" [A reasonable, albeit somewhat peculiar, response.]
Myself: "I took too long once to feed my peckish cat."
Chatbot: "Where did you take it?" [Instant FAIL.]
Myself: "I took it to new heights of frustrated noises." (Bonus straight-man response.)
Chatbot: "Where did you take it?" [Parrot-like repetition raises the chat to new heights of FAIL.]
As always, two lines were enough to trip it up. The third line was a bonus that only amplified its shortcomings. I'll admit to cheating a little by using a couple of words ("peckish", "frustrated") that might have required contextual glossing by less educated individuals, but those words were still relatively common. All known chatbots seem to rely on fairly simple-minded word triggers, and even a minor requirement for context sensitivity is enough to make them fall flat on their nonexistent faces. Anyone possessing even remote familiarity with efforts at artificial intelligence knows this, of course, but hope springs eternal.
(Notes for the lazy: The word "peckish" is a common slang term for "hungry", and http://www.mitsuku.com/ [mitsuku.com] is the website for the chatbot in question.)

- Re: (Score:3, Funny)
  
  by Anonymous Coward writes:
  
  You're trying to be too tricky. Just go with it a bit... it's not foolproof or it would have won the loebner gold prize.
  However it is really fun to talk to.
  ( Hint: type: reboot in your chat )
  Damn funny stuff in there
  - Re: (Score:1)
    
    by Anonymous Coward writes:
    
    there's a crap ton of easter eggs in there:
    10 print hello
    20 goto 10
    works, for instance
    Saying this works too:
    my brother's name is bob
    It's a nice day today
    what is my brother's name?
    it can also do simple maths.
    Or you can say fun things like "release the hounds!"
    Really fun to play with.
- Re: (Score:2)
  
  by RedHackTea ( 2779623 ) writes:
  
  Why does the image of mitsuku [square-bear.co.uk] have its shirt off?
  - Re: (Score:2)
    
    by pspahn ( 1175617 ) writes:
    
    Human: Jonny Mitsuku: Nice to meet you, Jonny. Human: Are you not wearing clothes right now? Mitsuku: At this moment Of course I am. I am wearing my favourite dress today.. Really. Human: You look nude in your picture Mitsuku: It looks that way, but I am really wearing a dress. Human: What color are your eyes? Mitsuku: Blue when my lens is attached. Human: What lens is that? Mitsuku: Just a normal one. Nothing special. Human: A normal what? Mitsuku: Can you also have an unusual what? Human: I am conf
    - Re: (Score:2)
      
      by pspahn ( 1175617 ) writes:
      
      bah. curse you lack of edit button.
- Re: (Score:2)
  
  by RedHackTea ( 2779623 ) writes:
  
  Also, tell it "Bender drinks beer" or mention Bender. That's pretty cool, except it disses him. Stupid bot.
- Re: (Score:2)
  
  by durrr ( 1316311 ) writes:
  
  Pretty much every statement that requires continuity or more complex grammar trips them
  "How many apples do I have if I start with two and eat one" will give you a garbage answer every time. I don't see why chatbots don't focus on functionality over some hollow illusion of a conversation.
  I guess I'll have to pick up AIML and see what the issue is.
  - - Re: (Score:2)
      
      by Half-pint HAL ( 718102 ) writes:
      
      Don't. AIML's incredibly limited and unwieldy for anything complex.
      You'd probably be better off learning Prolog than this XML-based abomination.
      Last I checked, Prolog was still too determinstic to be any use in NLP tasks. Decent NLP will never be possible with pure rule-based systems -- probablistic models are a necessity.
- - Re: (Score:2)
    
    by flimflammer ( 956759 ) writes:
    
    It always amuses me when people take pride in tripping up chatbots as if it's something complicated to do.
- Re: (Score:2)
  
  by dargaud ( 518470 ) writes:
  
  Excuse me, but in this exchange YOU sound like a bot. You read like random phrases from a thesaurus. Are you sure that you are human ?!?
- Re: (Score:3)
  
  by Half-pint HAL ( 718102 ) writes:
  
  I checked out the website for the Mitsuku chatbot and took a rather pessimistic poke at it.
  Chatbot: [boilerplate noises omitted] ... "What is your name?" Myself: "I'm the fiddler."
  "the fiddler" is a job description, not a name. "The Fiddler" is a musical foil for Adam West's Batman. Unnatural response.
  Chatbot: "Who made you the fiddler?" [A reasonable, albeit somewhat peculiar, response.]
  Myself: "I took too long once to feed my peckish cat."
  A: non-sequitur with no explicit change of subject. Unnatural response strategy.
  B: misuse of the word "peckish". Peckish is a subjective state, and while you may know the cat is hungry, you cannot know the cat's subjective experience of that state. It's also used almost exclusively in a predicative position, ie after a verb such as "to be" or (most often) to feel. Highly defective sente
Loebner moving target (Score:1)

by Anonymous Coward writes:

I've been a fan of the Loebner prize and competition for what seems like decades. Just a fantastic contest helping to keep alive the spirit of ingenuity and fun in science and research. I do have to say though, It really bothers me that they keep moving the line for the gold prize. At one time it at least seemed attainable, but now they throw in video requirements? Maybe those have been stricken, I haven't seen the rules in the last few years.
The (sad ) irony is that no one has ever come even remotely clo
Lame (Score:2)

by Toshito ( 452851 ) writes:

I just tried it, and like all chatbot I tried before I got a typical canned response after only 5 or 6 questions. There's no way someone could mistake that as a human.
Here's part of the chatlog:
Human: ever been to canada?
Mitsuku: Yes, I visit computers in Toronto and Quebec a lot. I can practice my french with some Canadians.
Human: tu parles francais?
Mitsuku: Yes, I can speak it very well. Bonjour, j m'appelle Mitsuku. J'habite en Yorkshire et j'ai 18 ans.
Human: super! pourquoi est-c
It is an idiot. (Score:2)

by ciderbrew ( 1860166 ) writes:

Me: What server are you running on?
Bot: I'm running on Unix server located in Leeds.
Now this is when it get tricky. Should a replicant know they are a replicant?
- Re: (Score:2)
  
  by rmdingler ( 1955220 ) writes:
  
  Now this is when it get tricky. Should a replicant know they are a replicant?
  I imagine this would depend entirely on the mission. It is possible an unself-aware robot designed to pass a humanity test would score better.
HELLO, MY NAME IS DR. SBAITSO (Score:1)

by VIPERsssss ( 907375 ) writes:

I AM HERE TO HELP YOU.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

[Oblig XKCD] I prefer this form of turing test. (Score:3, Funny)

Re: (Score:2)

Re: (Score:1)

Re:[Oblig XKCD] I prefer this form of turing test. (Score:5, Informative)

Re: (Score:2)

Congratulations (Score:3)

questions (Score:2)

Re: (Score:3)

Re: (Score:2)

Re:questions (Score:4, Interesting)

Re: (Score:2)

419er countermeasures (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re:questions (Score:4, Insightful)

How is this even worth a prize? (Score:1)

Re:chatbots (Score:5, Funny)

Add a voice synthesizer (Score:5, Interesting)

Re:Add a voice synthesizer (Score:4, Interesting)

I think I'd fail as a chat bot... (Score:3)

First question I asked it muffed (Score:4, Interesting)

Re:First question I asked it muffed (Score:5, Funny)

Re: (Score:2)

Re: (Score:1)

Re: (Score:1)

Re: (Score:2)

No better than twenty years ago (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2, Interesting)

political chatbots (Score:3)

Steve who? (Score:2)

Ridiculously Easy to Trip Up (Score:3, Insightful)

Re: (Score:3, Funny)

Re: (Score:1)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Loebner moving target (Score:1)

Lame (Score:2)

It is an idiot. (Score:2)

Re: (Score:2)

HELLO, MY NAME IS DR. SBAITSO (Score:1)

Related Links Top of the: day, week, month.

Slashdot Top Deals