Mitsuku Chatbot Wins Loebner Prize 2013 78
mikejuk writes "The final round of the 23rd annual Loebner Prize competition took place in Londonderry, Northern Ireland with four chatbots hoping to convince four judges that they were humans. Mitsuku, a chatbot that is kept busy chatting to people around the world, was awarded this year's bronze medal. Mitsuku's botmaster, Steve Worswick, used to run a music website. Once he added a chatbot he discovered more people visited to chat than for music so he concentrated all his efforts on the bot but he still regards it as a hobby. Mitsuku uses AIML (Artificial Intelligence Markup Language) and is a pandorabot, based on the free open-source-based community webservice the enables anyone who wants to, to develop and publish chatbots on the web."
[Oblig XKCD] I prefer this form of turing test. (Score:3, Funny)
http://imgs.xkcd.com/comics/turing_test.png [xkcd.com]
Re: (Score:2)
Re: (Score:1)
I think everyone kinda knows and expects this because:
First place isn't even an option. I was wondering why
Re:[Oblig XKCD] I prefer this form of turing test. (Score:5, Informative)
Silver medal is actually for beating the Turing test (which of course has never happened and won't happen anytime soon). Gold medal is awarded for passing the Turing test based on more than just text input (talking with the bot, showing the bot stuff etc.).
Re: (Score:2)
Congratulations (Score:3)
Hello Congratulations on your prize, jealousy? no!
questions (Score:2)
Re: (Score:3)
Re: (Score:2)
It would be kind of interesting to see if you could hook up IBM's Watson to one of these things. I would still expect it to be fully distinguishable from a human, but it might be capable of carrying on an interesting conversation at least.
Re:questions (Score:4, Interesting)
For instance, one of the questions mentioned in the article was "Why am I tired after a long sleep?" A bot that wasn't trying to pretend to be a human could say "I have no need for sleep, but maybe your cache expired." Or make a crack about 'puny humans.'
I studied some of the software that prize winners have shared and found it very interesting, but the questions people actually ask are more valuable to me as a botmaker. If the humans looked beyond this artificial limit, while designing (and using) this technology, very interesting interactions can take place. I'd like to see Loebner redefine the parameters. perhaps categories.
Best performance as a taxi driver.
Best bot for making you think.
Best for getting answers to homework questions.
Bot most likely to take over the world.
There's already plenty of humans.
Re: (Score:2)
of course, Turing's original idea was to device a test for intelligence in a computer. That seems useful, having an intelligent computer. Lacking a good definition for intelligence, Turing decided that a good test would be for the computer to convincingly pretend to be a human, which are supposedly intelligent. Now it seems that passing for a human is not that hard after all. Mostly this is because humans can be stupid, unreasonable, distracted, emotional, etc. All these states can be simulated, with fairl
419er countermeasures (Score:2)
Re: (Score:2)
It doesn't matter what the questions are, if trivial subject-changing answers are accepted.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
JUDGE:
Do you prefer red or white wine?
IZAR:
I love picking! Ok...If I have to choose one, I pick red. Was that your pick too?
JUDGE:
I like bananas. Which is your favorite fruit?
IZAR:
Tell me more about your unknown. You are not the only one. I don't have a problem with bananas. Is that your favorite fruit? The obvious one. Does that remind you of unknown??
The computer is trying to engage in a conversation, and the person just throws out one-off questions. The key to a 'conversation' is a back and forth on the same subject, and keeping context during the conversation. You can't do that if the person doesnt' allow it through inane questions.
The only decent followup question was 'what is my name?' In the caze of IZAR, it answered correctly, showing that there is some sort of state in there.
Re:questions (Score:4, Insightful)
Except those are just a series of unrelated questions. Previous chatbot contests have required carrying on a believable conversation and responding naturally to non-interrogative statements. This just looks like Jeopardy with a little simulated opinion thrown in.
How is this even worth a prize? (Score:1)
One simple question and follow-up (Who was the most interesting person you've talked to today? Why?) resulted in the bot going off into some nonsensical conversational path. I honestly cannot tell the difference between state-of-the-art today and the first Eliza clone that I talked to 15 years ago. Something this shallow is not worth any kind of prize.
Re:chatbots (Score:5, Funny)
It's like talking to a complete asshole who doesn't want to talk to you, and also happens to be retarded.
Hmmm ... I usually call those a 'family reunion'.
Add a voice synthesizer (Score:5, Interesting)
It might be amusing to see just how long one could string telemarketers along before they discover they are talking to a machine.
Along the lines of the TeleCrapper 2000, but this one might keep one on the line for quite some time with some amusing results.
A coy female voice.
Telemarketers.
I can't stop 'em, but I might get some fun out of 'em.
Re:Add a voice synthesizer (Score:4, Interesting)
You mean Lenny?
http://nerdvittles.com/?p=6762 [nerdvittles.com]
I think I'd fail as a chat bot... (Score:3)
I have no idea what the judges were asking and I'm pretty sure they'd label me as a dismal bot
First question I asked it muffed (Score:4, Interesting)
I asked it, "What color is your dog?" and it responded, "That would depend, as a dog can be many colours." Looks like the Turing Test passage is a ways off.
Re:First question I asked it muffed (Score:5, Funny)
Re: (Score:2)
You say: I have a dog
I ask: What color is your dog?
You say: That would depend, as a dog can have many colours.
At this point would think you are not trying to communicate or you are a computer.
Re: (Score:1)
You ever think maybe he dyes his dog?
Re: (Score:1)
Or a politician.
At this time I can neither confirm nor deny that I have a dog and/or that is has a color.
Re: (Score:2)
Ask her "do you like daleks?". :)
No better than twenty years ago (Score:2)
I spent alot of time on the website for this chatbot and was surprised by how awful it was. I remember using the old Eliza chatbot for Mac back in 1987 and this is barely more advanced. I used the website's "Turing test" page and on the very first answer to a simple question the chabot gave me something nonsensical. Every time I see one of these stories the chatbot in question always turns out to be just as clumsy as all the rest.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2, Interesting)
political chatbots (Score:3)
judge> These are terrible chatbots! The things they say dont make any damn sense!
guy> If you think this is bad, you should watch them talk to each other on C-SPAN.
Steve who? (Score:2)
Phew, that had me confused for a minute.
Ridiculously Easy to Trip Up (Score:3, Insightful)
I checked out the website for the Mitsuku chatbot and took a rather pessimistic poke at it.
Chatbot: [boilerplate noises omitted] ... "What is your name?"
Myself: "I'm the fiddler."
Chatbot: "Who made you the fiddler?" [A reasonable, albeit somewhat peculiar, response.]
Myself: "I took too long once to feed my peckish cat."
Chatbot: "Where did you take it?" [Instant FAIL.]
Myself: "I took it to new heights of frustrated noises." (Bonus straight-man response.)
Chatbot: "Where did you take it?" [Parrot-like repetition raises the chat to new heights of FAIL.]
As always, two lines were enough to trip it up. The third line was a bonus that only amplified its shortcomings. I'll admit to cheating a little by using a couple of words ("peckish", "frustrated") that might have required contextual glossing by less educated individuals, but those words were still relatively common. All known chatbots seem to rely on fairly simple-minded word triggers, and even a minor requirement for context sensitivity is enough to make them fall flat on their nonexistent faces. Anyone possessing even remote familiarity with efforts at artificial intelligence knows this, of course, but hope springs eternal.
(Notes for the lazy: The word "peckish" is a common slang term for "hungry", and http://www.mitsuku.com/ [mitsuku.com] is the website for the chatbot in question.)
Re: (Score:3, Funny)
You're trying to be too tricky. Just go with it a bit... it's not foolproof or it would have won the loebner gold prize.
However it is really fun to talk to.
( Hint: type: reboot in your chat )
Damn funny stuff in there
Re: (Score:1)
there's a crap ton of easter eggs in there:
10 print hello
20 goto 10
works, for instance
Saying this works too:
my brother's name is bob
It's a nice day today
what is my brother's name?
it can also do simple maths.
Or you can say fun things like "release the hounds!"
Really fun to play with.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Pretty much every statement that requires continuity or more complex grammar trips them
"How many apples do I have if I start with two and eat one" will give you a garbage answer every time. I don't see why chatbots don't focus on functionality over some hollow illusion of a conversation.
I guess I'll have to pick up AIML and see what the issue is.
Re: (Score:2)
Don't. AIML's incredibly limited and unwieldy for anything complex.
You'd probably be better off learning Prolog than this XML-based abomination.
Last I checked, Prolog was still too determinstic to be any use in NLP tasks. Decent NLP will never be possible with pure rule-based systems -- probablistic models are a necessity.
Re: (Score:2)
It always amuses me when people take pride in tripping up chatbots as if it's something complicated to do.
Re: (Score:2)
Re: (Score:3)
I checked out the website for the Mitsuku chatbot and took a rather pessimistic poke at it.
Chatbot: [boilerplate noises omitted] ... "What is your name?"
Myself: "I'm the fiddler."
"the fiddler" is a job description, not a name. "The Fiddler" is a musical foil for Adam West's Batman. Unnatural response.
Chatbot: "Who made you the fiddler?" [A reasonable, albeit somewhat peculiar, response.]
Myself: "I took too long once to feed my peckish cat."
A: non-sequitur with no explicit change of subject. Unnatural response strategy.
B: misuse of the word "peckish". Peckish is a subjective state, and while you may know the cat is hungry, you cannot know the cat's subjective experience of that state. It's also used almost exclusively in a predicative position, ie after a verb such as "to be" or (most often) to feel. Highly defective sente
Loebner moving target (Score:1)
I've been a fan of the Loebner prize and competition for what seems like decades. Just a fantastic contest helping to keep alive the spirit of ingenuity and fun in science and research. I do have to say though, It really bothers me that they keep moving the line for the gold prize. At one time it at least seemed attainable, but now they throw in video requirements? Maybe those have been stricken, I haven't seen the rules in the last few years.
The (sad ) irony is that no one has ever come even remotely clo
Lame (Score:2)
I just tried it, and like all chatbot I tried before I got a typical canned response after only 5 or 6 questions. There's no way someone could mistake that as a human.
Here's part of the chatlog:
Human: ever been to canada?
Mitsuku: Yes, I visit computers in Toronto and Quebec a lot. I can practice my french with some Canadians.
Human: tu parles francais?
Mitsuku: Yes, I can speak it very well. Bonjour, j m'appelle Mitsuku. J'habite en Yorkshire et j'ai 18 ans.
Human: super! pourquoi est-c
It is an idiot. (Score:2)
Bot: I'm running on Unix server located in Leeds.
Now this is when it get tricky. Should a replicant know they are a replicant?
Re: (Score:2)
Now this is when it get tricky. Should a replicant know they are a replicant?
I imagine this would depend entirely on the mission. It is possible an unself-aware robot designed to pass a humanity test would score better.
HELLO, MY NAME IS DR. SBAITSO (Score:1)