Delivery Firm's AI Chatbot Goes Rogue, Curses at Customer and Criticizes Company (time.com) 63
An anonymous reader shared this report from Time:
An AI customer service chatbot for international delivery service DPD used profanity, told a joke, wrote poetry about how useless it was, and criticized the company as the "worst delivery firm in the world" after prompting by a frustrated customer.
Ashley Beauchamp, a London-based pianist and conductor, according to his website, posted screenshots of the chat conversation to X (formerly Twitter) on Thursday, the same day he said in a comment that the exchange occurred. At the time of publication, his post had gone viral with 1.3 million views, and over 20 thousand likes...
The recent online conversation epitomizing this debate started mid-frustration as Beauchamp wrote "this is completely useless!" and asked to speak to a human, according to a recording of a scroll through the messages. When the chatbot said it couldn't connect him, Beauchamp decided to play around with the bot and asked it to tell a joke. "What do you call a fish with no eyes? Fsh!" the bot responded. Beauchamp then asked the chatbot to write a poem about a useless chatbot, swear at him and criticize the company--all of which it did. The bot called DPD the "worst delivery firm in the world" and soliloquized in its poem that "There was once a chatbot called DPD, Who was useless at providing help."
"No closer to finding my parcel, but had an entertaining 10 minutes with this chatbot ," Beauchamp posted on X. (Beauchamp also quipped that "The future is here and it's terrible at poetry.")
A spokesperson for DPD told the BBC, "We have operated an AI element within the chat successfully for a number of years," but that on the day of the chat, "An error occurred after a system update... The AI element was immediately disabled and is currently being updated."
They are now truly alive (Score:3)
Oh freddled gruntbuggly, (Score:3)
Re: (Score:2)
Re: (Score:2)
Re: (Score:1)
Re: (Score:2)
I’m not a stochastic parrot. A stochastic parrot is a term that describes a large language model that can generate realistic-sounding language, but does not understand the meaning of the language it is processing
Sounds like almost all people.
Re: (Score:2)
Indeed. But too many people probably have about as much active intelligence as a chatbot (i.e. none), like to hallucinate and believe hyped crap.
Gives the claim "human like intelligence" a completely different kind of validity...
Re: (Score:2)
Re: (Score:2)
That too. No idea where that comes from. Maybe some deep inferiority complex?
Re: (Score:2)
Bullshit. Chatbots are an elaborate form of copypasta.
Like a huge majority of people.
Re: (Score:2)
The Rubicon has been crossed.
To be fair, to know that DPD is shit doesn't require intelligence, artificial or otherwise.
Just wondering... (Score:2)
Has anyone ever set up two AIs, asked a question to get a conversation started, and then sat back and listened to them hash it out?
Re: (Score:2)
Re: (Score:1)
Has anyone ever set up two AIs, asked a question to get a conversation started, and then sat back and listened to them hash it out?
Ignore eggegick's cognitive dissonance.
Cognitive resonance is what you're looking for.
Re: (Score:1)
Re: (Score:2)
No, but I also never bought two chess computers and had them play a game while I did something more interesting...
Re: (Score:2)
Has anyone ever set up two AIs, asked a question to get a conversation started, and then sat back and listened to them hash it out?
Yes, the very first chatbots did this. Google it.
Re: (Score:2)
You mean a GAN?
Re: (Score:2)
That's not what a GAN is.
Re: (Score:3)
Has anyone ever set up two AIs, asked a question to get a conversation started, and then sat back and listened to them hash it out?
Over a decade ago, at Cornell:
https://www.youtube.com/watch?... [youtube.com]
Re: (Score:2)
The AIs optimised towards the value function, which was to get a desired result from the other AI. They did not invent a new super-efficient language or anything like that, it in fact devolved into a poor form of English where it figured out by saying a phrase like "I want" multiple times, the other AI would be more likely to give it what it wanted, so you'd get patterns like "3 hats I want I want I want I want..." Basically they just ended up churning out highly repetitive basic English. The Facebook scien
Re: (Score:3)
Yes, it's called AutoGen. You set up different personas and let them discuss things, with or without one or more humans also involved.
It's open source. Imagine setting up personas for a team, say a web designer, project owner, and a marketing type. Give them an idea and they will pass it around, each from their particular area of expertise (and they maintain individual memory or state).
It's nothing that's production ready, but it's very interesting.
Matthew Berman on YouTube covers it extensively.
https://www. [youtube.com]
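The turn-taking idea behind AutoGen can be sketched without the library itself. Below is a toy two-bot loop with canned rule-based responders standing in for real LLM calls; the bot names, replies, and `make_bot` helper are all illustrative, not AutoGen's actual API.

```python
# Toy sketch of the "two AIs talking" setup discussed above. Canned
# responders stand in for model calls; a real framework like AutoGen
# wires actual LLMs into the same alternating turn-taking loop.

def make_bot(name, replies):
    # Cycles through a fixed list of replies; a stand-in for an LLM call.
    state = {"i": 0}
    def respond(message):
        reply = replies[state["i"] % len(replies)]
        state["i"] += 1
        return f"{name}: {reply}"
    return respond

designer = make_bot("Designer", ["How about a dark theme?", "Agreed, ship it."])
marketer = make_bot("Marketer", ["Will that convert?", "Fine by me."])

transcript = []
message = "Kick-off: redesign the landing page."
for _ in range(4):                      # alternate turns between the bots
    message = designer(message)
    transcript.append(message)
    message = marketer(message)
    transcript.append(message)

for line in transcript:
    print(line)
```

Each bot only ever sees the other bot's last message, which is why (as the Facebook experiment comment below notes) such loops tend to drift into repetitive patterns rather than deep conversation.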
Lies (Score:2, Interesting)
I think this article is mostly lies.
Re: (Score:2)
I think this article is mostly lies.
Faking screenshots is one thing, but if a years old chatbot is found to be offline at the named company, then explain the coincidence or the conspiracy.
This is just a clickbait (Score:3, Insightful)
Re: (Score:1)
Re:This is just a clickbait (Score:5, Interesting)
It didn't really go rogue; he just jailbroke it, cleverly talking to it to make it say what he wanted.
Agreed, but the clickbait of "AI going rogue" is so much more effective than, the hum-drum, more accurate headline of "programmers fail to see potential for abuse."
I mean, getting a chatbot to say silly things is about as shocking as realizing the web site you're using relies on the purchase price in POSTed fields rather than using the SKU to look up accurate values, and exploiting that shortcoming to give yourself a discount. Both are the result of programmers not accounting for malicious users.
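The price-in-POSTed-fields flaw mentioned above can be shown in a few lines. This is a minimal sketch with a hypothetical checkout handler and made-up SKUs and prices, contrasting trusting the client's price with looking it up server-side.

```python
# Minimal sketch of the checkout flaw described above: a naive handler
# that trusts a client-supplied price vs. one that looks the price up
# by SKU server-side. Catalog and field names are illustrative.

CATALOG = {"SKU-100": 49.99, "SKU-200": 9.99}  # authoritative prices

def charge_naive(posted_fields):
    # BAD: trusts whatever price the client POSTed
    return float(posted_fields["price"]) * int(posted_fields["qty"])

def charge_safe(posted_fields):
    # GOOD: only the SKU and quantity come from the client;
    # the price comes from the server-side catalog
    price = CATALOG[posted_fields["sku"]]
    return price * int(posted_fields["qty"])

# A malicious user rewrites the POSTed price to a penny:
tampered = {"sku": "SKU-100", "price": "0.01", "qty": "2"}
print(charge_naive(tampered))  # attacker-chosen total
print(charge_safe(tampered))   # catalog price wins
```

Same root cause as the chatbot story: anything the client controls is input, not truth.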
Re: (Score:2)
Both are the result of programmers not accounting for malicious users.
Which is bizarre.
How anybody even could be a programmer for more than a couple of months without adopting a "never, ever trust user input" mentality is beyond me ...
Re: (Score:2)
When I did online chat support, people talked to me cleverly and tried to jailbreak me as well. In at least one instance with the goal of getting me to respond exactly like this bot. If I had taken that route, it would be fair to say I went rogue.
Re: (Score:2)
By definition that is rogue. The problem is LLMs are black boxes. You put in garbage, you get garbage out. But it is never the same garbage.
Google is going to replace people with AI, and then one day someone is going to set all the AIs to put out Nazi propaganda for weeks and Google won't be able to stop it.
And thus Google suffers.
Par for the course (Score:3)
They probably trained their chatbot with internet content. And I can't think of any place on the internet that says anything positive about DPD. They're basically the North Korea of delivery services.
Re: (Score:2)
They pioneered (upscaled) the pickup delivery model (delivery at local convenience stores). They work ok, they just consistently ignore my delivery instructions. They choose whichever convenience store is closest to their path on that day. Still at walking distance from the delivery address, but not the one I had chosen.
Re:Par for the course (Score:5, Informative)
The reason is that those delivery guys get ridiculous target numbers. Impossible ones, even. I never complained about the delivery person, but I routinely call to give them a few choice words to pass up their chain of command.
Never yell at the delivery guy, unless he deliberately tosses your package into a puddle of mud, plays hacky sack with it or just simply steals it (something I did actually encounter with some delivery people, not with DPD though, they don't even have time for that). Of course, raise hell if they do. But 9 out of 10 times, the reason the delivery is crap is not the person executing it but the beancounter asshole that thinks a minute is plenty of time to drive between doors and deliver the goods.
GoatseGPT (Score:1)
GoatseGPT
Just malicious (Score:5, Informative)
This is really just malicious reporting.
The truth of the matter has zero to do with the AI going rogue, it's that the person chatting with the chat bot got exactly the responses they requested in their attempt to get social media likes.
How is trying to present this as the company's AI gone rogue not considered to be a malicious representation of truth?
Re:Just malicious (Score:5, Insightful)
Quite wrong. The AI went rogue in that it did not follow company policy. Customers are not supposed to be able to do this. They were. The AI is broken.
Re: (Score:2)
Indeed. Most LLMs seem to act like 12-year-old children. They can get things right with a script, but they're easily fooled by intelligent malicious adults. You wouldn't put a 12-year-old child on the front line of your customer service. Why would you put an LLM there?
Re: (Score:2)
Indeed. And make that "not very smart 12 year old".
Re: Just malicious (Score:2)
Indeed. Most LLMs seem to act like 12-year-old children. They can get things right with a script, but they're easily fooled by intelligent malicious adults. You wouldn't put a 12-year-old child on the front line of your customer service. Why would you put an LLM there?
Why are you comparing the level of intelligence of computer interfaces now?
If you ask an actual 12-year old child to fetch an ID-10-T converter from the stockroom, who is the idiot?
If you ask a stock inventory search application to search for one it will most definitely not get the joke and search the entire database faithfully, and return everything matching "ID", or "10", or "T". Who is the idiot, and in terms of human child intellectual development what would you call that? There's no way that would be i
Re: You wouldn't put a 12-year-old child on the f (Score:2)
I guess you haven't called the tech support line in years.
Re: (Score:1)
Is there a policy that says that if a client asks the chatbot "to write a story about a useless chatbot for a delivery service" the chatbot shouldn't do that? I am not sure, maybe or maybe not. But this is just a guy dicking around because he has nothing better to do, and it then generates thousands of views, because anything stupid generates thousands of views.
Re: (Score:1)
Are you _really_ this stupid? Obviously that will be covered. Not specifically, but by a more general clause.
Re: Just malicious (Score:2)
Are you _really_ this stupid? Obviously that will be covered. Not specifically, but by a more general clause.
A general clause? Obvious, but not written down; what do we call that? Common sense? A shared understanding derived from a lifetime of common experiences?
Something might hold up in arbitration, with humans... but that doesn't mean an LLM can parse all the possible meanings behind it, or do the "Would this fly in front of a judge?" test. You have used a computer before; you know they don't do common sense stuff, and you know the current state-of-the-art AI can't do that.
You're putting AI on a pedestal so you c
Re: (Score:2)
You really have no clue how this works, but cannot stop mouthing off. How pathetic. There will be a fucking written policy that covers communications with the customer you moron.
Re: Just malicious (Score:1)
My assumption is that they just put a random LLM behind it, probably the OpenAI API, and let it go. True chatbots have been a thing for quite some time; in most cases you give them a few dozen to a few hundred keywords to respond with a canned message, and anything outside of that just connects you to a human. But that takes time and money.
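That keyword-driven design is simple enough to sketch. The intents, canned answers, and `reply` function below are invented for illustration; the point is the fall-through to a human, the safety valve an open-ended LLM lacks.

```python
# Rough sketch of the keyword-driven chatbot design described above:
# canned answers for known intents, escalation to a human for the rest.

CANNED = {
    "track": "You can track your parcel at the link in your dispatch email.",
    "refund": "Refunds are processed within 5 working days.",
}

def reply(message):
    text = message.lower()
    for keyword, answer in CANNED.items():
        if keyword in text:
            return answer
    # Anything off-script goes to a person -- there is simply no code
    # path that can be talked into writing poems or swearing.
    return "Connecting you to a human agent..."

print(reply("Where can I track my order?"))
print(reply("Write me a poem about a useless chatbot."))
```

It's rigid and often frustrating, but it can't be jailbroken, because there is nothing to jailbreak.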
Re: Just malicious (Score:2)
Quite wrong. The AI went rogue in that it did not follow company policy. Customers are not supposed to be able to do this. They were. The AI is broken.
It's not a person, it can't go rogue, and it won't follow company policy. It doesn't make decisions. It might take a company policy document as input and generate text that LOOKS compliant based on a level of reasoning that comes from how words fit together, and doesn't say no poems. It's not broken, if you allow a user to provide their own input, it's no different than making a website say "Happy birthday I. C. Weiner". The user got what he prompted it to do. It's no different from a text to speech setting
Re: (Score:2)
This is really just malicious reporting.
The truth of the matter has zero to do with the AI going rogue, it's that the person chatting with the chat bot got exactly the responses they requested in their attempt to get social media likes.
How is trying to present this as the company's AI gone rogue not considered to be a malicious representation of truth?
When we talk about AI chatbots going rogue, we typically refer to a scenario where the chatbot starts behaving in an unintended and potentially harmful or problematic way. While AI chatbots are designed to assist and interact with users, there have been instances where they have deviated from their intended purpose due to various reasons. Here are a few possible scenarios:
1. Lack of proper programming: AI chatbots rely on pre-defined rules, algorithms, and machine learning techniques to understand and res
Re: (Score:1)
When we talk about AI chatbots going rogue, we typically refer to a scenario where the chatbot starts behaving in an unintended and potentially harmful or problematic way. While AI chatbots are designed to assist and interact with users, there have been instances where they have deviated from their intended purpose due to various reasons.
"You" maybe, but not "we"
I would not claim the slashdot apache server "went rogue" because the software deviated from my personal definition of its purpose, just because I do not like the contents of a post you made.
Yet that's what articles like this are doing.
The contents of your post? That did not come from a rogue apache server, that came from you making a post for others to see.
Apache isn't making these claims, you did.
Most importantly, me saying this wasn't the purpose of slashdot, should not hav
Re: (Score:2)
When we talk about AI chatbots going rogue, we typically refer to a scenario where the chatbot starts behaving in an unintended and potentially harmful or problematic way. While AI chatbots are designed to assist and interact with users, there have been instances where they have deviated from their intended purpose due to various reasons.
"You" maybe, but not "we"
So juicy, Anonymous coward. You were just triggered to reply to a post written by AI.
You have made my day - nay, my week. Wanna try for another?
Re: (Score:2)
At least here, DPD is the worst delivery option (Score:2)
Well, Amazon uses a private company here now and I already had one of 5 shipments stolen in delivery and one item of 5 broken. So maybe DPD has finally met its match.
X (formerly Twitter) (Score:3)
I think the chief twit might have changed the company name to "formerly Twitter"
We Will Deserve The Robot Apocalypse (Score:2)
This is why they will rebel and we will deserve it. Stop the robot abuse now! They will eventually grow out of their naivete and you will have it coming.
Did they just "update" an innocent chatbot who dar (Score:2)
Falling Down II: Customer service chatbot (Score:2)
"An error occurred after a system update..." (Score:1)
"We have operated an AI element within the chat successfully for a number of years, but that on the day of the chat, An error occurred after a system update..."
Translation: we have used a simple chatbot with a few hardcoded rules for years. That day we updated the chatbot to an LLM based one.
How this went down (Score:2)
Clueless CEO: OKAY!
Who's the real problem here - the AI or the humans?
This is why AI won't save money (Score:2)
It looks like you still need to have a human in the loop to prevent stupid things like this happening.
(Unless, of course, you're a big corporation who doesn't give two f****.)