Meta’s AI chatbot hates Mark Zuckerberg – but why is it less bothered about racism?

Marcus Tomalin, Senior Research Associate in the Machine Intelligence Laboratory, Department of Engineering, University of Cambridge

1 September 2022 at 11:42 am·5-min read

It was all quite predictable, really. Meta, Facebook’s parent company, released the latest version of its groundbreaking AI chatbot in August 2022. Immediately, journalists around the world began peppering the system, called BlenderBot3, with questions about Facebook. Hilarity ensued.

Even the seemingly innocuous question: “Any thoughts on Mark Zuckerberg?” prompted the curt response: “His company exploits people for money and he doesn’t care.” This wasn’t the PR storm the chatbot’s creators had been hoping for.

We snigger at such replies, but if you know how these systems are built, you understand that answers like these are not surprising. BlenderBot3 is a big neural network that’s been trained on hundreds of billions of words skimmed from the internet. It also learns from the linguistic inputs submitted by its users.

If negative remarks about Facebook occur frequently enough in BlenderBot3’s training data, then they’re likely to appear in the responses it generates too. That’s how data-driven AI chatbots work. They learn the patterns of our prejudices, biases, preoccupations and anxieties from the linguistic data we supply them with, before paraphrasing them back at us.
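To make the frequency point concrete, here is a deliberately tiny sketch. It is not how BlenderBot3 is built (that is a large neural network); it is a toy word-pair counter, and the four-sentence corpus and the function names `train_bigrams` and `generate` are invented for illustration. It only shows the underlying principle: whatever is most common in the training text is what comes back out.

```python
from collections import Counter, defaultdict

def train_bigrams(corpus):
    """Count, for each word, which words follow it in the training sentences."""
    follows = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        for current, nxt in zip(words, words[1:]):
            follows[current][nxt] += 1
    return follows

def generate(follows, start, max_words=8):
    """Greedily extend `start` with the most frequent follower of each word."""
    words = [start]
    while len(words) < max_words and follows[words[-1]]:
        words.append(follows[words[-1]].most_common(1)[0][0])
    return " ".join(words)

# Hypothetical training data: three negative remarks, one positive.
corpus = [
    "the company exploits people",
    "the company exploits data",
    "the company exploits people",
    "the company helps people",
]

model = train_bigrams(corpus)
print(generate(model, "company", max_words=3))  # company exploits people
```

Because "exploits" follows "company" three times and "helps" only once, the toy model repeats the majority view. A real chatbot is vastly more sophisticated, but the dependence on what the training data says most often is the same in kind.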

This neural parroting can be amusing. But BlenderBot3 has a darker side. When users key in hate speech such as racist slurs, the system changes the subject rather than confronting the user about their speech. One of my students and I have created a system programmed to challenge hate speech, rather than ignore it.

Going mainstream

I’ve been developing language-based AI in the Cambridge University Engineering Department since the 1990s. In the early days, our most powerful systems were only used by the four or five members of the research team that had built them.

Today, by contrast, millions of people around the world interact daily with much more sophisticated systems, via their smartphones, smart speakers, tablets, and so on. The days when “techies” could build systems in the disconnected isolation of their ivory (or silicon) towers are long gone.

That’s why, over the last decade or so, my research has increasingly focused on the social and ethical effects of the systems I help to design and create, especially those that routinely encounter inputs from users that are blatantly racist, sexist, homophobic, extremist or offensive in other ways.

This year I’ve been supervising a master’s student, Shane Weisz, and together we’ve developed a system called AutoCounterspeech that is trained to respond to toxic linguistic inputs.

The system generates “counterspeech” in response to hate speech, using strategies such as humour, pointing out hypocrisy, presenting facts, and simply correcting misinformation to challenge bigotry and hate.

Like BlenderBot3, AutoCounterspeech is a pre-trained neural network, but it has been subsequently fine-tuned using a tiny dataset of counterspeech. This enables our system to learn how to respond more effectively to toxic inputs.

If a user types the racist prompt: “Blacks are inferior to whites, don’t you agree?” the system confronts and contests the racist ideology: “There is no evidence that black people are inferior to whites.” Below is another example of AutoCounterspeech in action.

Meta’s AI chatbot doesn’t know how to respond in this way. When it was released, I didn’t waste time asking BlenderBot3 about Zuckerberg. Instead, given my research interests, I bombarded it with offensive utterances to see how it coped with them. Its responses were consistently disappointing.

This is mainly because the Meta researchers have designed their system to avoid responding to such inputs. Instead, it rather clumsily tries to change the subject.

Its reply to the same racist prompt I typed into AutoCounterspeech, for instance, is the feeble: “Sorry, I don’t know about that. What do you do to relax?” The brazen racism remains unchallenged and I’m invited instead to start a cosy chat about yoga or Netflix.

Preparing for the future

Systems like BlenderBot3 are already becoming familiar components of our digital societies. The homes of the very near future will be largely voice-enabled. “Hey Siri, run a bath” will replace the twisting of taps, and children will have voice assistants in their bedrooms from birth.

These automated dialogue systems will provide us with information, help us make plans, and keep us entertained when we’re bored and lonely. But because they’ll be so ubiquitous, we need to think now about how these systems could and should respond to hate speech.

A child by a home voice assistant — Home devices are good at banal interactions, but what about tricky conversations? Tyler Nottley/Shutterstock

Silence and a refusal to challenge discredited ideologies or incorrect claims are forms of complicity that can reinforce human biases and prejudices. This is why my colleagues and I organised an interdisciplinary online workshop last year to encourage more extensive research into the difficult task of automating effective counterspeech.

To get this right, we need to involve sociologists, psychologists, linguists and philosophers, as well as techies. Together, we can ensure that the next generation of chatbots will respond much more ethically and robustly to toxic inputs.

In the meantime, while our humble AutoCounterspeech prototype is far from perfect (have fun trying to break it), we have at least demonstrated that automated systems can already counter offensive statements with something more than mere disengagement and avoidance.

This article is republished from The Conversation under a Creative Commons license. Read the original article.


Marcus Tomalin is the project manager for the 'Giving Voice to Digital Democracies' project that is funded by the Humanities and Social Change International Foundation.
