AI chatbots are bad at planning, but this could soon change

Nello Cristianini, Professor of Artificial Intelligence, University of Bath

12 April 2024 at 8:31 am·4-min read

<span class="attribution"><a class="link " href="https://www.shutterstock.com/image-photo/asian-women-using-laptops-communicate-business-2177108789" rel="nofollow noopener" target="_blank" data-ylk="slk:Aree_S / Shutterstock;elm:context_link;itc:0;sec:content-canvas">Aree_S / Shutterstock</a></span> — Aree_S / Shutterstock

We might soon see AI step up to the next level, with impending upgrades to artificial intelligence (AI) systems developed by OpenAI and Meta. OpenAI’s GPT-5 will be the new “engine” within the AI chatbot ChatGPT, while Meta’s upgrade will be named Llama 3. Among other things, the current version of Llama powers chatbots on Meta’s social media platforms.

Statements to the media by executives at both OpenAI and Meta suggest that some ability to plan ahead will be incorporated into these upgraded systems. But how exactly will this innovation change the capabilities of AI chatbots?

Imagine you are driving from home to work and want to select the best route – that is, the sequence of choices that is optimal in some sense, based on cost or timing, for example. An AI system would be perfectly capable of choosing the better of two existing routes. But it would be a far more difficult task for it to generate the optimal route from scratch.

A route ultimately consists of a sequence of different choices. However, making individual decisions in isolation is not likely to lead to an optimal overall solution.

For instance, sometimes you have to make a little sacrifice at the start, to reap some benefit later on: maybe joining a slow queue to enter the motorway, in order to move faster later on. This is the essence of a planning problem, a classic topic in artificial intelligence.

There are parallels here with board games such as Go: the outcome of a match depends on the overall sequence of moves, and some moves are aimed at unlocking opportunities that can be exploited later on.

The AI company Google DeepMind developed a powerful AI to play this game called AlphaGo, based on an innovative approach to planning. It was not only able to explore a tree of available options, but also to improve on that ability with experience.

Of course, the real point is not about finding optimal routes for driving or playing games. The technology that powers products such as ChatGPT and Llama 3 are called Large Language Models (LLMs). What is at stake here is providing these AI systems with the ability to consider the long term consequences of their actions. This skill is also necessary to solve mathematical problems, so it potentially unlocks other capabilities for LLMs.

Large language models are designed to predict the next word in a given sequence of words. But in practice, they are used to predict long series of words, such as the answers to questions from human users.

This is currently done by adding one word to the answer, then another word and so on, thereby extending the initial sequence. This is known in the jargon as “autoregressive” prediction. However, LLMs can sometimes paint themselves into corners that are impossible to get out of.

Expected development

An important goal for LLM designers has been to combine planning with deep neural networks, the type of algorithms – or set of rules – that sit behind the models. Deep neural networks were originally inspired by the nervous system. They can improve at what they do through a process called training, where they are exposed to large sets of data.

The wait for LLMs that can plan might be over, according to the comments by OpenAI and Meta executives. However, this comes as no surprise to AI researchers, who have been expecting such a development for some time.

Late last year, OpenAI’s CEO Sam Altman was fired and then rehired by the company. At the time, the drama was rumoured to have involved the company’s development of an advanced algorithm called Q*, although this explanation has since been superseded. Although it’s not clear what Q* does, at the time, the name rang bells with AI researchers because it echoed names for existing methods for planning.

Commenting on those rumours, Meta’s head of AI, Yann LeCun, wrote on X (formerly Twitter that replacing the process of auto regression with planning in LLMs was challenging, but that almost every top lab was working on it. He also thought it was likely that Q* was OpenAI’s attempt to incorporate planning into its LLMs.

LeCun was onto something in what he said about the top labs, because recently, Google DeepMind published a patent application that hinted at planning capabilities.

Intriguingly, the listed inventors were members of the AlphaGo team. The method described in the application looks much like the one that guides AlphaGo towards its goals. It would also be compatible with the current neural network architectures used by large language models.

That brings us to the comments by executives at Meta and OpenAI about the capabilities of their upgrades. Joelle Pineau, vice-president of AI research at Meta, told the FT newspaper: “We are hard at work in figuring out how to get these models not just to talk, but actually to reason, to plan . . . to have memory.”

If that works, we might well see progress on planning and reasoning, moving from simple, step-by-step word generation to planning entire conversations, or even negotiations. Then we might really see AI step up to the next level.

This article is republished from The Conversation under a Creative Commons license. Read the original article.

The Conversation

Nello Cristianini does not work for, consult, own shares in or receive funding from any company or organisation that would benefit from this article, and has disclosed no relevant affiliations beyond their academic appointment.

The Independent
Stranded Boeing astronauts are stuck on International Space Station, Nasa says in urgent update
The astronauts stranded on the International Space Station are still not able to come home, Nasa has said. Two astronauts went to the space station almost 50 days ago as part of a test of Boeing’s Starliner capsule. Test pilots Butch Wilmore and Suni Williams were supposed to visit the orbiting lab for about a week and return in mid-June, but thruster failures and helium leaks on Boeing‘s new Starliner capsule prompted Nasa and Boeing to keep them up longer.
Futurism
Terrifying NASA Video Shows America Spewing CO2 Into Atmosphere
Trapped Gases NASA has released a new visualization that shows copious amounts of carbon dioxide swirling around the Earth's atmosphere. The video shows how concentrations of the gas move across the planet, driven by wind and atmospheric circulation, from January through March 2020. The level of detail is truly astonishing, allowing us to "zoom in […]
Futurism
Astronaut Shows Photo He Shot in Space That Would Be Impossible to Take Now
Pinpoint Stars In 2003, when the International Space Station was a mere three years old, NASA astronaut Donald Pettit took a gorgeous picture of the Earth's atmosphere, with countless stars frozen in time in the background. But as Pettit revealed in a Reddit post earlier this week, the same photo "cannot be taken anymore" — […]
AFP
NASA Mars rover captures rock that could hold fossilized microbes
NASA's Perseverance Mars rover has made what could be its most astonishing discovery to date: possible signs of ancient life on the Red Planet.The quest to confirm ancient Martian life is far from over, however.
CNN
Boeing, NASA may have found ‘root cause’ of Starliner spacecraft’s issues, but astronauts are still in limbo
After weeks of testing, NASA and Boeing officials say they better understand the issues plaguing the Starliner spacecraft, but still aren’t ready to name a return date.
Business Insider
Rats, roaches, and other creatures are winning against climate change, and it's bad news for humans
In the game of climate change, there are winners and losers. These four animals will come out on top, but you probably won't be happy about it.
Futurism
NASA Rocket for First Crewed Moon Mission Since Apollo Arrives at Launch Site
The rocket designed to carry the first humans to the Moon in over half a century has officially made its way to NASA's Kennedy Space Center (KSC) just over a year ahead of its tentative launch date. The enormous, 212-foot rocket stage made its way from NASA's Michoud Assembly Facility in New Orleans to […]
HowStuffWorks
Is Africa Splitting in Two? Really? Here's the Scoop
The notion of Africa splitting has the attention scientists and geologists worldwide, as the Great Rift Valley stretches and tears at the Earth's crust.
Time
CERN Science Gateway: World's Greatest Places 2024
Find out why CERN Science Gateway is one of the World's Greatest Places 2024
Futurism
When AI Is Trained With AI-Generated Data, It Starts Spouting Gibberish
What happens when you feed AI-generated content back into an AI model? Put simply: absolute chaos. A fascinating new study published in the journal Nature shows that AI models trained on AI-generated material will quickly experience rapid "model collapse." Basically, as an AI model cannibalizes AI-generated data, that AI model's outputs become increasingly bizarre, garbled, […]
The Independent
How to cook like a Neanderthal: Scientists recreate surprisingly precise recipes of our ancestors
Archaeologists used a flint tool to butcher two carrion crows, two collared doves and a wood pigeon, all of which Neanderthals ate
Futurism
Scientists Outraged at Canceled NASA Moon Mission Plead Congress to Reconsider
Last week, NASA made a shocking announcement. It would not be sending its $450 million rover, called the Volatiles Investigating Polar Exploration Rover (VIPER), to the Moon, where its state of the art capabilities were anticipated to uncover secrets about water ice just beneath the lunar surface. The reason, according to NASA officials, is that […]
The Telegraph
Nasa finds Mars rock that may have hosted tiny lifeforms
A rock on Mars may have hosted microscopic life billions of years ago, Nasa believes.
Northwich & Winsford Guardian
Primary school 'nurturing the scientists of tomorrow’ earns prestigious award
A SCHOOL is celebrating earning a prestigious award.
People
Astronaut Reveals Reason She Wears Friendship Bracelets in Space — and Why She Won't Stop (Exclusive)
In 2026, Kellie Gerardi will lead an all-female team of international researchers to space
The Telegraph
Everything you need to know about La Niña, the climate phenomenon behind this year’s extreme weather
For months the world endured droughts, heat waves, floods and cyclones as one of the strongest El Niño events on record brought chaos to global weather systems.
PA Media: UK News
Rollout of payment schemes causing ‘widespread uncertainty’ for farmers – report
The changes have come at a time when extreme weather, market conditions and sudden rises in input costs are putting farms under immense pressure.
Associated Press Videos
NASA says no return date yet for astronauts and Boeing capsule at space station
Already more than a month late getting back, two NASA astronauts will remain at the International Space Station until engineers finish working on problems plaguing their Boeing capsule. There is no date for returning the astronauts to Earth.
OK! Magazine
Meghan Markle 'hysterical' after breaking Prince Harry's strict orders in interview
Meghan Markle was given strict instructions by Prince Harry before she sat down for an interview with Vanity Fair - but the Duchess of Sussex seemingly didn't listen
Evening Standard
Elderly woman was 'rammed with trolley' sparking Manchester airport police 'stamping' incident
Brothers confronted man who had argued with their mother on flight before pushing trolley into her, it is claimed

Expected development

Latest stories