Oxford University research set to improve reliability of AI tools like ChatGPT

Oxford research is set to make AI-generated material more accurate. (Image: PA)

Researchers from the University of Oxford have made a significant advance towards ensuring that information produced by generative artificial intelligence (AI) is robust and reliable.

Currently, so-called hallucinations, where an AI tool invents facts that sound plausible but are imaginary, are a critical factor holding back wider adoption of large language models (LLMs) like ChatGPT or Gemini.

These errors can make LLMs unreliable: the researchers point to the example of a US lawyer who got into legal trouble for citing a case invented by ChatGPT. They can also be dangerous when the tools are used in medical diagnosis.


In a new study published today in Nature, the Oxford researchers demonstrated a new way to detect when an LLM is likely to ‘hallucinate’.

This advance could open up new ways to deploy LLMs in situations where "careless errors" are costly, such as legal or medical question-answering.

The researchers focused on a type of hallucination known as confabulation, where an LLM gives different answers each time it is asked a question, even when the wording of that question is identical.

Study author Dr Sebastian Farquhar said: “LLMs are highly capable of saying the same thing in many different ways, which can make it difficult to tell when they are certain about an answer and when they are literally just making something up.

“With previous approaches, it wasn’t possible to tell the difference between a model being uncertain about what to say versus being uncertain about how to say it.

"But our new method overcomes this.”