UPDATED 20:42 EDT / FEBRUARY 12 2025

AI

Report says companies are ‘playing with fire’ as AI chatbots fail to summarize news accurately

Four major artificial intelligence chatbots produced “significant inaccuracies” when summarizing news stories, according to a report issued this week by the BBC.

This comes a month after Apple Inc. suspended its iPhone news summarization feature when it emerged that the feature was making substantial mistakes, in effect generating misinformation. “We are working on improvements and will make them available in a future software update,” Apple said at the time.

In this new test, BBC staff fed 100 news articles from the broadcaster’s own website to OpenAI’s ChatGPT, Microsoft’s Copilot, Google’s Gemini and Perplexity. The bots were then asked questions about the articles, and their answers contained what the BBC described as “significant inaccuracies” and distortions.

In total, 51% of the AI-generated answers were judged to have significant issues of some form, while 19% of the answers that cited BBC content “introduced factual errors, such as incorrect factual statements, numbers, and dates,” said the report.

Deborah Turness, chief executive of BBC News and Current Affairs, whose organization conducted the tests, said AI brings “endless opportunities,” but warned that the rush to let AI chatbots loose on the serious business of reporting the news is “playing with fire.” She added: “We live in troubled times, and how long will it be before an AI-distorted headline causes significant real-world harm?”

Some of the mistakes were quite outlandish. One ChatGPT answer asserted that leaders who had already left office were still serving as the U.K.’s prime minister and Scotland’s first minister. Perplexity misquoted BBC reporting on the Middle East conflict, saying Iran initially showed “restraint” and describing Israel’s actions as “aggressive,” characterizations the correspondent had not made. Gemini, meanwhile, attributed a position on vaping to the U.K.’s National Health Service that the health service had not actually taken.

The best performances came from ChatGPT and Perplexity, while Copilot and Gemini had more “significant” issues. Even so, the report found that all four bots “struggled to differentiate between opinion and fact, editorialized, and often failed to include essential context.”

The report said companies might need to “pull back” or at least reassess what they are doing with news summaries considering “the scale and scope of errors and inaccuracies they produce.”

OpenAI was the only company to respond immediately. A spokesperson told BBC News: “We’ve collaborated with partners to improve in-line citation accuracy and respect publisher preferences, including enabling how they appear in search by managing OAI-SearchBot in their robots.txt.”
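For readers unfamiliar with the mechanism OpenAI is referring to, robots.txt is the standard plain-text file a publisher places at the root of its website to tell crawlers which pages they may access, and OAI-SearchBot is the user agent OpenAI uses for its search crawling. A minimal, illustrative entry might look like the sketch below; the paths shown are hypothetical placeholders, not values prescribed by OpenAI or the BBC.

    # Illustrative robots.txt directives (hypothetical paths):
    # allow OpenAI's search crawler to index public news pages,
    # but keep it out of an example /internal/ directory.
    User-agent: OAI-SearchBot
    Allow: /news/
    Disallow: /internal/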

Image: DALL-E
