UPDATED 17:51 EDT / JANUARY 08 2024

OpenAI argues New York Times’ AI copyright lawsuit is ‘without merit’

OpenAI claims that a copyright lawsuit brought against it and Microsoft Corp. by The New York Times is “without merit.”

The artificial intelligence developer made the argument in a blog post published today. The response comes less than two weeks after the Times filed its lawsuit, which accuses OpenAI of using millions of the paper’s articles to train its AI models. Additionally, ChatGPT is alleged to have displayed paywalled content in response to some user prompts.

Rumors that the Times may pursue legal action against OpenAI first emerged in August. That month, the paper updated its terms of service with a provision prohibiting companies from scraping its content for AI training purposes. According to NPR, the Times began weighing litigation after negotiations with OpenAI about a potential content licensing deal became “contentious.”

The AI developer has not detailed what datasets it used to train its latest large language models. However, OpenAI did disclose that LLMs released before GPT-3.5 drew on an open-source dataset called Common Crawl. That dataset, the Times’ lawsuit states, contains about 16 million records from websites operated by the paper.

A second argument included in the lawsuit is that ChatGPT sometimes displays paywalled articles when prompted to do so by users. The issue allegedly also affects Microsoft’s Bing Chat service, which is based on the same GPT-4 model as ChatGPT.

In addition to claiming that the lawsuit is without merit, the blog post OpenAI published today pushes back against two of the core copyright concerns the Times has raised.

The AI developer argues that training AI models using publicly available content is fair use. Its blog post goes on to state that the Times’ “content didn’t meaningfully contribute to the training of our existing models and also wouldn’t be sufficiently impactful for future training.”

The blog post also addresses the Times’ concerns about ChatGPT providing access to paywalled articles. According to OpenAI, the phenomenon is a “rare bug” that it’s currently working to fix. The AI developer’s blog post goes on to claim that “The New York Times is not telling the full story.”

“The regurgitations The New York Times induced appear to be from years-old articles that have proliferated on multiple third-party websites,” OpenAI stated. “It seems they intentionally manipulated prompts, often including lengthy excerpts of articles, in order to get our model to regurgitate. Even when using such prompts, our models don’t typically behave the way The New York Times insinuates.”

The lawsuit is the latest in a series of legal complaints brought against generative AI developers over the past few quarters. Previously, OpenAI and Microsoft were sued for the manner in which they used open-source code to train the GitHub Copilot programming assistant. A separate lawsuit filed last January accused Stability AI Ltd., DeviantArt Inc. and Midjourney Inc. of using copyrighted images to develop their AI models.

Photo: Unsplash

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

OpenAI argues New York Times’ AI copyright lawsuit is ‘without merit’

Photo: Unsplash

A message from John Furrier, co-founder of SiliconANGLE:

LATEST FROM THECUBE

UPCOMING CUBE EVENTS

RECENT CUBE EVENTS

KubeCon + CloudNativeCon EU 2026

RSAC 2026 Conference

Nvidia GTC 2026

Google Cloud AI Agents in Action Series 2025/2026

MWC Barcelona 2026

OpenAI argues New York Times’ AI copyright lawsuit is ‘without merit’

Photo: Unsplash

A message from John Furrier, co-founder of SiliconANGLE:

LATEST STORIES

LATEST STORIES

KubeCon + CloudNativeCon EU 2026

RSAC 2026 Conference

Nvidia GTC 2026

Google Cloud AI Agents in Action Series 2025/2026

MWC Barcelona 2026

Cookies