UPDATED 21:09 EDT / NOVEMBER 20 2024

Chinese AI startup DeepSeek’s newest model surpasses OpenAI’s o1 in ‘reasoning’ tasks

Chinese artificial intelligence startup DeepSeek has unveiled a new “reasoning” model that it says compares very favorably with OpenAI’s o1 large language model, which is designed to answer math and science questions with more accuracy than traditional LLMs.

The startup, which is an offshoot of the quantitative hedge fund High-Flyer Capital Management Ltd., revealed on X today that it’s launching a preview of its first reasoning model, DeepSeek-R1.

Reasoning models are different from standard LLMs thanks to their ability to “fact-check” their responses. To do this, they typically spend a much longer time considering how they should respond to a prompt, allowing them to sidestep problems such as “hallucinations,” which are common with chatbots like ChatGPT.

When OpenAI released the o1 model in September, it said it’s much better at dealing with queries and questions that require reasoning skills. That’s because it relies on a machine learning technique known as “chain of thought” or CoT, which allows it to break down complex tasks into smaller steps and carry them out one-by-one, improving its accuracy.

DeepSeek-R1 works in a similar way, planning ahead when presented with a complex problem and working through its steps one after the other to ensure it can respond accurately. The process can take a while though, and like o1, it might need to “think” for up to 10 seconds before it can generate a response to a question.

The model’s thought process is entirely transparent too, allowing users to follow it as it tackles the individual steps required to arrive at an answer.

The startup says DeepSeek-R1 bests the capabilities of o1 on two key benchmarks, AIME and MATH. The former is drawn from the American Invitational Mathematics Examination, a competition for high school students, while the latter is a collection of competition-level math problems. In addition, the model correctly answered a number of “trick” questions that have tripped up existing models such as GPT-4o and Anthropic PBC’s Claude, VentureBeat reported.

However, DeepSeek-R1 does suffer from a number of issues, with some commenters on X saying that it appears to struggle with logic problems such as Tic-Tac-Toe. That said, o1 also struggled with the same kinds of problems.

Users also reported that DeepSeek-R1 doesn’t respond to queries that the Chinese government likely deems to be too sensitive. When asked about incidents such as the Tiananmen Square massacre, Chinese President Xi Jinping’s relations with Donald Trump, and the potential of China invading Taiwan, it consistently replied that it’s “not sure how to approach this type of question.”

DeepSeek’s rejection of politically sensitive queries likely stems from the need for Chinese developers to ensure their models “embody core socialist values.”

That said, some users also revealed that it’s quite easy to jailbreak DeepSeek-R1 by prompting it in a way that makes it ignore its guardrails. For example, one user found a way to get it to provide a detailed recipe and instructions for creating methamphetamine, which is, of course, highly illegal in most countries.

DeepSeek is a rather unusual AI startup thanks to its backing by a quantitative hedge fund that aims to use LLMs to enhance its trading strategies. It’s not new on the AI scene, having previously released an LLM called DeepSeek-V2 for general-purpose text and image generation and analysis. It was founded by a computer science graduate called Liang Wenfeng, and has the stated aim of achieving “superintelligent” AI.

DeepSeek-R1 can be accessed via the DeepSeek Chat application on the company’s website. Although it’s free to use, nonpaying users are limited to just 50 messages per day. The company is also planning to make DeepSeek-R1 available through an application programming interface.

Image: SiliconANGLE/Freepik AI

