UPDATED 14:21 EDT / MARCH 04 2024

Anthropic debuts Claude 3 model series with ‘near-human’ capabilities in some areas

Anthropic PBC today introduced a new family of large language models, the Claude 3 series, that it says can outperform GPT-4 and Google LLC’s Gemini Ultra.

The series includes three models that vary in their sophistication and price. The most advanced LLM, Claude 3 Opus, is touted as having “near-human levels of comprehension and fluency on complex tasks.” It’s joined by two other models, Claude 3 Sonnet and Claude 3 Haiku, that trade off some response quality for a reduction in inference cost.

All three models feature significant improvements over Anthropic’s previous flagship LLM. Compared with Claude 2.1, they are less likely to generate biased answers or reject harmless prompts that don’t breach the company’s terms of service. Another major difference is that the Claude 3 series isn’t limited to processing text: Users may also input photos, technical diagrams and other visual assets.

A prompt entered into a Claude 3 model may contain up to 200,000 tokens, units of data that each hold a few letters or numbers. According to Anthropic, all three models can theoretically ingest much larger prompts with 1 million tokens or more. The company said it “may make this available to select customers who need enhanced processing power.”

The Claude 3 series is headlined by Opus, an LLM that can answer complex questions twice as accurately as Claude 2.1. Anthropic claims this accuracy boost enables it to outperform GPT-4 and Gemini Ultra across several popular artificial intelligence benchmarks.

One of the benchmarks the company tested, GSM8K, comprises a large number of grade school math problems. Anthropic says that Opus answered 95% of the questions correctly, while Gemini Ultra and GPT-4 scored 94.4% and 92%, respectively. Opus also demonstrated a slight edge over its rivals across two other benchmark tests, MMLU and GPQA, that evaluate AI models’ familiarity with topics such as physics.

The other models in the new Claude 3 series have more limited reasoning capabilities, but will be available to customers at a lower price. They also respond to prompts more quickly.

Anthropic says that the fastest and most affordable Claude 3 model, Haiku, can read a research paper containing 10,000 tokens’ worth of information in less than three seconds. Customers also have access to a third model, Sonnet, which is positioned as a midrange option between Haiku and Opus. It’s not as quick as Haiku, but offers higher response quality and can still generate responses about twice as fast as Anthropic’s previous flagship LLM.
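To put the Haiku figure in perspective, the short Python sketch below works out the throughput implied by reading 10,000 tokens in under three seconds and checks a prompt size against the 200,000-token limit mentioned earlier. The function names are hypothetical and used only for illustration. The calculation is back-of-the-envelope arithmetic on the numbers quoted in this article, not a measurement or an Anthropic API.

```python
# Figures quoted in the article; the helper names are illustrative assumptions.
CONTEXT_LIMIT_TOKENS = 200_000   # maximum prompt size cited for Claude 3
HAIKU_PAPER_TOKENS = 10_000      # size of the research paper in Anthropic's example
HAIKU_PAPER_SECONDS = 3.0        # "less than three seconds", so this is an upper bound


def min_tokens_per_second(tokens: int, seconds: float) -> float:
    """Lower bound on read throughput if `tokens` are processed in under `seconds`."""
    return tokens / seconds


def fits_in_context(prompt_tokens: int, limit: int = CONTEXT_LIMIT_TOKENS) -> bool:
    """Return True if a prompt of `prompt_tokens` fits within the stated limit."""
    return prompt_tokens <= limit


rate = min_tokens_per_second(HAIKU_PAPER_TOKENS, HAIKU_PAPER_SECONDS)
print(f"Implied Haiku read rate: more than {rate:.0f} tokens per second")
print("150,000-token prompt fits:", fits_in_context(150_000))
print("1,000,000-token prompt fits:", fits_in_context(1_000_000))
```

The result is a floor rather than an exact speed, since Anthropic only says the paper is read in "less than" three seconds, and nothing in the article indicates whether that rate holds for longer prompts.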

Sonnet and Opus are available today through an application programming interface, as well as Anthropic’s free Claude.ai chatbot. Haiku, in turn, is set to roll out “soon.” Further down the road, Anthropic plans to enhance the Claude 3 series with additional features such as the ability to take actions in third-party applications.

The company is also bringing the LLM family to Amazon Web Services Inc.’s public cloud. Sonnet is available today via Amazon Bedrock, a managed service that provides access to foundation models from AWS and other companies. Opus and Haiku are set to follow suit soon.

Anthropic’s new AI benchmark records may soon be challenged by rivals. In November, OpenAI disclosed that it had begun developing a successor to GPT-4 with more advanced capabilities. More recently, Google detailed a new iteration of Gemini that promises to provide “dramatic improvements” over the current version and has demonstrated the ability to process prompts with up to 10 million tokens.

Image: Anthropic

