UPDATED 12:00 EST / MARCH 14, 2024

AI

Anthropic releases affordable, high-speed Claude 3 Haiku model

Anthropic PBC, an artificial intelligence startup that builds trustworthy AI models rivaling OpenAI’s GPT-4, Wednesday released Claude 3 Haiku, the newest addition to its Claude 3 family of models designed for speed and affordability.

Anthropic introduced the three-model Claude 3 family of large language models earlier in March. The company says the most advanced model, Claude 3 Opus, has processing power that rivals even the best-in-class offerings from industry giants such as OpenAI and Google LLC, while its sibling Claude 3 Sonnet balances speed and cost.

The company says that Haiku is three times faster than its peers on most workloads, making it well-suited to applications that demand sheer speed and low latency, such as customer service, fieldwork, question-and-answer tools and other scenarios where quick responses matter.

“Speed is essential for our enterprise users who need to quickly analyze large datasets and generate timely output for tasks like customer support,” the company said in the announcement. “It also generates swift output, enabling responsive, engaging chat experiences and the execution of many small tasks in tandem.”

According to Anthropic, Haiku is capable of processing up to 21,000 tokens, or around 30 pages of text, per second for prompts under 32,000 tokens.

Like the rest of the models in the Claude 3 family, Haiku is capable of responding to basic questions and requests. It has a maximum prompt size of 200,000 tokens, which is around 150,000 words, or more than 500 pages of material. The company said that all three models have enhanced capabilities when it comes to content creation, code generation and analysis, as well as improved fluency in non-English languages such as Spanish, Japanese and French.

The company also put particular focus on affordability, pricing the model with a 1:5 input-to-output token ratio designed for enterprise workloads, where longer prompts are common. Businesses often rely on LLMs to digest and analyze extremely large documents, which can drive up costs. Anthropic said the model can analyze 400 Supreme Court cases or 2,500 images for just $1.

“Businesses can rely on Haiku to quickly analyze large volumes of documents, such as quarterly filings, contracts, or legal cases, for half the cost of other models in its performance tier,” the company said.
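As a rough sanity check on those claims, here is a back-of-envelope cost estimate in Python. The per-million-token rates below reflect Haiku's published launch API pricing of roughly $0.25 for input and $1.25 for output, which is the 1:5 ratio described above, and the document sizes are illustrative assumptions rather than figures from the announcement.

```python
# Back-of-envelope cost estimate for a Claude 3 Haiku request.
# Assumes launch pricing of ~$0.25 per 1M input tokens and ~$1.25 per
# 1M output tokens (the 1:5 input-to-output ratio); document sizes
# below are illustrative assumptions, not figures from Anthropic.

INPUT_PRICE_PER_MTOK = 0.25   # USD per million input tokens (assumed)
OUTPUT_PRICE_PER_MTOK = 1.25  # USD per million output tokens (assumed)

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for a single request."""
    return (input_tokens / 1_000_000) * INPUT_PRICE_PER_MTOK + \
           (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_MTOK

# Example: summarizing a ~50-page contract (~40,000 tokens in,
# ~1,000 tokens of summary out) costs on the order of a penny.
print(f"${estimate_cost(40_000, 1_000):.4f}")  # roughly $0.011
```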

As part of the announcement, Anthropic said that Haiku is joining Sonnet on Amazon Web Services Inc.’s public cloud through Amazon Bedrock, a managed service that provides access to AI foundation models from AWS and other companies. The company said that the model will also be coming soon to Google Cloud Vertex AI, Google LLC’s platform for training and deploying generative AI models.
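For developers already on AWS, a minimal sketch of calling Haiku through Amazon Bedrock with boto3 might look like the following. The model identifier and request shape follow Bedrock’s convention for Anthropic’s Messages API at the time of writing; treat them as assumptions and confirm the exact model ID in the Bedrock console.

```python
# Minimal sketch: invoking Claude 3 Haiku via Amazon Bedrock with boto3.
import json
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

# Request body follows the Anthropic Messages API format used by Bedrock.
body = json.dumps({
    "anthropic_version": "bedrock-2023-05-31",
    "max_tokens": 512,
    "messages": [
        {"role": "user", "content": "Summarize the key risks in this filing: ..."}
    ],
})

response = bedrock.invoke_model(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",  # assumed Bedrock model ID
    contentType="application/json",
    accept="application/json",
    body=body,
)

result = json.loads(response["body"].read())
print(result["content"][0]["text"])
```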

Customers and developers can also use Haiku through the company’s application programming interface or with a Claude Pro subscription via claude.ai.
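For direct API access, a minimal sketch using Anthropic’s official Python SDK is shown below. It assumes the `anthropic` package is installed and that an API key is available in the `ANTHROPIC_API_KEY` environment variable; the prompt is illustrative.

```python
# Minimal sketch: calling Claude 3 Haiku through Anthropic's API
# using the official Python SDK (pip install anthropic).
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

message = client.messages.create(
    model="claude-3-haiku-20240307",
    max_tokens=512,
    messages=[
        {"role": "user", "content": "Answer briefly: what is a context window?"}
    ],
)

print(message.content[0].text)
```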

Image: Anthropic
