UPDATED 14:30 EDT / MARCH 13 2025

AI

Cohere releases a low-cost AI model that requires only two GPUs

Artificial intelligence startup Cohere Inc. today unveiled Command A, its latest large language model capable of high-performance capabilities for business needs with minimal hardware requirements than competitors’ AI models.

The startup touted the LLM as capable of exceeding leading proprietary and open models such as OpenAI GPT-4o and DeepSeek-V3. The company added that in private deployments the LLM can run across two graphics processing units, Nvidia Corp.’s A100 or H100, while competing models can take up to 32.

This size differential can be important because customers that require internal deployments, such as finance and healthcare, often must place their AI models inside their firewalls. This means buying costly AI accelerator hardware and having high-performing models that can run within their enterprise perimeter is a must.

“In head-to-head human evaluation across business, STEM, and coding tasks, Command A matches or outperforms its larger and slower competitors — while offering superior throughput and increased efficiency,” Cohere said. It detailed that Command A can deliver tokens at a rate of up to 156 tokens/sec, which is 1.75x faster than GPT-4o and 2.4x faster than DeepSeek-V3.

With business use in mind, the model also has a larger context window at 256,000 tokens, which is twice the size of the industry average, including Cohere’s Command R+ model. It means that the model can ingest a sizable number of documents at once or up to a 600-page book.

“We are only training our model to make you better at your job,” Cohere co-founder Nick Frosst said. “It should feel like getting into a mech for your mind. So, we are training it to empower you. So, it should feel specifically good at that.”

The company stated that it focused on developing capabilities in the model designed to enable the scalable operation of AI agents. Agentic AI has recently become a prominent trend in the industry, aiming to create artificial intelligence systems that can analyze data, make decisions and carry out tasks with minimal or no human involvement. In practice, this requires massive amounts of compute and doing so efficiently and accurately based on company information requires well-trained AI models.

Cohere said Command A will integrate directly with its secure AI agent platform, North, which allows enterprise business users to use the full potential of their company data. The platform is designed to enable enterprise AI agents to use customer relationship management, resource planning software and other tools to automate tasks.

Image: Cohere

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.