UPDATED 06:00 EDT / OCTOBER 20 2025

AI

IBM partners with Nvidia rival Groq to accelerate AI deployment

IBM Corp. and Groq Inc. today announced a strategic partnership aimed at speeding enterprise deployment of agentic artificial intelligence by combining IBM’s watsonx Orchestrate with Groq’s hardware-accelerated inference technology.

The collaboration will give IBM clients access to Groq’s language processing unit through IBM’s platform with the aim of reducing the cost of low-latency AI at large scale.

Groq is a hardware-centric AI infrastructure company specializing in inference acceleration through its proprietary LPU architecture. IBM said Groq’s custom LPU delivers more than five times faster and more cost-efficient inference than traditional graphics processing units such as those made by Nvidia Corp., maintaining consistent performance even as workloads scale up.

The companies will also extend support for virtual large language models and Red Hat’s llm-d framework for large-scale, distributed inference on Groq’s LPU architecture. IBM’s Granite models will run on GroqCloud, an inference platform that provides developers with access to high-speed, low-cost processing for large language models.

IBM said Groq provides a secure, compliant platform for AI deployment that meets the most stringent regulatory and security requirements. Watsonx Orchestrate is used to automate business processes using prebuilt or custom-designed AI agents. Integration with watsonx Orchestrate is aimed at helping IBM customers speed purpose-built agents to production faster and at lower cost.

Groq has raised $1.8 billion in funding, including a $750 million round last month that valued the company at $6.9 billion.

IBM said a combination of speed and orchestration is critical for regulated industries such as healthcare and finance. It cited one health insurance provider that uses Groq technology to handle thousands of patient and provider questions simultaneously and in real time.

“Many large enterprise organizations have a range of options with AI inferencing when they’re experimenting, but when they want to go into production, they must ensure complex workflows can be deployed successfully to ensure high-quality experiences,” Rob Thomas, IBM senior vice president of software and chief commercial officer, said in a statement.

IBM said access to Groq’s capabilities is available immediately.

Image: Groq

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.