UPDATED 11:33 EDT / MAY 21 2025

Todd Brannon, senior director of ecosystems solutions at Cisco, and Chris O'Brien, senior technical marketing director at Cisco, discuss Red Hat's llm-d project at Red Hat Summit 2025. AI

Inference at scale: Cisco and Red Hat step up with llm-d

The partnership between Red Hat Inc. and Cisco Systems Inc. has strengthened in the era of artificial intelligence, as shown by Red Hat’s announcement of llm-d.

The llm-d community was made to handle generative AI inference at scale. Cisco partnered with Red Hat on the project, extending its support for AI development using Red Hat’s open-source tools.

The set of theCUBE at Red Hat Summit 2025, where Red Hat revealed partnerships with Cisco and others on projects such as llm-d.

The set of theCUBE at Red Hat Summit 2025.

“You basically harness that immense power of an open-source ecosystem and all those contributors, but you harden it, you support it for the enterprise, and so [Red Hat] plays a critical role, and they’ve played a critical role for a long time,” said Todd Brannon (pictured, left), senior director of ecosystems solutions at Cisco. “They’ve been woven into all of our solutions over time. Originally, with networking, compute as we got into that, security. They’re really intrinsic to all the solutions that we’ve had.”

Brannon and Chris O’Brien (pictured, right), senior technical marketing director at Cisco, spoke with theCUBE’s host Rebecca Knight and theCUBE Research’s Rob Strechay at Red Hat Summit, during an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed the llm-d project, as well as other AI-focused collaborations. (* Disclosure below.)

Doing AI at scale through llm-d and more

As Cisco looks to build out infrastructure for large-scale enterprise IT, Red Hat’s suite of open source solutions has been a boon. The llm-d project, in particular, allows for more efficient and scalable inference by integrating advanced virtual large language model-based inference capabilities into existing enterprise IT infrastructures, according to Brannon.

“Inference is the point of delivery for all this AI work,” he said. “Modern applications are these constellations of microservices, and so being able to scale out horizontally is essential. That’s what that llm-d piece is doing for us. [It] help(s) us spread this across multiple nodes … It’s Red Hat taking the lead to bring really important capabilities in to help our enterprise customers scale.”

The rise of AI models has led some companies to seek more on-premises solutions or build their own private AI clouds for improved security. In response to this trend, Cisco has acquired Isovalent, Inc., a cloud native networking and security company. The goal, O’Brien emphasizes, is to give customers what they want.

“For us at Cisco, just to be point-blank, we want to enable our customers to go where they want to go,” he said. “And if it happens to be OpenShift virtualization or they want to stay with Broadcom or what, we want to bring the same consistent approach to the fabric as well as to the compute.”

Here’s the complete video interview, part of SiliconANGLE’s and theCUBE’s coverage of Red Hat Summit:

(* Disclosure: Cisco Systems Inc. sponsored this segment of theCUBE. Neither Cisco nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.