UPDATED 10:52 EDT / APRIL 10 2026

Anindo Sengupta, VP of product management at Nutanix, and Dan Ciruli, VP and GM, cloud-native, at Nutanix, talk to theCUBE about agentic AI infrastructure. - Nutanix .NEXT 2026 CLOUD

Nutanix expands agentic AI infrastructure platform as token costs threaten to spiral

Managing AI infrastructure across the full stack is getting more complex — and more expensive. Now, Nutanix Inc. is tackling both problems with an expanded agentic AI infrastructure platform that gives service providers and enterprises a single control plane for accelerated computing.

The expansion focuses on two additions to the company’s AI stack, according to Anindo Sengupta (pictured, left), vice president of product management at Nutanix. Service Provider Central lets providers build multi-tenant GPU clouds and sell AI service catalogs, including GPU-as-a-service and Kubernetes-as-a-service, to enterprises facing long silicon wait times. Simultaneously, a new AI gateway inside Nutanix Enterprise AI governs which agents access which models and at what cost.

“The AI gateway is really around cost and governance,” Sengupta told theCUBE. “As agents sprawl, models and tools need to be controlled and governed. What we’ve announced is the capability to really drive governance around models and tools using Nutanix’s agentic AI.”

Sengupta and Dan Ciruli (right), vice president and general manager of cloud-native at Nutanix, spoke with theCUBE’s John Furrier and co-host Alison Kosik at Nutanix .NEXT, for an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed how Nutanix is positioning its agentic AI infrastructure platform as the middleware layer between models and chips for the enterprise. (* Disclosure below.)

Agentic AI infrastructure gets a governance and cost layer

Underpinning both announcements is Nutanix Kubernetes Platform Metal, which the company describes as the only dual-native platform supporting any combination of VMs, virtualized Kubernetes and bare metal Kubernetes from a single control plane, according to Ciruli. NKP also ships with CN-AOS, an enterprise-grade storage layer, and an AI platform-as-a-service catalog of open-source AI projects announced at Nvidia GTC. This will give developers a prepackaged environment for building agentic applications.

“When you walked into the .NEXT keynote, the large font letters said ‘Run anything, anywhere,'” Ciruli said. “That’s our mission. We do want to enable that. I think that customers — enterprises — are going to find many reasons to run in service providers.”

But where enterprises choose to run their workloads will increasingly come down to one thing: cost. The economics of agentic AI will push enterprises to rethink where they run inference, Ciruli noted. A single user action in an agentic workflow can trigger hundreds of downstream agent calls, each consuming tokens at scale and driving up costs. Navigating those tradeoffs will give rise to an entirely new discipline: AI FinOps.

“Right now it’s very, very easy to get access to a model — it’s just an API call to get access to a model, but they will charge you per token,” Ciruli said. “I think customers will very quickly have to start thinking about, ‘Do we call an API where we’re going to pay per token? Do we use some infrastructure at a service provider where we’re paying for time, but then we get to generate all the tokens? Or does it make economic sense to buy some hardware, run it on-prem and now we’re just buying electricity?’ Absolutely, there’ll be AI FinOps to help you optimize that.”

Here’s the complete video interview, part of SiliconANGLE’s and theCUBE’s coverage of Nutanix .NEXT:

(* Disclosure: TheCUBE is a paid media partner for Nutanix .NEXT 2026. Sponsors of theCUBE’s event coverage do not have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.