UPDATED 10:00 EDT / MARCH 20 2026

Nvidia CEO Jensen Huang bids to own the entire AI factory stack

Nvidia dominated tech news this week, as its hold on the artificial intelligence factory boom only tightened at its annual GTC conference in San Jose.

It introduced a raft of updated chips and software as well as partnerships with just about everyone in sight. Check out our extensive coverage below for all the details, and tune in today starting at 10 a.m. PDT for a day of interviews on theCUBE with many of those partners plus analysts from theCUBE Research and elsewhere.

As his company’s market cap hovers above $4 trillion, CEO Jensen Huang (pictured) just keeps rolling, predicting revenue will double to $1 trillion — or more! — by the end of 2027 from the previous two-year period. And China chip sales are just starting up again too.

At this bigger-than-ever GTC, Huang made it clear that Nvidia is gunning to command the levers of the entire AI factory hardware and software stack, though of course it’s leaving plenty of room for other hardware providers such as Dell, Hewlett Packard Enterprise, Cisco Systems and others to use its chips in their computers and networking gear.

As Huang sees it, there’s no choice but to make sure all the pieces of the AI factory, from chips to storage to networking to AI models (including its own) to the software orchestrating everything (including its own CUDA computing platform and programming model), work together as seamlessly as possible — what he calls “extreme co-design.” “We are a vertically integrated computing company,” he declared this week. “There is no other way.” That doesn’t quite make Nvidia the Apple of AI, but they rhyme now.

The reason for this emphasis is that what matters now is less training of massive new models, which Nvidia’s graphics processing units are aces at, than inference, the process of providing answers to queries. For that, a different kind of processing works better and costs a lot less, which is why Nvidia is rushing to market later this year a language processing unit chip from Groq, whose team it acquired in December for $20 billion. It’s also why it’s pairing its own Rubin GPU with its own Vera central processing unit in a tightly integrated unit. “Inference is not a one-chip pony,” Ian Buck, VP and GM of Nvidia hyperscale and high-performance computing, told me in an interview. “Inference is harder because it has a real-time component.”

The greater emphasis on inference helped prompt an expanded deal with Amazon Web Services that includes not only 1 million GPUs but also LPUs and Nvidia’s Spectrum-X networking chips, all this even though AWS is constantly designing its own chips, networking and more.

This is especially important in the dawning age of agentic AI. Millions or even billions of agents will be talking to each other constantly and interfacing with software humans use at speeds that leave us in the dust.

Elsewhere, in another nod to the need to reduce the cost of AI inference, both OpenAI and Mistral released new, more hardware-efficient but still quite capable models.

OpenAI continued its quest for enterprise business with the planned acquisition of the Python tooling startup Astral. It needs to get cracking, because Anthropic leads in AI tools among enterprises by a mile.

Jeff Bezos wants to use AI to transform manufacturing in a wide variety of industries, and this week he reportedly gallivanted around the Middle East and Southeast Asia to raise a cool $100 billion to do it.

Next week brings two big enterprise shows that SiliconANGLE and theCUBE will be covering onsite: RSAC, the cybersecurity conference in San Francisco, for which many companies announced products in a vain attempt to beat the crowd, and KubeCon+CloudNativeCon EU in Amsterdam.

Here’s all of this week’s enterprise and emerging tech news and views from SiliconANGLE and beyond:

AI and data: Nvidia is the AI factory now

Analysis and food for thought

The AI spending flip: Anthropic is now capturing over 73% of all spending among companies buying AI tools for the first time (per Axios)

AI agent goes rogue at Meta, exposing sensitive company and user data to employees without proper access (per The Information)

The human skill that eludes AI: Why can’t language models write well? (per The Atlantic)

But they’re going to anyway: Your AI agent can now create, edit and manage content on WordPress.com Ugh. Massive slop incoming. So we’ll have AI-written posts, with headlines rewritten by Google AI, probably read by a bunch of AI agents. Who needs people?

Apple’s cheap AI bet could pay off big He hopes. Yet he could be right: Apple is way behind in AI — and still making a fortune from it (per Wall Street Journal)

Why tech bros are now obsessed with taste (per The New Yorker)

Coverage from Nvidia GTC

AI inflection point: As Nvidia’s Jensen Huang outlines vision for agents and the AI factory, he forecasts big jump in revenue

Nvidia CEO Jensen Huang reveals chip sales in China are about to restart

Special Breaking Analysis from theCUBE Research’s Dave Vellante: Nvidia moves even further down the stack: Why STX signals a new battleground in storage for AI factories

Nvidia launches NemoClaw, Agent Toolkit to enhance AI agents

Nvidia expands open AI model portfolio and enlists partners for frontier development

Nvidia reinvents the CPU for the age of agentic AI

Nvidia debuts the Groq 3 language processing unit, a dedicated inference chip for multi-agent workloads

Upping the stakes for AI infra, Nvidia launches turbocharged Vera Rubin platform

Nvidia introduces platform for large-scale AI training and inference

Dell expands AI Factory with new data platform, infrastructure and agentic AI features

HPE broadens AI factory with new Blackwell servers, private cloud upgrades and Rubin infrastructure

Nvidia builds partnerships in effort to connect AI-driven robots to the real world

Beyond the plumbing: How Cisco and Nvidia are industrializing the ‘token economy’

The AI workforce is now ‘hirable’: How Nvidia is rewiring healthcare from the inside out

Nvidia expands physical AI with communication and data processing infrastructure blueprints

Nvidia introduces BlueField-4 STX reference architecture for AI storage systems

Nvidia previews Vera Rubin Space-1 Module for orbital data centers

Akamai launches Nvidia AI Grid intelligent orchestration for distributed inference across 4,400 edge locations

Nvidia GTC 2026: Jensen Huang’s Groq ‘Mellanox moment’ and the inference land grab

GTC preview: Inside the AI factory — The $1T infrastructure war under the hood of the AI economy

Nvidia and AWS expand compute capacity in the agentic AI era: AWS will start deploying Groq LPUs and more than 1 million Nvidia GPUs, as well as its Spectrum-X networking platform.

Money matters

Jeff Bezos is planning to raise $100 billion to speed up manufacturing automation

OpenAI acquires open-source Python tooling startup Astral

IBM closes $11B deal for Confluent

IT automation startup Standard Template Labs raises $49M

Deeptune raises $43M to accelerate AI learning through virtual training gyms

BusRight bags $30M to ensure school buses always arrive, right on time

Corridor raises $25M Series A round to secure AI coding at the source

Hosted.ai raises $19M to pool GPU capacity, increasing the efficiency of neocloud infrastructure

Autoscience builds automated research lab for machine learning models with $14M

Multiply raises $9.5M to launch ‘self-learning’ advertising platform

Respan raises $5M to bring proactive observability to AI agents

New models and services

OpenAI to create desktop super app, combining ChatGPT app, browser and Codex app

OpenAI, Mistral AI release new hardware-efficient language models

Google expands availability of its Personal Intelligence tool

Snowflake previews project to automate workflows with AI agents

Snowflake invests in Bedrock Data to strengthen agentic AI system governance

Google upgrades its Stitch AI interface development tool

Vibe coding startup Cursor launches programming-optimized Composer 2 model

Workday introduces Sana: an AI knowledge discovery and work automation platform

Nutanix rolls out software solution to scale enterprise agentic AI rollouts at lower cost

Swa launches multi-agent generative AI orchestration solution for enterprise businesses

Accenture and Databricks accelerate enterprise adoption of AI applications and agents

Qualtrics adds AI-powered synthetic data and research tools to speed customer insights

The intelligent green: How AWS and the PGA Tour are reimagining the fan experience through agentic AI

Policy

Court rules Perplexity’s AI bots can stay on Amazon

Teens launch lawsuit against xAI over Grok deepfakes

Around the enterprise: AWS’ AI rocket ride

Money matters

Amazon CEO Andy Jassy forecasts cloud revenue to hit $600B by 2036, thanks to AI

On-demand GPU startup Andromeda raises funding at $1.5B valuation

Turquoise vows to bring transparent pricing and same-day payments to healthcare after raising $40M

Nebius secures $27B AI infrastructure commitment from Meta Platforms

Frore Systems scores $143M in funding at a $1.64B valuation to help AI chips run cooler

Earnings

Micron’s earnings crush forecasts on memory chip demand and it guides for an even bigger beat – but shares fall anyway

Foxconn expects AI demand to remain strong, sees limited Mideast impact

Docusign beats on revenue, outlook for next year is optimistic

New products and services

Dell workstations get major AI-focused upgrades

Hammerspace storage platform feeds distributed data directly to GPUs

Anori, new spinout from Alphabet’s X, goes after one of the world’s most expensive bureaucratic nightmares (per TechCrunch)

Policy

Super Micro co-founder, employee and contractor smuggled Nvidia chips to China, US prosecutors charge

Cyber beat: Getting out ahead of RSAC

New services

Menlo Security takes on AI agent risk with new browser security platform

Rubrik unveils Google Workspace protection offering with rapid recovery and air-gapped backups

Torq unveils Agentic Builder to automate security workflows from natural language intent

NinjaOne launches AI-driven vulnerability management to speed detection and remediation

Cato Networks rolls out Neural Edge and AI Security to protect enterprise AI workloads

1Password introduces Unified Access platform and partner API for AI agent security

Okta unveils new framework to manage AI agents and upcoming Okta for AI Agents platform

Druva launches Identity Resilience to protect Okta, Microsoft Active Directory and Entra ID environments

Panther rolls out AI SOC Platform with agents that learn and improve over time

SpecterOps adds Okta, GitHub and Mac coverage to BloodHound Enterprise platform

Theori launches Xint Code AI platform to uncover hidden vulnerabilities in massive codebases

Vanta unveils agents and enterprise features to streamline governance, risk and compliance workflows

Money matters

Oasis Security raises $120M to secure nonhuman identities across AI and cloud environments

Automated vulnerability detection startup Xbow nabs $120M

Surf AI launches agentic security operations platform with $57M funding round

RunSybil raises $40M to automate offensive security with AI agents

Cybersecurity startup Raven raises $20M for runtime application security platform

Manifold raises $8M to secure autonomous AI agents on enterprise endpoints

Attack & response

Researchers discover zero-day DarkSword exploit chain in iOS 18

Elsewhere in tech: Zuck’s metabomb

The long farewell to Mark Zuckerberg’s metaverse (per the New York Times)

Verily steps out from under Alphabet’s shadow after raising $300M to advance precision health

Amazon acquires startup Rivr to test robots for ‘doorstep delivery’

Uber to invest up to $1.25B in Rivian robotaxis

Mastercard agrees to buy stablecoin platform BVNK for up to $1.8 billion

RoboForce raises $52M to develop physical AI robots for industrial labor

Report: Meta could lay off 20% of its staff and replace many of them with AI workers

Microsoft CEO Satya Nadella is shaking up Copilot organization leadership, bringing the Copilot system across commercial and consumer together as “one unified effort,” Nadella said in a post. Jacob Andreou will lead the Copilot experience across consumer and commercial as EVP of Copilot. Ryan Roslansky, Perry Clarke and Charles Lamanna will lead M365 apps and the Copilot platform. The changes are intended to enable Mustafa Suleyman, head of Microsoft’s AI group, to focus on developing generative AI models.

Jasjeet Sekhon from Bridgewater Associates joined Google DeepMind as chief strategy officer.

Agentic AI cybersecurity platform Kai appointed Alfredo Hickman chief information security officer.

AI search infrastructure startup You.com promoted Saahil Jain to chief technology officer.

What’s next

Events

March 23-26: RSAC, San Francisco: SiliconANGLE and theCUBE will be onsite at the biggest cybersecurity conference of the year.

March 23-26: KubeCon+CloudNativeCon EU, Amsterdam: TheCUBE will be onsite with interviews and analysis, and SiliconANGLE will have the major news.

Photo: Robert Hof/SiliconANGLE
