UPDATED 11:00 EST / AUGUST 23 2024

Nvidia to present AI and data center performance innovations at Hot Chips conference

Nvidia Corp. today revealed details about what it will discuss during the Hot Chip 2024 semiconductor technology conference in Cupertino, California, on Monday, which includes advancements to its Blackwell platform, research on liquid cooling for data centers and AI agents for chip design.

“Nvidia Blackwell is a platform, the GPU is just the beginning,” said Dave Salvator, director of accelerated computing products at Nvidia.

It comprises multiple different Nvidia chips including the Blackwell graphics processing unit, the Grace central processing unit, the Bluefield data processing unit, the ConnextX network interface card, the NVLink Switch, the Spectrum Ethernet switch and the Quantum InfiniBand switch. All work together to power large language model inference and accelerated computing.

Nvidia unveiled the Blackwell GPU architecture in March, during GPT 2024 when the company said it will be capable of running real-time generative AI models powered by colossal, 1 trillion-parameter large LLMs. It will also be able to handle them at an impressive 25 times lower cost and power consumption than Nvidia’s existing H100 GPUs, based on the older Hopper architecture.

“As we’ve seen, models grow in size over time and the fact that most generative AI applications are expected to run in real-time,” said Salvator. “The requirement for inference has gone up dramatically over the last several years. One of the things that real-time LLM inferencing needs is multiple GPUs and in the not-so-distant future multiple servers.”

An example of the Blackwell as a platform is the multi-node GB200 NVL72 solution, which provides low-latency, high-throughput token generation for extremely large LLMs. It acts as a unified system capable of delivering inference for trillion-parameter LLMs, such as GPT-MoE-1.8T, at 30 times the speed of the HGX H100 system and four times the training speed compared to the H100.

In addition to the new hardware, Nvidia will showcase the Quasar Quantization System, a new piece of software that uses Blackwell’s Transformer Engine to support high accuracy on lower precision models. Through a technique called FP4, using four bits of floating point precision per operation — new to the Blackwell processor, as Hopper had eight — models can take up less memory, perform better and still retain high accuracy.

Liquid cooling in data centers

On Sunday, Ali Heydari, director of data center cooling and infrastructure at Nvidia will present several designs for hybrid-cooled data centers. Although air cooling is common for moving heat away from servers, water is becoming a much more sustainable solution in combination with air.

Liquid-cooling techniques can move heat away from hot components more efficiently than air, which can keep components from overheating and throttling themselves and extending their lifespans. This is especially important given the bigger workloads that AI represents. Liquid-cooling systems also take up less space than air-cooling systems, Nvidia said.

One system that Nvidia will present is a warm water direct chip-to-chip approach that can deliver up to a 28% reduction in data center facility power.

“As the name implies, this system does not use chillers, which makes water cold, which uses a compressor, like a refrigerator works for instance,” said Salvator. “By going with this solution of using warm water we don’t have to use chillers and that gets us some energy savings.”

AI agents for chip design

Semiconductor chips are also a place where design quality and productivity can benefit from AI helping engineers better understand the microscopic effects of the placement of tiny circuits and the field effects on silicon.

Mark Ren, director of design and automation at Nvidia, will lead a presentation on Sunday of AI models that can assist with answering questions, generating code and debugging design problems. Nvidia has even developed an LLM to accelerate the production of Verilog code, a hardware description language used to model electronic systems, to assist engineers in building better chips.

Image: Nvidia

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

Nvidia to present AI and data center performance innovations at Hot Chips conference

Liquid cooling in data centers

AI agents for chip design

Image: Nvidia

A message from John Furrier, co-founder of SiliconANGLE:

LATEST FROM THECUBE

UPCOMING CUBE EVENTS

RECENT CUBE EVENTS

CES 2026

AWS re:Invent 2025

Microsoft Ignite 2025

SC25

Refresh North America 2025

Nvidia to present AI and data center performance innovations at Hot Chips conference

Liquid cooling in data centers

AI agents for chip design

Image: Nvidia

A message from John Furrier, co-founder of SiliconANGLE:

LATEST STORIES

LATEST STORIES

CES 2026

AWS re:Invent 2025

Microsoft Ignite 2025

SC25

Refresh North America 2025

Cookies