Providing a supercomputing solution for generative AI: Inside HPE and Nvidia’s collaboration
Artificial intelligence is revolutionizing the way the enterprise is approaching IT architecture, necessitating a scalable system that can handle a single workload instead of relying solely on virtualization, such as with traditional cloud-native architecture.
Working closely with Nvidia Corp., Hewlett Packard Enterprise Co. has developed an AI supercomputer in a box that allows for easy scaling, according to Justin Hotard (pictured), executive vice president and general manager of the HPC and AI Business Group at Hewlett Packard Labs. The solution comes with all necessary software and services and is liquid-cooled, catering to the need for on-premise AI solutions.
“There’s a couple of things you’re seeing with gen AI,” Hotard said. “First of all, lots of energy around foundation large language models, but there are actually different parts to generative AI. People are training models for video, for voice, for image generation … and what we realized is … one size doesn’t fit all, but people need an easy way to get started. We’re working closely with Nvidia … to build a scalable AI supercomputer in a box.”
Hotard spoke with theCUBE industry analysts Rob Strechay and Rebecca Knight at the HPE Discover Barcelona event, during an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed how HPE is emphasizing the importance of on-prem AI solutions, sustainability and responsible and ethical AI standards. (* Disclosure below.)
Growing opportunities in AI
Data location is important for training models, and it’s necessary to consider data privacy regulations and move compute to where the data is, according to Hotard.
The growing opportunities in AI are in innovation for enterprise use cases, such as retrieval augmented generation for better customer experience and accuracy. There is increasing demand for specialized AI use cases, as seen in the U.K.’s deployment of its largest supercomputer focused on AI, and an AI-native architecture starts with prioritizing data, Hotard explained.
“The U.K. government, who literally in a very short period, is going to announce and deploy their largest supercomputer, completely focused on AI with this similar stack to what we just talked about a couple weeks ago,” Hotard said. “This is an example where the U.K. realizes we need to get ahead, we need to enable AI for our scientists and accelerate, and I think we’re seeing more and more of that both in private enterprises as well as the public sector.”
Here’s the complete video interview, part of SiliconANGLE’s and theCUBE’s coverage of the HPE Discover Barcelona event:
(* Disclosure: TheCUBE is a paid media partner for the HPE Discover Barcelona event. Neither Hewlett Packard Enterprise Co., the main sponsors of theCUBE’s event coverage, nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)
A message from John Furrier, co-founder of SiliconANGLE:
Your vote of support is important to us and it helps us keep the content FREE.
One click below supports our mission to provide free, deep, and relevant content.
Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.