UPDATED 12:55 EDT / SEPTEMBER 27 2024

David Kanter, founder and head of MLPerf at MLCommons Association, and Adi Gangidi, RDMA systems for AI training at Meta Platforms, talk to Rakesh Kumar, senior engineering leader at Juniper Networks, about AI cluster technology at Seize the AI Moment 2024. AI

AI cluster technology driving efficient infrastructure for AI deployment

Artificial intelligence is undergoing a major transformation as AI cluster technology advances, revolutionizing how industries integrate and implement this powerful tool.

This emerging technology is not only speeding up AI processes but also making them more efficient and cost-effective, driving broader adoption and accessibility across various sectors, said David Kanter (pictured, middle), founder and head of MLPerf at MLCommons Association.

“One of the things that I’m excited about that the team has done recently is first of all, we’ve added a lot of gen AI benchmarks,” Kanter said. “Then we also added power measurement so that you can see whether it’s data center inference or training of these large-scale models, how much power and energy are you using? And we’ve seen in the five years that we’ve been around, we were able to get something like 50 times better performance, which is way faster than what we would expect.”

Kanter and Adi Gangidi (right), RDMA systems for AI training at  Meta Platforms Inc., spoke with the host Rakesh Kumar (left), senior engineering leader at Juniper Networks Inc. at the Seize the AI Moment event, during an exclusive broadcast on theCUBE, SiliconANGLE Media’s livestreaming studio. They discussed how AI cluster technology and collaborative benchmarking efforts are driving the next phase of AI adoption by enhancing infrastructure efficiency, scalability and performance. (* Disclosure below.)

Best practices in AI cluster technology for scalable performance

One of the key drivers behind this exponential growth in AI performance is collaboration between organizations, academia and engineers. MLCommons, a nonprofit industry consortium, exemplifies this approach by bringing together diverse players to create standardized benchmarks that measure AI performance, Kanter added.

“So MLCommons really got started with the MLPerf benchmarks as part of what’s bringing us all together today,” he said. “And it was sort of founded in the early days of machine learning and we didn’t have good standard ways of measuring performance. And so, we got the whole community together. We built some standard measures for AI training, which became MLPerf training.”

This collaborative spirit is echoed by Meta, a founding member of MLCommons. Meta’s commitment to benchmarking, particularly in the Chakra work group focused on improving communications performance, demonstrates the integral role of network engineering in optimizing AI performance, Gangidi concluded.

“I think benchmarking and the work that David and MLCommons team is doing is important because with benchmarks you can understand how ML models stress infrastructure or what tasks they’re able to do,” he said. “You’re able to reproduce them and repeat them. If you cannot reproduce something, then it’s hard to improve it or to make it more reliable.Bbenchmarking is a very fundamental aspect of what helps scale these clusters.”

Here’s the complete video interview, part of SiliconANGLE’s and theCUBE Research’s coverage of the Seize the AI Moment event:

Here’s the complete event video playlist:

(* Disclosure: Juniper Networks Inc. sponsored this segment of theCUBE. Neither Juniper Networks nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.